I had posted a visualization of NYC taxis using ggplot2. Due to popular demand, I've cleaned up the code and have released it open source, with a few improvements.
If we can find out which topics Reddit users tend to upvote, we can identify what keywords are most attractive to the Reddit hivemind.
With Reddit data in BigQuery, quantifying all the hundreds of millions of Reddit submissions and comments is trivial.
No, this is not an error. You can watch the video yourself on YouTube and verify the view count.
There are many patterns for numbers in passwords, which involve surprising yet intuitive logic.
I constructed a database to store all Reddit Submissions from November 2007 to the end of October 2014: 142,159,793 submissions in total. And this data is very curious and very, *very* memetic.
Hopefully, these comments will answer whether Hacker News is experiencing a rise in quality, or if the complaints levied against HN are valid.