Quantifying and Visualizing the Reddit Hivemind
If we can find out which topics Reddit users tend to upvote, we can identify what keywords are most attractive to the Reddit hivemind.
If we can find out which topics Reddit users tend to upvote, we can identify what keywords are most attractive to the Reddit hivemind.
With Reddit data in BigQuery, quantifying all the hundreds of millions of Reddit submissions and comments is trivial.
I constructed a database to store all Reddit Submissions from November 2007 to the end of October 2014: 142,159,793 submissions in total. And this data is very curious and very, very memetic.
I analyzed the daily number of submissions from over two years to see which events, if any, have affected Reddit’s growth rate. As it turns out, Reddit grows by itself.