Before Reddit added native image hosting, Imgur accounted for 15% of all submissions to Reddit. Now it's below 9%.
Are commenters who say they are 'late to this thread' actually late?
Fancy machine learning approaches may not be required to help Redditors discover new things.
There is very little discussion of how to gather the data for large-scale network graph visualizations, or of how to make them. It is time to fix that.
If we can find out which topics Reddit users tend to upvote, we can identify what keywords are most attractive to the Reddit hivemind.
With Reddit data in BigQuery, quantifying all the hundreds of millions of Reddit submissions and comments is trivial.
I constructed a database to store all Reddit submissions from November 2007 to the end of October 2014: 142,159,793 submissions in total. And this data is very curious and very, *very* memetic.
When you click a direct image link for the first time on Facebook, Twitter, or Google+, Imgur redirects you to the main image page instead.
I analyzed the daily number of submissions across more than two years to see which events, if any, have affected Reddit's growth rate. As it turns out, Reddit grows by itself.
By tabulating the data of 37,561,369 sitewide Link submissions, I created a tree map of the Top 100 subreddits by the number of Link submissions to that subreddit.