Last week, Bloomberg reported that Reddit was raising about $150 Million in venture capital at a valuation of $1.7 billion. Since Reddit’s data is public on BigQuery, I quickly checked if there were any recent user engagement growth spurts which could justify such a high worth. Here’s an example BigQuery which aggregates the total number of Reddit submissions made for each month until the end of April 2017:
#standardSQL SELECT DATE_TRUNC(DATE(TIMESTAMP_SECONDS(created_utc)), MONTH) as mon, COUNT(*) as num_submissions, FROM `fh-bigquery.reddit_posts.*` WHERE (_TABLE_SUFFIX BETWEEN '2016_01' AND '2017_04' OR _TABLE_SUFFIX = 'full_corpus_201512') GROUP BY mon ORDER BY mon
As it turns out, Reddit did indeed get a large boost in activity toward the end of 2016, likely due to the heated discussions and events around the U.S. Presidential Election. But Reddit has maintained the growth rate since then, which is very appealing to potential investors.
How are other sites benefiting from Reddit’s growth? Imgur, an image-host developed to be the de facto image hosting service for Reddit, shared in Reddit’s continual growth…
…until mid-2016, when Imgur submission activity abruptly dropped. What happened?
Coincidentally in mid-2016, Reddit made itself an image host for submissions to the site. Initially limited to uploads via the iOS/Android apps, Reddit then allowed desktop users to upload images through a beta rollout starting May 24th, and a full sitewide release on June 21st.
How many Reddit-hosted image submissions are there compared to the number of Imgur submissions?
Wow, native Reddit images caught on.
Did the rise of Reddit-hosted images cause the decline of Imgur on Reddit? Let’s look at the daily number of Imgur submissions and Reddit-hosted Image submissions from December 2015 to April 2017, normalized by the total number of sitewide submissions on that day. This gives us a Reddit “market share” metric for both services.
Additionally, we can plot vertical lines representing the dates when Reddit-hosted images rolled out in the limited beta release and the full sitewide release to see if there is a link between those events and submission behavior.
Before Reddit added native image hosting, Imgur accounted for 15% of all submissions to Reddit. Now it’s below 9%. More Reddit-hosted images are being shared on Reddit than images from Imgur.
Instead of looking at all of Reddit, where spam subreddits could skew the results, we can also look at the largest image-only subreddits: /r/pics and /r/gifs, both of which were a part of the beta rollout.
Here, the impact of the two rollouts is much noticeable, with immediate increases in Reddit-hosted image market share after each rollout, and proportional decreases in Imgur market share. The growth rate after the beta release is flat for both services, but when Reddit image hosting becomes sitewide, the market shares of Reddit-hosted/Imgur images increase/decrease linearly over time once users officially learn that the native image upload functionality exists. And these trends do not appear to be slowing down.
A Silver Lining?
Obviously Imgur does not like losing a large chunk of traffic, but there’s a possibility that this outcome will be better for the business than what’s implied from the charts above.
Hosting images on the internet isn’t free, and bandwidth costs are the primary reason dedicated image hosts have died off over the years. Direct image links which show the user only the image and nothing else are convenient, but they are pure loss for the service. That’s why image hosts encourage linking to the image on a landing page of the website, filled with ads which generate an expected revenue greater than the cost of serving the image.
After a user uploads an image to Imgur on the desktop, the user is given two share links that can be submitted to sites like Reddit: an image link that goes to the image + ads, and a direct link to the image.
Recently, Imgur has pushed app downloads when visiting the site on an iOS/Android device, including disabling uploads in the mobile browser. When sharing an image from the Imgur app, the only way to share an image is through the image link, which could lead to an increase in the proportion of ad-filled Imgur image links on Reddit. Said increase could counteract the decrease in total Imgur submissions, and Imgur could actually come out ahead.
With BigQuery, we can check the percentage of all Imgur submissions to Reddit which are direct links and the percentage which are indirect/lead to a landing page, and see if the ratio changes along the same time horizon used above:
Welp. No significant change in the ratio over time, eliminating that possible silver lining.
Note that the decline of Imgur on Reddit says nothing about Imgur as a business; it’s entirely possible that Imgur’s traffic on the main site itself is sufficient for growth. But the loss of Reddit traffic certainly can’t be ignored, and it’s interesting to visualize how quickly a service can be replaced when there’s an equivalent native feature.
It’s worth nothing that new competitors in the image space such as Giphy utilize image hosting as a secondary service. Instead, they focus on building a repository of images which can be licensed and accessed programmatically by other services like Slack, Facebook, and Twitter. And Giphy has raised $150 Million total with this approach, so perhaps the image hosting market itself has indeed changed.
You are free to use the data visualizations from this article however you wish, but it would be greatly appreciated if proper attribution is given to this article and/or myself!
I am currently looking for a job in data analysis/software engineering in San Francisco. If you liked this post and have a lead, feel free to shoot me an email.
Since I currently do not have a full-time salary to subsidize my machine learning/deep learning/software/hardware needs for these blog posts, I have set up a Patreon, and any monetary contributions to the Patreon are appreciated and will be put to good creative use.