Quantifying and Visualizing the Reddit Hivemind

If we can find out which topics Reddit users tend to upvote, we can identify what keywords are most attractive to the Reddit hivemind.

October 9, 2015 · 6 min

How to Analyze Every Reddit Submission and Comment, in Seconds, for Free

With Reddit data in BigQuery, quantifying all the hundreds of millions of Reddit submissions and comments is trivial.

October 2, 2015 · 7 min

Coding, Visualizing, and Animating Bootstrap Resampling

I have reverse-engineered data and code with R and ggplot2 in order to create detailed implementations of bootstrapping, and also to add a few visual improvements.

September 22, 2015 · 6 min

Plotting a Map of New York City Using Only Taxi Location Data

In theory, plotting a million little points in close proximity should simulate the lines of the streets of New York City.

August 7, 2015 · 4 min

How to Scrape Data From Facebook Page Posts for Statistical Analysis

It is pretty easy to scrape Facebook Posts data and make into a spreadsheet for easy analysis, although there are a large number of gotchas.

July 20, 2015 · 7 min