How to Visualize New York City Using Taxi Location Data and ggplot2

I had posted a visualization of NYC taxis using ggplot2. Due to popular demand, I’ve cleaned up the code and have released it open source, with a few improvements.

November 16, 2015 · 8 min

Quantifying and Visualizing the Reddit Hivemind

If we can find out which topics Reddit users tend to upvote, we can identify what keywords are most attractive to the Reddit hivemind.

October 9, 2015 · 6 min

How to Analyze Every Reddit Submission and Comment, in Seconds, for Free

With Reddit data in BigQuery, quantifying all the hundreds of millions of Reddit submissions and comments is trivial.

October 2, 2015 · 7 min

Coding, Visualizing, and Animating Bootstrap Resampling

I have reverse-engineered data and code with R and ggplot2 in order to create detailed implementations of bootstrapping, and also to add a few visual improvements.

September 22, 2015 · 6 min

Plotting a Map of New York City Using Only Taxi Location Data

In theory, plotting a million little points in close proximity should simulate the lines of the streets of New York City.

August 7, 2015 · 4 min