Run Any Scheduled Task/Cron Super-Cheap on Google Cloud Platform

Thanks to a few new synergies within GCP products, it’s possible to get the cost of running a scheduled task down to less than a dollar a month.

November 19, 2018 · 6 min

Things About Real-World Data Science Not Discussed In MOOCs and Thought Pieces

MOOCs and thought pieces overfit to a certain style of data science that is not robust to the vast uncertainties of the real world.

October 22, 2018 · 9 min

Problems with Predicting Post Performance on Reddit and Other Link Aggregators

The nature of algorithmic feeds like Reddit inherently leads to a survivorship bias: although users may recognize certain types of posts that appear on the front page, there are many more which follow the same patterns but fail.

September 10, 2018 · 10 min

Visualizing One Million NCAA Basketball Shots

Although visualizing basketball shots has been done before, this time we have access to an order of magnitude more public data to do some really cool stuff.

March 19, 2018 · 6 min

How to Make High Quality Data Visualizations for Websites With R and ggplot2

In general, it takes little additional effort to make something unique with ggplot2, and the effort is well worth it.

August 14, 2017 · 9 min