Run Any Scheduled Task/Cron Super-Cheap on Google Cloud Platform

Thanks to a few new synergies within GCP products, it’s possible to get the cost of running a scheduled task down to less than a dollar a month.

November 19, 2018 · 6 min

Things About Real-World Data Science Not Discussed In MOOCs and Thought Pieces

MOOCs and thought pieces overfit to a certain style of data science that is not robust to the vast uncertainties of the real world.

October 22, 2018 · 9 min

Problems with Predicting Post Performance on Reddit and Other Link Aggregators

Algorithmic feeds like Reddit inherently produce survivorship bias: although users may recognize certain types of posts that appear on the front page, many more posts follow the same patterns but fail.

September 10, 2018 · 10 min

Analyzing IMDb Data The Intended Way, with R and ggplot2

IMDb’s big-but-not-big data has to be handled smartly, and both R and ggplot2 have neat tricks to do just that.

July 16, 2018 · 11 min

How to Quickly Train a Text-Generating Neural Network for Free

Train your own text-generating neural network and generate text whenever you want with just a few clicks!

May 18, 2018 · 9 min