Fun and Dystopia With AI-Based Code Generation Using GPT-J-6B

At the least, AI-generated code is much more readable than the average human’s.

June 14, 2021 · 16 min

Visualizing Airline Flight Characteristics Between SFO and JFK

Box plots, when used correctly, can be a very fun way to visualize big data.

October 23, 2019 · 8 min

Run Any Scheduled Task/Cron Super-Cheap on Google Cloud Platform

Thanks to a few new synergies within GCP products, it’s possible to get the cost of running a scheduled task down to less than a dollar a month.

November 19, 2018 · 6 min

Problems with Predicting Post Performance on Reddit and Other Link Aggregators

The nature of algorithmic feeds like Reddit inherently leads to a survivorship bias: although users may recognize certain types of posts that appear on the front page, there are many more which follow the same patterns but fail.

September 10, 2018 · 10 min

Analyzing IMDb Data The Intended Way, with R and ggplot2

For IMDb’s big-but-not-big data, you have to play with the data smartly, and both R and ggplot2 have neat tricks to do just that.

July 16, 2018 · 11 min