Problems with Predicting Post Performance on Reddit and Other Link Aggregators
The nature of algorithmic feeds like Reddit inherently leads to a survivorship bias: although users may recognize certain types of posts that appear on the front page, there are many more which follow the same patterns but fail.
Sep 10, 2018
Analyzing IMDb Data The Intended Way, with R and ggplot2
For IMDb's big-but-not-big data, you have to play with the data smartly, and both R and ggplot2 have neat tricks to do just that.
Jul 16, 2018
How to Quickly Train a Text-Generating Neural Network for Free
Train your own text-generating neural network and generate text whenever you want with just a few clicks!
May 18, 2018
Visualizing One Million NCAA Basketball Shots
Although visualizing basketball shots has been done before, this time we have access to an order of magnitude more public data to do some really cool stuff.
Mar 19, 2018
A Visual Overview of Stack Overflow's Question Tags
I was surprised to see that all types of programming languages have quick answer times and a high probability of receiving an acceptable answer!
Feb 9, 2018
Benchmarking Modern GPUs for Maximum Cloud Cost Efficiency in Deep Learning
A 36% price cut to GPU instances, in addition to the potential new benefits offered by software and GPU updates, however, might be enough to tip the cost-efficiency scales back in favor of GPUs.
Nov 28, 2017