Max Woolf's Blog
Max Woolf's Blog
Posts
Light
Dark
Automatic
Posts
Visualizing Airline Flight Characteristics Between SFO and JFK
Box plots, when used correctly, can be a very fun way to visualize big data.
October 23, 2019
8 min read
Data Visualization
,
Data Science
,
Big Data
Experiments with Making Convincing AI-Generated Fake News
Can the CTRL model create the “fake news” OpenAI was concerned about? Let’s put it to the test.
September 30, 2019
10 min read
AI
,
Text Generation
How To Make Custom AI-Generated Text With GPT-2
Thanks to gpt-2-simple and this Colaboratory Notebook, you can easily finetune GPT-2 on your own dataset!
September 4, 2019
10 min read
AI
,
Text Generation
Run Any Scheduled Task/Cron Super-Cheap on Google Cloud Platform
Thanks to a few new synergies within GCP products, it’s possible to get the cost of running a scheduled task down to less than a dollar a month.
November 19, 2018
6 min read
DevOps
,
Cost Savings
Things About Real-World Data Science Not Discussed In MOOCs and Thought Pieces
MOOCs and thought pieces overfit to a certain style of data science that is not robust to the vast uncertainties of the real world.
October 22, 2018
9 min read
Thought Piece
Problems with Predicting Post Performance on Reddit and Other Link Aggregators
The nature of algorithmic feeds like Reddit inherently leads to a survivorship bias: although users may recognize certain types of posts that appear on the front page, there are many more which follow the same patterns but fail.
September 10, 2018
10 min read
Data Visualization
,
Data Science
,
Big Data
Analyzing IMDb Data The Intended Way, with R and ggplot2
For IMDb’s big-but-not-big data, you have to play with the data smartly, and both R and ggplot2 have neat tricks to do just that.
July 16, 2018
11 min read
Data Visualization
,
Data Science
,
Big Data
How to Quickly Train a Text-Generating Neural Network for Free
Train your own text-generating neural network and generate text whenever you want with just a few clicks!
May 18, 2018
9 min read
AI
,
Text Generation
Visualizing One Million NCAA Basketball Shots
Although visualizing basketball shots has been done before, this time we have access to an order of magnitude more public data to do some really cool stuff.
March 19, 2018
6 min read
Data Science
,
Data Visualization
,
Big Data
A Visual Overview of Stack Overflow's Question Tags
I was surprised to see that all types of programming languages have quick answer times and a high probability of receiving an acceptable answer!
February 9, 2018
6 min read
Data Science
,
Data Visualization
«
»
Cite
×