Methods for Finding Related Reddit Subreddits with Simple Set Theory

Fancy machine learning approaches may not be required to help Redditors discover new things.

June 20, 2016 · 5 min

How to Create a Network Graph Visualization of Reddit Subreddits

There is very little discussion on how to gather the data for large-scale network graph visualizations, and how to make them. It is time to fix that.

May 27, 2016 · 7 min

Creating Stylish, High-Quality Word Clouds Using Python and Font Awesome Icons

Why not make a word cloud which looks like a line chart?

May 9, 2016 · 6 min

Blockbuster Movies with Male Leads Earn More Than Those with Female Leads

On average, blockbuster movies with male leads generate 22% more domestic box office revenue, and this difference is statistically significant.

April 13, 2016 · 8 min

The Importance of Sanity-Checking Datasets Before Analysis

The 1972 TV Special ‘The Lorax’ is the best movie ever, earning $1.2 billion?

April 6, 2016 · 6 min