Analyzing the Patterns of Numbers in 10 Million Passwords

There are many patterns for numbers in passwords, which involve surprising yet intuitive logic.

February 24, 2015 · 5 min

An Introduction on How to Make Beautiful Charts With R and ggplot2

Adding a touch of color and design can help make more compelling visualizations, thanks to ggplot2 syntax and chaining capabilities.

February 12, 2015 · 8 min

Quantifying the Clickbait and Linkbait in BuzzFeed Article Titles

You probably do not know that the 3 most interesting things I found will blow your mind.

January 15, 2015 · 6 min

Locating All the Christmas Trees on Instagram

I downloaded hundreds of thousands of #tree images and found 25,432 images which were taken on Christmas, have a #tree, and, most importantly, contain location data where the photo was taken.

January 1, 2015 · 5 min

A Statistical Analysis of 142 Million Reddit Submissions

I constructed a database to store all Reddit Submissions from November 2007 to the end of October 2014: 142,159,793 submissions in total. And this data is very curious and very, very memetic.

December 16, 2014 · 8 min