Analyzing San Francisco Crime Data to Determine When Arrests Frequently Occur
Spoilers: Most arrests in San Francisco happen Wednesdays at 4-5 PM. For some reason.
Spoilers: Most arrests in San Francisco happen Wednesdays at 4-5 PM. For some reason.
I had posted a visualization of NYC taxis using ggplot2. Due to popular demand, I’ve cleaned up the code and have released it open source, with a few improvements.
If we can find out which topics Reddit users tend to upvote, we can identify what keywords are most attractive to the Reddit hivemind.
With Reddit data in BigQuery, quantifying all the hundreds of millions of Reddit submissions and comments is trivial.
I have reverse-engineered data and code with R and ggplot2 in order to create detailed implementations of bootstrapping, and also to add a few visual improvements.