Things About Real-World Data Science Not Discussed In MOOCs and Thought Pieces
MOOCs and thought pieces overfit to a certain style of data science that is not robust to the vast uncertainties of the real world.
MOOCs and thought pieces overfit to a certain style of data science that is not robust to the vast uncertainties of the real world.
The nature of algorithmic feeds like Reddit inherently leads to a survivorship bias: although users may recognize certain types of posts that appear on the front page, there are many more which follow the same patterns but fail.
For IMDb’s big-but-not-big data, you have to play with the data smartly, and both R and ggplot2 have neat tricks to do just that.
Train your own text-generating neural network and generate text whenever you want with just a few clicks!
Although visualizing basketball shots has been done before, this time we have access to an order of magnitude more public data to do some really cool stuff.