After thinking about it for a while now, I’ve finally decided to start a blog. I’ll post my thesis on marketplaces, hypotheses on human behavior and general thoughts on the future.
In an era of “Data Science”, it’s easy to lose sight of simple answers to powerful questions.
To start, let’s look at a simple chart from Google’s Ngram viewer. The below chart shows the percentage occurrence of the word data in Google’s corpus of books since 1800 with a smoothing of 3 years. Trends are more apparent if you deploy a moving average – this allows to “smooth” anomalies such as seasonal variations. So in this case each data point is actually an average of the last three years.
The Y-axis shows of all the unigrams in Google Books sample, the unigram (data in this case) was used Y % of the time.
Now, why is this chart important? The use of data in the common lexicon skyrocketed in the 1900s. We also started seeing unprecedented productivity gains starting in the 1900s. The use of data allows us to make intelligent calculated decisions to set us up for success.
Of course, we must not forget : correlation doesn’t imply causation.