Do you even Data, bro

After thinking about it for a while now, I’ve finally decided to start a blog. I’ll post my thesis on marketplaces, hypotheses on human behavior and general thoughts on the future.

In an era of “Data Science”, it’s easy to lose sight of simple answers to powerful questions.

To start, let’s look at a simple chart from Google’s Ngram viewer. The below chart shows the percentage occurrence of the word data in Google’s corpus of books since 1800 with a smoothing of 3 years. Trends are more apparent if you deploy a moving average – this allows to “smooth” anomalies such as seasonal variations. So in this case each data point is actually an average of the last three years.

The Y-axis shows of all the unigrams in Google Books sample, the unigram (data in this case) was used Y % of the time.

Now, why is this chart important? The use of data in the common lexicon skyrocketed in the 1900s. We also started seeing unprecedented productivity gains starting in the 1900s. The use of data allows us to make intelligent calculated decisions to set us up for success.

Of course, we must not forget : correlation doesn’t imply causation.

Screen Shot 2015-05-23 at 12.35.08 PM

Advertisements

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out / Change )

Twitter picture

You are commenting using your Twitter account. Log Out / Change )

Facebook photo

You are commenting using your Facebook account. Log Out / Change )

Google+ photo

You are commenting using your Google+ account. Log Out / Change )

Connecting to %s