You only have to pick up an old novel to realize that people write very differently than they did 100 years ago, or even a couple generations ago. But is it possible to quantify those differences? Could pick apart centuries' worth of literature, and track their age based only on the words on the page?

Top image:

A group of researchers have done just that. They trawled through nearly 7,750 works by more than 500 authors from the Project Gutenberg Digital Library — but only paid attention to a comparatively small handful of words. They used just 307 "content-free" words — including prepositions, articles, and conjunctions — that were commonly used throughout the centuries.


It might not surprise you to find that authors clustered stylistically to their chronal contemporaries, over 85% of authors had an associated temporal disparity of less than 37 years. As the years between writers increases, the similarities between their styles drop.

What's especially interesting is what has happened since the 1900s: alongside an explosion in the number of authors, this effect has become more and more marked. With every passing decade, contemporary authors become stylistically more similar, but this bubble of similarity covers a smaller and smaller time frame. The authors are more closely linked, but for shorter periods.


Which is to say that modern authors are less influenced by their predecessors, and much more by their immediate peers. I guess a classical education won't take you too far, any more.