• RRP: £17.86
  • You Save: £3.13 (18%)

Uncharted: Big Data as a Lens on Human Culture Hardcover – 26 Dec 2013

Product details

  • Hardcover: 280 pages
  • Publisher: Riverhead Books, a member of Penguin Group (USA) (26 Dec 2013)
  • Language: English
  • ISBN-10: 1594487456
  • ISBN-13: 978-1594487453
  • Product Dimensions: 14.6 x 2.4 x 21.7 cm
  • Average Customer Review: 4.0 out of 5 stars (2 customer reviews)
  • Amazon Bestsellers Rank: 317,629 in Books

Product Description

Uncharted "Breaking open Big Data, two Harvard scientists reveal a ground-breaking way of looking at history and culture"--

Customer Reviews

Most Helpful Customer Reviews

2 of 2 people found the following review helpful
By Mac McAleer (Top 1000 Reviewer) on 28 Jan 2014
Format: Hardcover Verified Purchase
Google has been creating its own version of the ancient Library of Alexandria by digitising books for its Google Books project. This project has faced many obstacles, none greater than American copyright law, which has extended the copyright of a book to 70 years after the author's death. Thus, a large proportion of books published in the 20th century are still under copyright. Despite this, two young researchers at Harvard convinced Google that they could access Google Books in a general way without infringing the copyright. This involved searching the text of these books for n-grams, where a 1-gram is a single word, a 2-gram is a two-word phrase, a 3-gram is a three-word phrase, etc. The results of their research included the creation of the Google Books Ngram Viewer and this book.
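
To make the n-gram idea concrete, here is a minimal Python sketch. It assumes that words are simply separated by whitespace, which is a simplification and not necessarily how the authors or Google tokenise text; the function name `ngrams` and the sample sentence are illustrative only.

```python
def ngrams(text, n):
    """Return every run of n consecutive words in text, in order."""
    # Assumption: splitting on whitespace is good enough for illustration.
    words = text.split()
    return [" ".join(words[i:i + n]) for i in range(len(words) - n + 1)]


sentence = "the house burned down"
print(ngrams(sentence, 1))  # ['the', 'house', 'burned', 'down']
print(ngrams(sentence, 2))  # ['the house', 'house burned', 'burned down']
print(ngrams(sentence, 3))  # ['the house burned', 'house burned down']
```

A text with fewer than n words simply yields an empty list.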

Most big data has been collected in the last few decades. Google Books is unusual in being both big data and long data, where long data indicates its historical reach. The authors were interested in how English grammar has changed over time, which is a perfect subject for the data held in Google Books. They investigated the frequency over time of the 1-grams "burnt" and "burned", presenting their results in a chart of frequency against time that shows how "burned" is taking over. Other questions were posed and the results charted. Near the start of the book the authors are careful to point out that correlation of data does not mean causation.
By John M. Ford TOP 500 REVIEWER on 10 July 2014
Format: Hardcover
Erez Aiden and Jean-Baptiste Michel are interested in word and phrase frequency and what it can reveal about history and culture. They illustrate their approach with a timeline graph of the phrases "The United States are" and "the United States is." We are unsurprised to see the "is" phrase increase in frequency after the Civil War, as the "are" phrase fades from view. This example supports our intuitions about allegiance to the Union supplanting allegiance to one's home state. It also builds our confidence in their historical profiling method for those other times when it finds a counterintuitive result.

The authors are confident in the value of historical word frequency analysis. "Big data is going to change the humanities, transform the social sciences, and renegotiate the relationship between the world of commerce and the ivory tower." They begin searching for larger and larger collections of text to analyze. They eventually wind up in the office of Peter Norvig, Google's Director of Research. They convince him to grant them access to Google Books, a tremendous digital library containing more books than have ever before been collected online. Not only do Aiden and Michel spend several years conducting historical-linguistic research, but they also author a tool (available at books.google.com/ngrams) that allows everyone else to do the same kind of studies.

Their book outlines how word and phrase frequency can be used to learn about cultural and historical change. It tells the story of Google Books and how the authors began to use this collection of digitized documents in their research. And it provides examples of interesting trends they have brought to light.

Most Helpful Customer Reviews on Amazon.com (beta)

Amazon.com: 53 reviews
35 of 36 people found the following review helpful
A decent book about some amazing research 7 Nov 2013
By Neurasthenic - Published on Amazon.com
Format: Hardcover (Vine Customer Review of Free Product)
I was lucky enough to read Aiden & Michel's original study, "Quantitative Analysis of Culture Using Millions of Digitized Books," when it appeared in Science on 14 January 2011. It was an astonishing piece of scholarship, one of the rare papers that divides an entire branch of human learning into "before" and "after." I felt the hair on the back of my neck rise as I read it. In essence, they mined through the Google Books database to answer concrete questions about linguistics, culture, politics, even topics such as the nature of fame and the pace of propagation of new technologies. It was a tour de force.

The title of this book, "Big Data as a Lens on Human Culture," suggests that it will be a general text on Big Data, but it is not. It covers only this body of work by these two researchers and their assistants.

The book repeats the contents of that 2011 article, explaining the results for the general public and adding some discussion of the origins of the work and the researchers' thoughts about the future. In the process, they expand the original piece, which was about six pages long excluding notes, to about 220 pages. Some of the new material is fun; I got a kick out of the story about a romance novel that had been alphabetized and the information that could still be gleaned from it. Other parts seem like padding; who cares about the history of lexical concordances?

It's a shame that Aiden & Michel wrote this book themselves; the same material coming from a third party would not have seemed so self-congratulatory and, sometimes, smug. Stylistically, the book has some flaws, including an odd 'cutesy' tone and repeated reliance on lousy puns for humor (see for example the discussion of the plague sent by God to punish King Samuel in the Old Testament, and the authors' rather forced questioning of whether our decisions will similarly "come back to plague us").

Readers who make it to the end of the book may find the last couple of chapters a little disturbing. Aiden & Michel seem to lament that anything goes unrecorded, that anything is forgotten. But forgetting is healthy and can be vital to society. Though they include a few paragraphs on possible abuses of big data, this is clearly an afterthought and I suspect these guys read about NSA databases of e-mail and text messages and thought "if only we could read those too!"

The book has ample footnotes for those who want more detail, and many excellent graphs. I wish they had provided a footnote for the software package they used to generate these; the design is quite nice while remaining unusually clear. Edward Tufte would approve.

In summary, read the original paper if you have access to it. If not, give the book a try. The original can be found in Science 14 January 2011: Vol. 331 no. 6014 pp. 176-182.
11 of 12 people found the following review helpful
A fascinating look at big data 27 Nov 2013
By Jojoleb - Published on Amazon.com
Format: Hardcover (Vine Customer Review of Free Product)
Uncharted: Big Data as a Lens on Human Culture is a fun look at a pretty amazing research project. Starting as graduate students, authors Erez Aiden and Jean-Baptiste Michel wanted to use big data to answer interesting questions. What started out as a simple research question ended up jump-starting the authors' careers and an entirely new way to look at big data.

They came up with an idea to make a tool that could query Google's digitized library in order to determine word frequencies. Using the tool they invented, called the Google Ngram Viewer, they have been able to answer interesting questions that relate to word frequencies, explore how language changes over time, assess the adoption of new technology, assess fame, and conjecture as to how the answers to the questions they pose reflect on the prevailing culture.
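
As a rough illustration of what "determining word frequencies" over time can mean, the Python sketch below computes, for each year, the share of all words that match a given word. The toy corpus, the function name `yearly_frequency`, and the whitespace tokenisation are all assumptions made for this example; they do not describe how the Ngram Viewer itself is built.

```python
from collections import Counter


def yearly_frequency(corpus, word):
    """Map each year to the fraction of that year's words equal to `word`."""
    result = {}
    for year, texts in sorted(corpus.items()):
        # Count every whitespace-separated word published in this year.
        counts = Counter(w for text in texts for w in text.split())
        total = sum(counts.values())
        result[year] = counts[word] / total if total else 0.0
    return result


# Hypothetical toy corpus: year -> texts "published" in that year.
toy_corpus = {
    1850: ["the toast was burnt", "he burnt the letter"],
    1950: ["the toast was burned", "she burnt the letter"],
    2000: ["the toast was burned", "he burned the letter"],
}

for w in ("burnt", "burned"):
    print(w, yearly_frequency(toy_corpus, w))
```

Dividing by each year's total word count matters because far more text survives from recent years than from earlier ones, so raw counts alone would make almost every word appear to grow.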

Although the idea is simple in concept, it wasn't so simple in execution. They had to wiggle their way into the Googleverse to get permission to use the database, write a lot of code, and iron out certain legal/copyright problems. But once all this was done, the magic began.

I won't go into detail about their findings, but suffice it to say, they not only created the Ngram Viewer but used it intelligently to come to some very interesting (and often humorous) conclusions. Their analogy of the Ngram Viewer as a modern equivalent of Galileo's telescope is an apt one. Without the telescope, Galileo couldn't have made some of his most important astronomical observations. Without the Ngram Viewer, it would be nearly impossible to look at things like the transformation of irregular verbs over time or to get a good idea of when writers really started to refer to The United States in the singular (the results are surprising).

However, like Galileo's telescope, the Ngram Viewer is still a somewhat primitive tool. First, it is limited by the number of books that are available in Google's digital database. (Google is trying to digitize every book in existence, but it still has a ways to go.) Second, the authors limited themselves to books only; they did not look at other printed media, such as digitized periodicals or newspapers. Third, the database is limited to the printed word and does not include usage in other media. (A fourth limitation that the authors did not mention is that Google is only trying to digitize one edition of each book. Therefore the database doesn't account for a book's popularity or circulation. Obscure scholarly books would therefore be weighted as equal with popular novels, which introduces a certain bias as well. Moreover, there are books that are quite popular that people buy and read voraciously but that have little social impact beyond a short period of time (e.g. Fifty Shades of Grey), extraordinarily popular books that lots of people may have but never read (Gödel, Escher, Bach), and books that come out in modern editions but are written in an archaic style (e.g. The Bible).)

So there are limitations. Still, if archeologists can garner incredible insights about the past by looking at the contents of ancient waste dumps, even with limitations there is a wealth of information that the Ngram Viewer can tap into that is there for the taking.

Aiden and Michel write with a great amount of scholarship, humility, and humor. The book was easy and quick to read but insightful as well. They do spend a fair amount of time on the trials and tribulations of how they developed the Ngram Viewer. This history of the Ngram Viewer takes up a fair amount of copy and is interesting to read. However, the insights that the authors are able to obtain by actually using the Ngram Viewer are far more interesting, and I would have been more than happy to read about more of them. If there is one major downside to the book, it is that the authors got me turned on to the Ngram Viewer, which is majorly addictive and can consume a lot of your time. (Once you start, it's hard to stop. Trust me, this is highly addictive and more of a time sink than Twitter or Facebook.)

All in all, an insightful, engaging, interesting, and entertaining read. Highly recommended.
8 of 9 people found the following review helpful
Interesting research, to a point 21 Nov 2013
By Bella Rosa - Published on Amazon.com
Format: Hardcover (Vine Customer Review of Free Product)
This book began as a scientific paper, and I think perhaps it should have stayed in that shorter form. The authors try to spin their ngram research to book length and it stretches somewhat thin. The beginning of the book is promising - the depiction of irregular verbs becoming regularized over time is interesting to anyone who looks forward to the word of the year, and I confess to being fascinated at what can be revealed by alphabetizing the words in a novel. Past that point, however, vignettes and anecdotes became more disjointed, and I'm not a fan of the authors' style of humor. Frankly, I enjoyed pondering the ngram graphs in the back of the book (the occurrences of "turnip" vs "tomato" over time, "slavery is" vs "slavery was", "werewolf" vs "zombie") more than the majority of the text.
10 of 13 people found the following review helpful
Is Big Data the End of Scholarship? 31 Jan 2014
By Roger Shepherd - Published on Amazon.com
Format: Kindle Edition Verified Purchase
Two young research scientists from Harvard University, Erez Aiden and Jean-Baptiste Michel, teamed up with Google in 2010 to create the Ngram Viewer. It sifts through millions of digitized books and charts the frequency with which words have been used. On the day that the Ngram Viewer debuted, more than one million queries were run through it. Some consider it to be at the center of a major revolution.

In an interview with Studio 360's Kurt Andersen, Aiden and Michel said how pleased they are that the new technology can open up academic research to the "independently curious."

"It's good that a tool that's at the leading edge of science can generate so much enthusiasm in the general public." Michel cautions, however, "it's inevitable that a tool like that will generate a large number of discussions that are actually irrelevant or that are flat-out wrong . . . it's still important that bona fide experts are the ones interpreting the research." [1]

In their new book Uncharted: Big Data as a Lens on Human Culture, however, they are nowhere near so humble about the so-called "big data revolution," nor are they convinced about the value of "bona fide experts."

"At its core, this big data revolution is about how humans create and preserve a historical record of their activities. Its consequences will transform how we look at ourselves. It will enable the creation of new scopes that make it possible for our society to more effectively probe its own nature. Big data is going to change the humanities, transform the social sciences, and renegotiate the relationship between the world of commerce and the ivory tower." [2]

Well, if for whatever reason this is going to be a contest between capital and academia, or academics versus the "independently curious," then let's hear first from the so-called "ivory tower." The following passage is from Simon Schama's introduction to his The Embarrassment of Riches: An Interpretation of Dutch Culture in the Golden Age:

". . . there is nothing especially daring about a working definition of culture drawn from social anthropology. I follow the kind of characterization offered by Mary Douglas of cultural bias as "an array of beliefs locked together into relational patterns." In the same essay, however, she cautions that for those beliefs to be considered the matrix of a culture, they should be "treated as part of the [social] action and not separated from it." I have tried to follow this rather Durkheimian command in what is, essentially, a descriptive enterprise that emphasizes social process rather than social structure, habits rather than intuitions. Acting upon one another, beliefs and customs together form what Emile Durkheim called "a determinate system that has its own life: . . . the collective or common conscience . . . it is by definition diffuse in every reach of society. Nevertheless it has specific conditions that make it a distinct reality." [3]

Now, let's hear from the big data revolutionaries:

"Consider the following question: Which would help you more if your quest was to learn about contemporary human society--unfettered access to a leading university's department of sociology, packed with experts on how societies function, or unfettered access to Facebook, a company whose goal is to help mediate human social relationships online?

"On the one hand, the members of the sociology faculty benefit from brilliant insights culled from many lifetimes dedicated to learning and study.

"On the other hand, Facebook is part of the day-to-day social lives of a billion people. It knows where they live and work, where they play and with whom, what they like, when they get sick, and what they talk about with their friends. So the answer to our question may very well be Facebook. And if it isn't--yet--then what about a world twenty years down the line, when Facebook or some other site like it stores ten thousand times as much information, about every single person on the planet?" [4]

Aside from the vague and uninformed illogicality that pervades Uncharted, I am particularly struck by the air of self-congratulatory triumph that permeates the entire book, suggesting that big data has already won--hands down.

Why are so many enthralled by this stuff? All I can say is, "In the land of the blind, the one-eyed man is king."

[1] from Studio 360, Public Radio International, broadcast August 9, 2013.

[2] Aiden, Erez; Michel, Jean-Baptiste (2013-12-26). Uncharted: Big Data as a Lens on Human Culture (Kindle Locations 133-137). Penguin Group US. Kindle Edition.

[3] Simon Schama. The Embarrassment of Riches: An Interpretation of Dutch Culture in the Golden Age. New York: Random House, 1987, p. 9.

[4] Aiden, Erez; Michel, Jean-Baptiste (2013-12-26). Uncharted: Big Data as a Lens on Human Culture (Kindle Locations 185-189). Penguin Group US. Kindle Edition.
5 of 6 people found the following review helpful
Try It Yourself: Mining Google Books for History of Ideas, Language, and Culture 13 Dec 2013
By Book Fan - Published on Amazon.com
Format: Hardcover (Vine Customer Review of Free Product)
This is essentially an interactive book, because as you read, you can go to the Ngram site and try out your own queries (or try the link in my comment below). This book is a lot of fun to read and is also quite interesting, though it could be better (see below).

The Google Books Ngram Viewer has access to the words and phrases used in a significant chunk of Google's digitized book archive. By using it, you can ask questions about things such as the changing use of language, the rise and fall in popularity of ideas or celebrities, and so forth. For example, I found that early uses of the phrase "rock and roll" were in accounts of traveling in rickety wagons, and in sailor songs. It is a great way of viewing social and cultural history through the lens of big data.

This book consists of some of the interesting ideas the researchers uncovered, alternating with the story of how the Ngram Viewer came about and the issues they dealt with in doing so, such as privacy, the copyrights of the authors, access to the archive of information, and so forth. Along the way they draw on a number of interesting episodes of history, e.g. Helen Keller's open letter to the German people in the 1930s. Or, they analyze the corpus to show that there are over a million words used in the English language, but the Oxford English Dictionary has only about half a million of them. It is instructive to see how they wring information out of this data.

Also, they discuss some of the foibles of the data. For example, one of the most mentioned people is an academic whom no one has heard of; this is because published books are skewed towards academic content. The book closes with a brief discussion of how access to big data changes the questions we can ask and what is knowable. In the appendix they show charts of additional comparisons, but at that point, it is really more interesting to go to the site and input your own queries.

For an example of the kind of results it produces, and how easy it is to use, look at the link I've put in the comment section below, where mentions of different book review publications over time are charted, along with mentions of Amazon.

A glaring omission is the lack of any information about how to do advanced searches, such as constraining words by part of speech, using wildcards, or what "smoothing" means. However, googling for "ngram advanced" will lead you to the online documentation. Additionally, I wouldn't have minded some more technical information about how things were implemented. Also, at points, the book gets a bit diary-like, and could use some tightening up.

However, setting tools like this loose for anyone to use is a game-changer, and thus for those interested, it is a five star book. Try it yourself!