Big Data is Hard Work

Although a Google Ngram like the one in Chapter 14 (Exhibit 14.11) offers a quick glimpse of the potential of Big Data, getting all that data ready for analysis involves a lot of hard work behind the scenes. Research conducted for a recent New York Times article estimates that this “janitor work” takes up between 50% and 80% of the time of data scientists who work with Big Data.

As with every form of social science data, good-quality datasets don’t just “happen”!

