Hadoop: The Definitive Guide

Hadoop: The Definitive Guide [Kindle Edition]

Tom White
4.3 out of 5 stars (6 customer reviews)

Print List Price: £29.59
Kindle Price: £16.06 includes VAT* & free wireless delivery via Amazon Whispernet
You Save: £13.53 (46%)
* Unlike print books, digital books are subject to VAT.

Amazon Price New from Used from
Kindle Edition £16.06  
Paperback £22.10  

Product Description

Book Description

Storage and Analysis at Internet Scale

Ready to unlock the power of your data? With this comprehensive guide, you’ll learn how to build and maintain reliable, scalable, distributed systems with Apache Hadoop. This book is ideal for programmers looking to analyze datasets of any size, and for administrators who want to set up and run Hadoop clusters.

You’ll find illuminating case studies that demonstrate how Hadoop is used to solve specific problems. This third edition covers recent changes to Hadoop, including material on the new MapReduce API, as well as MapReduce 2 and its more flexible execution model (YARN).

  • Store large datasets with the Hadoop Distributed File System (HDFS)
  • Run distributed computations with MapReduce
  • Use Hadoop’s data and I/O building blocks for compression, data integrity, serialization (including Avro), and persistence
  • Discover common pitfalls and advanced features for writing real-world MapReduce programs
  • Design, build, and administer a dedicated Hadoop cluster—or run Hadoop in the cloud
  • Load data from relational databases into HDFS, using Sqoop
  • Perform large-scale data processing with the Pig query language
  • Analyze datasets with Hive, Hadoop’s data warehousing system
  • Take advantage of HBase for structured and semi-structured data, and ZooKeeper for building distributed systems

Product details

  • Format: Kindle Edition
  • File Size: 4901 KB
  • Print Length: 688 pages
  • Simultaneous Device Usage: Unlimited
  • Publisher: Yahoo Press; 3rd edition (10 May 2012)
  • Sold by: Amazon Media EU S.à r.l.
  • Language: English
  • ASIN: B0082FE448
  • Text-to-Speech: Enabled
  • Word Wise: Not Enabled
  • Average Customer Review: 4.3 out of 5 stars (6 customer reviews)
  • Amazon Bestsellers Rank: #65,615 Paid in Kindle Store

Customer Reviews

Most Helpful Customer Reviews
5 of 5 people found the following review helpful
4.0 out of 5 stars Great read if you have some hadoop experience 19 Feb 2013
I read this book while preparing for the CCDH exam. It's a solid all around book covering several different areas of the Hadoop ecosystem. However, it shouldn't be the first book someone reads about Hadoop. For that purpose maybe Hadoop in Action is a better fit. Once you get the high level of what Hadoop is all about, you can proceed with this book, keeping in mind that for every page in the book there is 5-10X amount of info available online.
It's a good book for covering the breadth and getting into quite some depth in each topic if you are willing to extend from the examples provided. It gets into quite some depth with aspects that matter like the inner workings of Mapper/Shuffle+Sort/Reducer etc.
Missed the 5th star for not covering the 2.0 info or even the annotations behind the API evolution which is a big point for newcomers to the ecosystem.
1 of 1 people found the following review helpful
5.0 out of 5 stars Great Book 13 Aug 2013
Format:Paperback|Verified Purchase
At first I was a little unsure about this one because the comments were a little confusing. However, I am happy that most of the comments weren't true. Having almost no knowledge of Hadoop, I was able to understand everything and would recommend it to all Hadoop beginners.
4.0 out of 5 stars Wide coverage for Hadoop Eco-System 19 April 2014
Format:Paperback|Verified Purchase
I was looking for a book that explained Hadoop & HDFS with some technical depth to understand the practical implications of building solutions on Hadoop with a starting point of unix skills, data-warehousing but zero knowledge of Hadoop or MapReduce.
This book definitely fits the bill. It provides clear explanations of HDFS, MapReduce, Hive, HBase and more, both in terms of how they work and what they are good for, and also provides some technical Java-based examples (which I have largely skipped through). The book also covers real-world implementations showing various patterns used by major Hadoop consumers that make use of the various toolsets, which for me helped to cement the ideas and strengths of the various elements.
I have already recommended this book to others.
The only reason for not rating it higher is that I cannot yet testify to the quality of the code samples.