FREE Delivery in the UK.
Only 7 left in stock (more on the way).
Dispatched from and sold by Amazon.
Gift-wrap available.
Hadoop in Practice has been added to your Basket
+ £2.80 UK delivery
Used: Good | Details
Condition: Used: Good
Comment: Expedited shipping available on this book. The book has been read but remains in clean condition. All pages are intact and the cover is intact. Some minor wear to the spine.
Trade in your item
Get a £0.29
Gift Card.
Have one to sell?
Flip to back Flip to front
Listen Playing... Paused   You're listening to a sample of the Audible audio edition.
Learn more
See all 2 images

Hadoop in Practice Paperback – 13 Oct 2012

See all 2 formats and editions Hide other formats and editions
Amazon Price New from Used from
"Please retry"
£21.85 £5.27

There is a newer edition of this item:

Hadoop in Practice
In stock.
£31.99 FREE Delivery in the UK. Only 7 left in stock (more on the way). Dispatched from and sold by Amazon. Gift-wrap available.

Frequently Bought Together

Hadoop in Practice + Hadoop: The Definitive Guide
Price For Both: £54.09

Buy the selected items together

Trade In this Item for up to £0.29
Trade in Hadoop in Practice for an Amazon Gift Card of up to £0.29, which you can then spend on millions of items across the site. Trade-in values may vary (terms apply). Learn more

Product details

  • Paperback: 536 pages
  • Publisher: Manning Publications; 1 edition (13 Oct. 2012)
  • Language: English
  • ISBN-10: 1617290238
  • ISBN-13: 978-1617290237
  • Product Dimensions: 18.7 x 3.3 x 23.5 cm
  • Amazon Bestsellers Rank: 711,384 in Books (See Top 100 in Books)

More About the Author

Discover books, learn about writers, and more.

Product Description

About the Author

Alex Holmes is a senior software engineer with extensive expertise in solving big data problems using Hadoop. He has presented at JavaOne and Jazoon and is a technical lead at VeriSign.

Inside This Book (Learn More)
Browse Sample Pages
Front Cover | Copyright | Table of Contents | Excerpt | Index | Back Cover
Search inside this book:

What Other Items Do Customers Buy After Viewing This Item?

Customer Reviews

There are no customer reviews yet on
5 star
4 star
3 star
2 star
1 star

Most Helpful Customer Reviews on (beta) 16 reviews
20 of 21 people found the following review helpful
Way too old 15 Dec. 2013
By Michael - Published on
Format: Paperback Verified Purchase
This book is way too old, it's only really good for giving you ideas of what to try. The versions of all the software that this book is based on are often times impossible to obtain now, many of the hadoop sites are in the habit of removing old versions of their projects so you can't download them. For example, the book is primarily based on using hadoop 0.20 and/or cloudera's hadoop version three. It's no longer possible to download those, if you have an old version then hold on to it in case you need it.

None of the installation instructions in the book will work with the newer versions of applications. In some cases the entire idea of how you would run and use a tool has changed. Also, the entire way that HDFS and Map-reduce works has changed since YARN was added, so the books explanation of that is old.

The book often omits important details like which jar you need to use for a particular piece of code. Classpath and dependency issues are always a nightmare to deal with and the book offers little help with this. He should list everything that you would put in a maven dependency. He often omits the import lines in java code, so you have little idea which class he is referring to in the code.

There are often times when he requires you to use software written by him, such as the "File Slurper" that Alex wrote. I am very wary of using any code like that, if it doesn't have the support of the apache/hadoop community then it's very likely to be out of date and unsupported sooner or later. I skipped any chapter I saw like that. I kept seeing this reference to a bash script called "" in the book, and could not figure out what he was referring to. I could find no such shell script in any software I downloaded. I think it must be a bash script in his git project, like I said I don't want to depend on any code that is not supported by the community.

There were also COUNTLESS compatibility issues I found when I tried to do anything. Almost no two pieces of hadoop software work together out of the box. It's so bad that using anything besides cloudera's hadoop was practically impossible. I am not a stupid guy either.

Here is my advice to you:
1. Use cloudera's pre-built CDH VM, at least at first. I used the CDH 4.5 pre-built VM, and that is the only thing I got to work.
2. Do not follow any installation instructions in the HIP book
3. Do not follow any installation instructions on the hadoop websites
4. Only follow installation/re-configuration instructions found in Cloudera's manual for CDH 4.5 installation
5. Do not deviate your configuration from what is norm. For example, I encountered a lot of bugs when I tried switching to java 7.
5. You might want to hold off from buying this book until a newer issue is released
6. If you use maven for dependencies, make sure you get your hadoop dependencies from the cloudera repository, not maven central
7. Instead of reading the book, just go into each of the hadoop project's websites. Skip their installation instructions like I said before, but try to follow any tutorials you see, and try to practice using everything you read.
8. After you figure out how to do everything, only then should you try to install stuff from scratch on a new VM. If you try to set up a VM on your own from the start, all the frustration will kill your motivation to learn hadoop.

The one thing this book was good for was giving me ideas of what things to try, which is why I give it two stars instead of one.
16 of 17 people found the following review helpful
85 advanced hadoop techniques 15 Nov. 2012
By Jeanne Boyarsky - Published on
Format: Paperback
"Hadoop in Practice" covers recipes/techniques for working with Hadoop. The 85 techniques range from pure Hadoop to related technologies like Mahout and Pig. There was good discussion of algorithms.

Java is definitely a pre-requisite. The book says you should have some knowledge of HDFS and MapReduce. Yet chapter one starts with "what is hadoop." It reads better as a review than an intro and doesn't fit with the rest of the book. It also assumes you haven't installed/started Hadoop. You really should read an intro book first and skim chapter one.

I particularly liked the chapters on MapReduce and performance. The overview of iostat and vmstat was clear and better than in many UNIX books. I also liked the AST explain plan. The techniques about when to use joins and sorts seemed like they would be in "Hadoop in Action" as well. Yet the comparison of different types fit well.

Each chapter begins with a conceptual overview which was very useful. The book also contains many diagrams to add clarity.

Disclosure: I received a copy of this book from the publisher in exchange for writing this review.
12 of 14 people found the following review helpful
Too much and too little 2 Jun. 2013
By J. Underwood - Published on
Format: Paperback Verified Purchase
I'm a Hadoop beginner. If you are too, this is not the book for you. The author has stuffed a bit about every conceivable Hadoop-related application of every related Apache project into this book and, beyond being a useful overview discovery guide, it is simultaneously too wide and too shallow (or brief) to be useful to those without substantial knowledge of many of the topics discussed. I'm sure some of the optimization material is useful to those with well-formed questions but this format does not lend itself well to a reader's bootstrap effort on the subject of Hadoop. In all fairness, it does not claim to. Just sayin'.
6 of 8 people found the following review helpful
Excellent Hadoop Book 11 Jan. 2013
By telescope7 - Published on
Format: Paperback
Hadoop in Practice is a very high quality and well written book and is packed with useful examples. The breadth and detail of the material is by far superior than other Hadoop reference guides. This book is perfect for someone who likes to learn new tools/technology while following pragmatical and real-world examples.

This is a must buy for any serious Hadoop user / developer..
1 of 1 people found the following review helpful
The only book you need to learn hadoop and mapreduce! 6 Feb. 2014
By Chenghai - Published on
Format: Paperback Verified Purchase
I'm a beginner and have no experience with any hadoop development before. Before I started to read this book, I had read some articles on the internet. Also read "the hadoop in action". This book really has everything I want to know and have very detailed explanation about the concepts. Every parts in the book are well organized and well written. The author's writing skill is really very amazing, he explains the very complicated concepts in a very plain and easy to understand way. No hypos. The examples in the book are also very practical and very useful and touched many different areas of where hadoop could be used. Not like the online tutorials only play with word count game. The examples in this book are real and practical!
Were these reviews helpful? Let us know