Hadoop Operations Paperback – 19 Oct 2012
A Guide for Developers and Administrators
About the Author
Eric Sammer is currently a Principal Solution Architect at Cloudera where he helps customers plan, deploy, develop for, and use Hadoop and the related projects at scale. His background is in the development and operations of distributed, highly concurrent, data ingest and processing systems. He's been involved in the open source community and has contributed to a large number of projects over the last decade.
Most Helpful Customer Reviews on Amazon.com (beta)
take this review as indicative of the entire book.
Whether the topic is HDFS and how data is ingested and replicated, or how Map/Reduce "finds" the most suitable
node to run its tasks on, or what the cost and performance advantages are of adopting the shared-nothing, commodity
model recommended for Hadoop clusters, etc., this book provides the how, what, when, where and why of
Hadoop (the missing manual, of sorts).
Cluster administrators as well as Map/Reduce programmers benefit from its thorough, no-shortcuts-taken breakdown of
the Hadoop platform. I highly recommend it.
This book could not have come at a better time, as the production team is looking both to contract the support team and to have a manual for Hadoop operations.
There are two things that I like about this book:
1. It covers all of the topics that matter. It covers the most important aspects of the Hadoop platform and its architecture, but from the operational perspective - HDFS architecture and cluster configuration, MapReduce and YARN execution models, cluster setup and, most importantly, a very detailed review of options and recommendations related to operating system, network and storage setup.
2. There are dedicated chapters on cluster maintenance, backups, monitoring and, very importantly, troubleshooting that go into a very solid level of detail on many of the problems and intricacies of Hadoop that one had better know about in an operational setting.
These chapters are obviously written by someone who has run Hadoop many times before, in a large production setting.
War stories and "mystery bottleneck" sections are great.
In summary, the right book at the right time, although I feel we should have had a similar book maybe a year ago. I am guessing that Cloudera wanted to get their solid cut of consulting and support fees first, before making such material available ;-) (author Eric Sammer is a Cloudera solutions architect)
That said, there is still tons of good information here on how Hadoop works and on topics like security and monitoring.
Mainly it's for admins; there is not much for developers.
A great high-level overview if you are tasked with deploying a production cluster (requirements, deployment, tuning, monitoring, backup...).
Even if you don't need to deploy a Hadoop cluster, it's a great read from a systems engineering perspective: it covers high availability concepts, network bandwidth, memory limitations... Very well explained: it first describes the limitations, then how to overcome them. It also explains the limitations of earlier Hadoop versions and how they were resolved in Hadoop 2 (high availability, federation).
I would not say this book is suitable for developers, unless they want to learn some of the admin tasks. Only a high-level overview of MapReduce is given, with nothing from a programming perspective. Other projects often used with Hadoop, such as Hive, HBase and Pig, are not covered.