A Guide for Developers and Administrators
Eric Sammer

#Hadoop
#MapReduce
#HDFS
#Cloudera
#Data
If you’ve been asked to maintain large and complex Hadoop clusters, this book is a must. Demand for operations-specific material has skyrocketed now that Hadoop is becoming the de facto standard for truly large-scale data processing in the data center. Eric Sammer, Principal Solution Architect at Cloudera, shows you the particulars of running Hadoop in production, from planning, installing, and configuring the system to providing ongoing maintenance.
Rather than run through all possible scenarios, this pragmatic operations guide calls out what works, as demonstrated in critical deployments.
Table of Contents
Chapter 1. Introduction
Chapter 2. HDFS
Chapter 3. MapReduce
Chapter 4. Planning a Hadoop Cluster
Chapter 5. Installation and Configuration
Chapter 6. Identity, Authentication, and Authorization
Chapter 7. Resource Management
Chapter 8. Cluster Maintenance
Chapter 9. Troubleshooting
Chapter 10. Monitoring
Chapter 11. Backup and Recovery
Appendix. Deprecated Configuration Properties
About the Author
Eric Sammer is an Engineering Manager and technical lead at Cloudera where he works on various projects in the Hadoop ecosystem. His background is in the development and operations of distributed, highly concurrent, data ingest and processing systems. He's been involved in the open source community and has contributed to a large number of projects over the last decade.


