Free Sample
+ Collection

Cloudera Administration Handbook

Starting
Rohit Menon

A complete, hands-on guide to building and maintaining large Apache Hadoop clusters using Cloudera Manager and CDH5
$32.99
$54.99
RRP $32.99
RRP $54.99
eBook
Print + eBook

Want this title & more?

$16.99 p/month

Subscribe to PacktLib

Enjoy full and instant access to over 2000 books and videos – you’ll find everything you need to stay ahead of the curve and make sure you can always get the job done.

Book Details

ISBN 139781783558964
Paperback254 pages

About This Book

  • Understand the CDH architecture and its components and successfully set up a Hadoop cluster
  • Maintain, troubleshoot, and secure your cluster using Cloudera Manager
  • Easy-to-follow administrator’s guide with step-by-step explanations to help you master Apache Hadoop

Who This Book Is For

This book is great for administrators interested in setting up and managing a large Hadoop cluster. If you are an administrator, or want to be an administrator, and you are ready to build and maintain a production-level cluster running CDH5, then this book is for you.

Table of Contents

Chapter 1: Getting Started with Apache Hadoop
History of Apache Hadoop and its trends
Components of Apache Hadoop
Understanding the Apache Hadoop daemons
Introducing Cloudera
Introducing CDH
Responsibilities of a Hadoop administrator
Summary
Chapter 2: HDFS and MapReduce
Essentials of HDFS
The read/write operational flow in HDFS
Understanding the namenode UI
Understanding the secondary namenode UI
Exploring HDFS commands
Getting acquainted with MapReduce
Summary
Chapter 3: Cloudera's Distribution Including Apache Hadoop
Getting started with CDH
Understanding the CDH components
Installing CDH
Installing the CDH components
Summary
Chapter 4: Exploring HDFS Federation and Its High Availability
Implementing HDFS Federation
Implementing HDFS High Availability
Jobtracker high availability
Summary
Chapter 5: Using Cloudera Manager
Introducing Cloudera Manager
Understanding the Cloudera Manager architecture
Installing Cloudera Manager
Navigating the Cloudera Manager Web console
Configuring High Availability using Cloudera Manager
Summary
Chapter 6: Implementing Security Using Kerberos
Understanding authentication and authorization
Introducing Kerberos
Understanding the Kerberos Architecture
Installing Kerberos
Configuring Kerberos for Apache Hadoop
Authorization in Apache Hadoop
Summary
Chapter 7: Managing an Apache Hadoop Cluster
Configuring Hadoop services using Cloudera Manager
Role management in Cloudera Manager
Managing hosts using Cloudera Manager
Managing multiple clusters with Cloudera Manager
Rebalancing a Hadoop cluster from Cloudera Manager
Summary
Chapter 8: Cluster Monitoring Using Events and Alerts
Monitoring Hadoop services from Cloudera Manager
Understanding events and alerts
Summary
Chapter 9: Configuring Backups
Understanding backups
Understanding HDFS backups
Using the distributed copy (DistCp)
Configuring backups using Cloudera Manager
Summary

What You Will Learn

  • Understand the Apache Hadoop architecture and the future of distributed processing frameworks
  • Use HDFS and MapReduce for all file-related operations
  • Install and configure CDH to bring up an Apache Hadoop cluster
  • Configure HDFS High Availability and HDFS Federation to prevent single points of failure
  • Install and configure Cloudera Manager to perform administrator operations
  • Implement security by installing and configuring Kerberos for all services in the cluster
  • Add, remove, and rebalance nodes in a cluster using cluster management tools
  • Understand and configure the different backup options to back up your HDFS

In Detail

Apache Hadoop is an open source distributed computing technology that assists users in processing large volumes of data with relative ease, helping them to generate tremendous insights into their data. Cloudera, with their open source distribution of Hadoop, has made data analytics on big data possible and accessible to anyone interested.

This book fully prepares you to be a Hadoop administrator, with special emphasis on Cloudera's CDH. It provides step-by-step instructions on setting up and managing a robust Hadoop cluster running CDH5. This book will also equip you with an understanding of tools such as Cloudera Manager, which is currently being used by many companies to manage Hadoop clusters with hundreds of nodes. You will learn how to set up security using Kerberos. You will also use Cloudera Manager to set up alerts and events that will help you monitor and troubleshoot cluster issues.

Authors

Read More