Cloudera Administration Handbook

A complete, hands-on guide to building and maintaining large Apache Hadoop clusters using Cloudera Manager and CDH5

Rohit Menon

Book Details

ISBN 13: 9781783558964
Paperback: 254 pages

Book Description

Apache Hadoop is an open source distributed computing technology that helps users process large volumes of data with relative ease and draw insights from that data. Cloudera, with its open source distribution of Hadoop, has made analytics on big data possible and accessible to anyone interested.

This book fully prepares you to be a Hadoop administrator, with special emphasis on Cloudera's CDH. It provides step-by-step instructions on setting up and managing a robust Hadoop cluster running CDH5. This book will also equip you with an understanding of tools such as Cloudera Manager, which is currently being used by many companies to manage Hadoop clusters with hundreds of nodes. You will learn how to set up security using Kerberos. You will also use Cloudera Manager to set up alerts and events that will help you monitor and troubleshoot cluster issues.

Table of Contents

Chapter 1: Getting Started with Apache Hadoop
  • History of Apache Hadoop and its trends
  • Components of Apache Hadoop
  • Understanding the Apache Hadoop daemons
  • Introducing Cloudera
  • Introducing CDH
  • Responsibilities of a Hadoop administrator
  • Summary
Chapter 2: HDFS and MapReduce
  • Essentials of HDFS
  • The read/write operational flow in HDFS
  • Understanding the namenode UI
  • Understanding the secondary namenode UI
  • Exploring HDFS commands
  • Getting acquainted with MapReduce
  • Summary
Chapter 3: Cloudera's Distribution Including Apache Hadoop
  • Getting started with CDH
  • Understanding the CDH components
  • Installing CDH
  • Installing the CDH components
  • Summary
Chapter 4: Exploring HDFS Federation and Its High Availability
  • Implementing HDFS Federation
  • Implementing HDFS High Availability
  • Jobtracker high availability
  • Summary
Chapter 5: Using Cloudera Manager
  • Introducing Cloudera Manager
  • Understanding the Cloudera Manager architecture
  • Installing Cloudera Manager
  • Navigating the Cloudera Manager Web console
  • Configuring High Availability using Cloudera Manager
  • Summary
Chapter 6: Implementing Security Using Kerberos
  • Understanding authentication and authorization
  • Introducing Kerberos
  • Understanding the Kerberos Architecture
  • Installing Kerberos
  • Configuring Kerberos for Apache Hadoop
  • Authorization in Apache Hadoop
  • Summary
Chapter 7: Managing an Apache Hadoop Cluster
  • Configuring Hadoop services using Cloudera Manager
  • Role management in Cloudera Manager
  • Managing hosts using Cloudera Manager
  • Managing multiple clusters with Cloudera Manager
  • Rebalancing a Hadoop cluster from Cloudera Manager
  • Summary
Chapter 8: Cluster Monitoring Using Events and Alerts
  • Monitoring Hadoop services from Cloudera Manager
  • Understanding events and alerts
  • Summary
Chapter 9: Configuring Backups
  • Understanding backups
  • Understanding HDFS backups
  • Using the distributed copy (DistCp)
  • Configuring backups using Cloudera Manager
  • Summary

What You Will Learn

  • Understand the Apache Hadoop architecture and the future of distributed processing frameworks
  • Use HDFS and MapReduce for all file-related operations
  • Install and configure CDH to bring up an Apache Hadoop cluster
  • Configure HDFS High Availability and HDFS Federation to prevent single points of failure
  • Install and configure Cloudera Manager to perform administrator operations
  • Implement security by installing and configuring Kerberos for all services in the cluster
  • Add, remove, and rebalance nodes in a cluster using cluster management tools
  • Understand and configure the different backup options to back up your HDFS
