Monitoring Hadoop

More Information
  • Install Nagios and Ganglia and understand logging at the operating system level
  • Create and configure Nagios nodes for monitoring with custom checks
  • Monitor Hadoop daemons such as NameNode, DataNode, JobTracker, and so on
  • Configure logs for various daemons and set up audits for the operations performed on the cluster
  • Track important parameters for the File System, MapReduce, and other counters
  • Set up Nagios master and client nodes with checks for the system and applications running on it
  • Configure Hadoop metrics collection and visualize the metrics for nontechnical users
  • Understand the communication between different daemons and protocols and the ports they use

With data volumes growing exponentially and more enterprises processing ever larger datasets, Hadoop has gained a lot of popularity as a data platform. Like any platform, it needs to be monitored so that administrators know how well it is working. There is an ever-increasing need to keep the Hadoop platform clean and healthy.

This book will help you to integrate Hadoop and Nagios in a seamless and easy way. At the start, the book covers the basics of operating system logging and monitoring. Getting to grips with the characteristics of Hadoop monitoring, metrics, and log collection will help Hadoop users, especially Hadoop administrators, diagnose and troubleshoot clusters better. In essence, the book teaches you how to set up an all-inclusive and robust monitoring system for the Hadoop platform. The book also serves as a quick reference to the various metrics available in Hadoop.

The book concludes with the visualization of Hadoop metrics. With the help of step-by-step instructions in each chapter, you will get acquainted with the workings of Hadoop in a short span of time.

  • Track Hadoop operations, errors, and bottlenecks efficiently
  • Employ Hadoop logging features to help manage Hadoop clusters better
  • Visualize the data collected and present it in a systematic manner
Page Count 100
Course Length 3 hours 0 minutes
ISBN 9781783281558
Date Of Publication 27 Apr 2015


Gurmukh Singh

Gurmukh Singh is a seasoned technology professional with 14+ years of industry experience in infrastructure design, distributed systems, performance optimization, and networks. He has worked in the big data domain for the last 5 years and provides consultancy and training on various technologies.

He has worked with companies such as HP, JP Morgan, and Yahoo.

He is the author of Monitoring Hadoop, published by Packt Publishing.