You're reading from Monitoring Hadoop

Product type: Book
Published in: Apr 2015
Publisher: Packt Publishing
ISBN-13: 9781783281558
Edition: 1st Edition
Author: Aman Singh

Gurmukh Singh is a seasoned technology professional with over 14 years of industry experience in infrastructure design, distributed systems, performance optimization, and networks. He has worked in the big data domain for the last 5 years and provides consultancy and training on various technologies. He has worked with companies such as HP, JP Morgan, and Yahoo. He has authored Monitoring Hadoop, published by Packt Publishing.

Chapter 4. HDFS Checks

The Hadoop Distributed File System (HDFS) is an important component of the cluster. The File System must be in a clean state at all times, and the components related to it must be healthy.

In this chapter, we will look at HDFS checks using the Hadoop commands, and we will also discuss how to set up Nagios monitoring for them.

The following topics will be covered in this chapter:

  • Replication consistency

  • Space utilization

  • CPU utilization

  • NameNode health checks

  • Number of DataNodes in a cluster

HDFS overview


HDFS is a distributed File System designed for robustness by keeping multiple copies of each block across the File System. The metadata for the File System is stored on the NameNode, and the actual data blocks are stored on the DataNodes. For a healthy File System, the metadata must be consistent, the DataNode blocks must be clean, and replication must be consistent. Let's look at each of these one by one and learn how they can be monitored. Control communication between the NameNode and the DataNodes uses RPC, while bulk data transfer uses HDFS's own streaming transfer protocol over TCP; HDFS is also accessible over HTTP through interfaces such as WebHDFS.
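Besides RPC, the NameNode publishes its internal metrics as JSON over HTTP at the /jmx endpoint of its web UI, which is convenient for building monitors. The following is a minimal Python sketch of extracting health figures from such a response; the payload below is a made-up sample, and while the FSNamesystemState bean and field names follow Hadoop's JMX metrics, their availability varies by Hadoop version:

```python
import json

# Made-up sample of a NameNode /jmx response; real payloads contain many
# more beans and fields (names follow the FSNamesystemState MBean).
SAMPLE_JMX = """
{
  "beans": [
    {
      "name": "Hadoop:service=NameNode,name=FSNamesystemState",
      "CapacityTotal": 1000000000,
      "CapacityUsed": 250000000,
      "NumLiveDataNodes": 3,
      "NumDeadDataNodes": 1,
      "UnderReplicatedBlocks": 12
    }
  ]
}
"""

def parse_fsnamesystem(jmx_json):
    """Extract basic HDFS health figures from a NameNode /jmx response."""
    beans = json.loads(jmx_json)["beans"]
    state = next(b for b in beans if b["name"].endswith("FSNamesystemState"))
    used_pct = 100.0 * state["CapacityUsed"] / state["CapacityTotal"]
    return {
        "live_datanodes": state["NumLiveDataNodes"],
        "dead_datanodes": state["NumDeadDataNodes"],
        "under_replicated": state["UnderReplicatedBlocks"],
        "used_pct": round(used_pct, 1),
    }

health = parse_fsnamesystem(SAMPLE_JMX)
print(health)
```

In a real check, the JSON would be fetched from the NameNode web port instead of a string, and the returned figures compared against alert thresholds.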

  • HDFS checks: Hadoop natively provides commands to verify the File System. These commands must be run as the user that the HDFS daemons run as — usually hdfs, though it can be any other user — but never as root. To run these commands, the PATH variable must be set to include the path to the Hadoop binaries.

    • hadoop dfsadmin -report: This command provides an extensive report of the HDFS...
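The textual report produced by this command can also be parsed to drive simple alerts before full Nagios integration is in place. Here is a minimal Python sketch, assuming a report in the format below; the excerpt and the 80% warning threshold are illustrative, and the exact field wording varies between Hadoop releases:

```python
import re

# Hypothetical excerpt of `hadoop dfsadmin -report` output.
SAMPLE_REPORT = """\
Configured Capacity: 1000000000 (953.67 MB)
Present Capacity: 900000000 (858.31 MB)
DFS Remaining: 650000000 (619.89 MB)
DFS Used: 250000000 (238.42 MB)
DFS Used%: 27.78%
Under replicated blocks: 12
"""

def parse_report(text):
    """Pull the leading numeric value out of each report field."""
    fields = {}
    for line in text.splitlines():
        m = re.match(r"([^:]+):\s+([\d.]+)", line)
        if m:
            fields[m.group(1)] = float(m.group(2))
    return fields

stats = parse_report(SAMPLE_REPORT)
# Raise a warning when space usage crosses a threshold (80% is an example)
warn = stats["DFS Used%"] > 80.0
```

The same parsed figures can feed the replication-consistency and space-utilization checks discussed in this chapter.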

Nagios master configuration


As discussed in Chapter 1, Introduction to Monitoring, Nagios is a monitoring platform that works very well for Hadoop monitoring needs. Let's see how to configure Nagios for the Hadoop service checks.

On the Nagios server, called mnode, we need to set up the service definitions, the command definitions, and the host definitions, as shown here. These definitions enable the checks, and by using them we can gather the status of a service or a node. The plugins need to be downloaded and installed from http://www.nagios.org/download.

  • HDFS space check: Check the HDFS space usage on the cluster.

    define command{
      command_name check_hadoop_space
      command_line $USER1$/check_hadoop_namenode.pl -H $HOSTADDRESS$ -u $USER8$ -P $ARG1$ -s $ARG2$ -w $ARG3$ -c $ARG4$
    }

    define host {
      use hadoop-server
      host_name hadoopnode1
      alias Remote
      address 192.168.0.1
      contact_groups admins
    }

    Service definition:

    define service {
      use generic-service
      service_description...
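For completeness, a full service definition that ties the command to the host could look like the following sketch. The service description, port, and threshold arguments are illustrative; the bang-separated values map positionally to the $ARG1$ through $ARG4$ macros in the command definition:

```
define service {
  use                 generic-service
  host_name           hadoopnode1
  service_description HDFS Space
  check_command       check_hadoop_space!50070!space!80!90
  contact_groups      admins
}
```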

The Nagios client configuration


Every Hadoop node, whether a NameNode, DataNode, or ZooKeeper node, is a client of the Nagios server. Each node must have the NRPE plugin installed, with the check scripts placed under /usr/local/nagios/libexec and the commands specified in /usr/local/nagios/etc/nrpe.cfg, as shown here:

command[check_balancer]=/usr/local/nagios/libexec/check_hadoop_namenode.pl -H localhost -u hdfs -P 50070 -b $ARG1$
command[check_zkp]=/usr/local/nagios/libexec/check_zkpd

Similarly, entries need to be made for each check that is executed on the nodes.

In addition to the aforementioned plugins, checks must be in place for hardware, disk, CPU, and memory. You should check the number of processes running on a system by using the check_procs plugin, and check the open ports by using check_tcp. Make sure that all the nodes have ntp running and that the time is synced, by using check_ntp. All of these are provided as standard Nagios plugins, and they must be placed on each...
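These hardware and system checks can be wired into the same nrpe.cfg on each node. A sketch of such entries follows; the thresholds, the 8020 NameNode RPC port, and the NTP host are illustrative values that should be adapted to the cluster:

```
# Process count sanity check (warning/critical thresholds are examples)
command[check_total_procs]=/usr/local/nagios/libexec/check_procs -w 250 -c 400
# Verify the NameNode RPC port is listening (8020 is a common default)
command[check_nn_port]=/usr/local/nagios/libexec/check_tcp -H localhost -p 8020
# Confirm the node's clock is in sync with an NTP server
command[check_ntp]=/usr/local/nagios/libexec/check_ntp_time -H pool.ntp.org -w 0.5 -c 1
```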

Summary


In this chapter, we looked at how to set up monitoring for the HDFS components, such as HDFS space utilization, the number of DataNodes in a cluster, heap usage, replication, and the ZooKeeper state. In the next chapter, we will look at checks and monitoring for the MapReduce components, such as the JobTracker, the TaskTracker, and the various utilization parameters.
