You're reading from Hadoop 2.x Administration Cookbook

Product type: Book
Published in: May 2017
Publisher: Packt
ISBN-13: 9781787126732
Edition: 1st Edition
Author (1)
Aman Singh

Gurmukh Singh is a seasoned technology professional with 14+ years of industry experience in infrastructure design, distributed systems, performance optimization, and networks. He has worked in the big data domain for the last 5 years and provides consultancy and training on various technologies. He has worked with companies such as HP, JP Morgan, and Yahoo. He is the author of Monitoring Hadoop, published by Packt Publishing.

Namenode HA using Journal node


In this recipe, we configure Namenode high availability using Journal nodes, ZooKeeper, and the ZooKeeper failover controller (ZKFC). This setup provides automatic failover, rather than the manual failover discussed previously.

Getting ready

Before going through the recipes in this chapter, make sure you have two nodes that can be used as Namenodes. The following steps will guide you through the process of setting up Namenode HA using the Journal node.

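The role of the Journal nodes is to keep the Namenodes in sync and prevent split brain. Together they form a distributed store for the edit log: the active Namenode writes its edits to the Journal nodes, and the standby Namenode reads them to stay up to date.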
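To make the idea of a distributed edit store more concrete, the following is a minimal sketch and not Hadoop's actual implementation. It assumes the commonly documented quorum behavior, where an edit counts as written once a majority of Journal nodes acknowledge it. The function names are hypothetical and chosen only for illustration.

```python
# Illustrative sketch only: models the majority rule assumed for Journal nodes.
# None of these functions are part of Hadoop; the names are hypothetical.


def majority(journal_nodes: int) -> int:
    """Return the smallest number of Journal nodes that forms a majority."""
    if journal_nodes < 1:
        raise ValueError("need at least one Journal node")
    return journal_nodes // 2 + 1


def tolerated_failures(journal_nodes: int) -> int:
    """Return how many Journal nodes can be down while a majority remains."""
    return journal_nodes - majority(journal_nodes)


def edit_is_durable(acks: int, journal_nodes: int) -> bool:
    """Assumption: an edit is durable once a majority has acknowledged it."""
    return acks >= majority(journal_nodes)


if __name__ == "__main__":
    # Print the majority size and failure tolerance for a few cluster sizes.
    for n in (3, 4, 5):
        print(n, majority(n), tolerated_failures(n))
```

Under this assumption, three Journal nodes tolerate one failure, and a fourth adds no extra tolerance, which is why an odd number of Journal nodes is the usual choice. Because two Namenodes cannot both hold a majority at the same time, the same rule is what helps guard against split brain.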

How to do it...

  1. Connect to a master node in the cluster. You can start with either master node, master1 or master2; both of these will be used as Namenodes.

  2. Switch to the user hadoop.

  3. Navigate to the Hadoop configuration directory under the Hadoop installation:

    /opt/cluster/hadoop/etc/hadoop
    
  4. Edit core-site.xml to add the name service string instead of any specific Namenode address, as shown in the following...
