Reader small image

You're reading from  Hadoop 2.x Administration Cookbook

Product typeBook
Published inMay 2017
PublisherPackt
ISBN-139781787126732
Edition1st Edition
Tools
Right arrow
Author (1)
Aman Singh
Aman Singh
author image
Aman Singh

Gurmukh Singh is a seasoned technology professional with 14+ years of industry experience in infrastructure design, distributed systems, performance optimization, and networks. He has worked in big data domain for the last 5 years and provides consultancy and training on various technologies. He has worked with companies such as HP, JP Morgan, and Yahoo. He has authored Monitoring Hadoop by Packt Publishing
Read more about Aman Singh

Right arrow

Hive metastore database


In this recipe, we will look at the MySQL database that is used as a metastore database. It is important to understand how the Hive-managed tables are depicted by metadata, and how the metadata database is queried to find the location of tables and their partitions.

Getting ready

For this recipe, you must have completed the Partitioning and Bucketing in Hive recipe and have a basic understanding of MySQL commands and SQL query syntax.

How to do it...

  1. Connect to the MySQL server from any node in the cluster using the following command:

    $ mysql –u hadoop –h master1.cyrus.com -p
    
  2. The username and password can be found in the hive-site.xml file.

  3. Switch to the Hive metastore database, which in our case is hive_db. There are many tables in the databases that together constitute metadata for the tables.

  4. The VERSION table stores information about the schema version, as shown in the following screenshot:

  5. The TBLS table stores information about the tables, as shown in the following...

lock icon
The rest of the page is locked
Previous PageNext Page
You have been reading a chapter from
Hadoop 2.x Administration Cookbook
Published in: May 2017Publisher: PacktISBN-13: 9781787126732

Author (1)

author image
Aman Singh

Gurmukh Singh is a seasoned technology professional with 14+ years of industry experience in infrastructure design, distributed systems, performance optimization, and networks. He has worked in big data domain for the last 5 years and provides consultancy and training on various technologies. He has worked with companies such as HP, JP Morgan, and Yahoo. He has authored Monitoring Hadoop by Packt Publishing
Read more about Aman Singh