Reader small image

You're reading from  HBase Administration Cookbook

Product typeBook
Published inAug 2012
PublisherPackt
ISBN-139781849517140
Edition1st Edition
Right arrow
Author (1)
Yifeng Jiang
Yifeng Jiang
author image
Yifeng Jiang

Yifeng Jiang is a Hadoop and HBase Administrator and Developer at Rakutenthe largest e-commerce company in Japan. After graduating from the University of Science and Technology of China with a B.S. in Information Management Systems, he started his career as a professional software engineer, focusing on Java development. In 2008, he started looking over the Hadoop project. In 2009, he led the development of his previous company's display advertisement data infrastructure using Hadoop and Hive. In 2010, he joined his current employer, where he designed and implemented the Hadoop- and HBase-based, large-scale item ranking system. He is also one of the members of the Hadoop team in the company, which operates several Hadoop/HBase clusters
Read more about Yifeng Jiang

Right arrow

HFile tool—viewing textualized HFile content


HFile is the internal file format for HBase to store its data. These are the first two lines of the description of HFile from its source code:

File format for hbase.

A file of sorted key/value pairs. Both keys and values are byte arrays.

We don't need to know the details of HFile for our administration tasks. However, by using the HFile tool, we can get some useful information from HFile.

The HFile tool provides the facility to view a textualized version of HFile content.

We can also get the metadata of an HFile file by using this tool. Some metadata, such as entry count and average Key/Value size, are important indicators of performance tuning.

We will describe how to use an HFile tool to show textualized content and metadata of HFile files.

Getting ready

Log in to your HBase client node.

Pick a region name or HFile file path to be viewed. A region name can be found in the Table Regions section of your HBase web UI. HFile files are stored under...

lock icon
The rest of the page is locked
Previous PageNext Page
You have been reading a chapter from
HBase Administration Cookbook
Published in: Aug 2012Publisher: PacktISBN-13: 9781849517140

Author (1)

author image
Yifeng Jiang

Yifeng Jiang is a Hadoop and HBase Administrator and Developer at Rakutenthe largest e-commerce company in Japan. After graduating from the University of Science and Technology of China with a B.S. in Information Management Systems, he started his career as a professional software engineer, focusing on Java development. In 2008, he started looking over the Hadoop project. In 2009, he led the development of his previous company's display advertisement data infrastructure using Hadoop and Hive. In 2010, he joined his current employer, where he designed and implemented the Hadoop- and HBase-based, large-scale item ranking system. He is also one of the members of the Hadoop team in the company, which operates several Hadoop/HBase clusters
Read more about Yifeng Jiang