Search icon
Arrow left icon
All Products
Best Sellers
New Releases
Books
Videos
Audiobooks
Learning Hub
Newsletters
Free Learning
Arrow right icon
Apache Hive Essentials. - Second Edition

You're reading from  Apache Hive Essentials. - Second Edition

Product type Book
Published in Jun 2018
Publisher Packt
ISBN-13 9781788995092
Pages 210 pages
Edition 2nd Edition
Languages
Author (1):
Dayong Du Dayong Du
Profile icon Dayong Du

Table of Contents (12) Chapters

Preface Overview of Big Data and Hive Setting Up the Hive Environment Data Definition and Description Data Correlation and Scope Data Manipulation Data Aggregation and Sampling Performance Considerations Extensibility Considerations Security Considerations Working with Other Tools Other Books You May Enjoy

HCatalog

HCatalog (see https://cwiki.apache.org/confluence/display/Hive/HCatalog) is a metadata management system for Hadoop data. It stores consistent schema information for Hadoop ecosystem tools, such as Pig, Hive, and MapReduce. By default, HCatalog supports data in the format of RCFile, CSV, JSON, SequenceFile, ORC file, and a customized format if InputFormat, OutputFormat, and SerDe are implemented. By using HCatalog, users are able to directly create, edit, and expose (via its REST API) metadata, which becomes effective immediately in all tools sharing the same piece of metadata. At first, HCatalog was a separate Apache project from Hive. Eventually, HCatalog became part of the Hive project in 2013 starting with Hive v0.11.0. HCatalog is built on top of the Hive metastore and incorporates support for HQL DDL. It provides read and write interfaces and HCatLoader and HCatStorer...

lock icon The rest of the chapter is locked
Register for a free Packt account to unlock a world of extra content!
A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.
Unlock this book and the full library FREE for 7 days
Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of
Renews at $15.99/month. Cancel anytime}