Instant Apache Hive Essentials How-to

Leverage your knowledge of SQL to easily write distributed data processing applications on Hadoop using Apache Hive
Preview in Mapt

Instant Apache Hive Essentials How-to

Darren Lee

Leverage your knowledge of SQL to easily write distributed data processing applications on Hadoop using Apache Hive
Mapt Subscription
FREE
$29.99/m after trial
eBook
$10.50
RRP $14.99
Save 29%
What do I get with a Mapt Pro subscription?
  • Unlimited access to all Packt’s 5,000+ eBooks and Videos
  • Early Access content, Progress Tracking, and Assessments
  • 1 Free eBook or Video to download and keep every month after trial
What do I get with an eBook?
  • Download this book in EPUB, PDF, MOBI formats
  • DRM FREE - read and interact with your content when you want, where you want, and how you want
  • Access this title in the Mapt reader
What do I get with Print & eBook?
  • Get a paperback copy of the book delivered to you
  • Download this book in EPUB, PDF, MOBI formats
  • DRM FREE - read and interact with your content when you want, where you want, and how you want
  • Access this title in the Mapt reader
What do I get with a Video?
  • Download this Video course in MP4 format
  • DRM FREE - read and interact with your content when you want, where you want, and how you want
  • Access this title in the Mapt reader
$0.00
$10.50
$29.99p/m after trial
RRP $14.99
Subscription
eBook
Start 30 Day Trial

Frequently bought together


Instant Apache Hive Essentials How-to Book Cover
Instant Apache Hive Essentials How-to
$ 14.99
$ 10.50
Instant Apache Solr for Indexing Data How-to Book Cover
Instant Apache Solr for Indexing Data How-to
$ 19.99
$ 14.00
Buy 2 for $24.50
Save $10.48
Add to Cart
Subscribe and access every Packt eBook & Video.
 
  • 5,000+ eBooks & Videos
  • 50+ New titles a month
  • 1 Free eBook/Video to keep every month
Start Free Trial
 

Book Details

ISBN 139781782169475
Paperback76 pages

Book Description

Hadoop provides a robust framework for building distributed applications, but working directly with Hadoop requires writing a lot of code. Adding structure to data and using a higher-level language such as SQL makes working with Hadoop both easier and faster.

"Instant Apache Hive Essentials How-to" contains a series of practical recipes that introduce the power and flexibility of Hive. Starting with your first query, this book will provide step-by-step instructions and behind-the-scenes explanations for how to effectively write MapReduce jobs with SQL.

This book looks at how Hive transforms SQL statements into MapReduce jobs and demonstrates how you can extend Hive to support your own use cases. Its recipes will teach you how to leverage the scale of Hadoop while retaining the benefits of using a structured query language.You will learn how Hive translates a query into MapReduce jobs and explore how to structure your queries for better performance. You will extend Hive to understand your own file formats, simplifying the loading of data into the warehouse. You will finally add your own custom functions to Hive to support whatever use cases you may have.

"Instant Apache Hive Essentials How-to" is a quick introduction for adding Hive to your data toolkit. It is packed with high-level instructions for making Hive work as well as drawing connections to the underlying Hadoop framework to explain how things happen.

Table of Contents

Chapter 1: Instant Apache Hive Essentials How-to
Tables and queries (Simple)
Understanding complex data types (Simple)
Using Hive non-interactively (Simple)
Join optimizations (Medium)
Setting the file format (Simple)
Writing a custom SerDe (Intermediate)
Using static partitions (Intermediate)
Using dynamic partitions (Intermediate)
Using functions (Simple)
Adding custom logic with streaming (Intermediate)
Simple user-defined functions (Intermediate)
Advanced user-defined functions (Advanced)
User-defined table-generating functions (Advanced)
User-defined aggregation functions (Advanced)

What You Will Learn

  • Start with the basics of loading data and writing your first query
  • Use de-normalized data efficiently by manipulating complex data types
  • Structure your data and queries to take advantage of Hive’s optimizations
  • Bring your own data files to Hive and teach Hive how to understand them
  • Access the specialized functions built-in to Hive to manipulate your data
  • Use Hive streaming to integrate code written in any language into your Extend Hive with user-defined functions

Authors

Table of Contents

Chapter 1: Instant Apache Hive Essentials How-to
Tables and queries (Simple)
Understanding complex data types (Simple)
Using Hive non-interactively (Simple)
Join optimizations (Medium)
Setting the file format (Simple)
Writing a custom SerDe (Intermediate)
Using static partitions (Intermediate)
Using dynamic partitions (Intermediate)
Using functions (Simple)
Adding custom logic with streaming (Intermediate)
Simple user-defined functions (Intermediate)
Advanced user-defined functions (Advanced)
User-defined table-generating functions (Advanced)
User-defined aggregation functions (Advanced)

Book Details

ISBN 139781782169475
Paperback76 pages
Read More

Read More Reviews

Recommended for You

Big Data Analytics with R and Hadoop Book Cover
Big Data Analytics with R and Hadoop
$ 29.99
$ 21.00
Hadoop Real-World Solutions Cookbook Book Cover
Hadoop Real-World Solutions Cookbook
$ 29.99
$ 21.00
Hadoop Beginner's Guide Book Cover
Hadoop Beginner's Guide
$ 29.99
$ 21.00
Practical Data Analysis Book Cover
Practical Data Analysis
$ 29.99
$ 21.00
Building Machine Learning Systems with Python Book Cover
Building Machine Learning Systems with Python
$ 29.99
$ 6.00
Fast Data Processing with Spark Book Cover
Fast Data Processing with Spark
$ 22.99
$ 16.10