Perform interactive real-time analytics on large amounts of data using Cloudera Impala with Packt’s new book and eBook

March 2014 | Open Source

Packt is pleased to announce the release of Learning Cloudera Impala, a step-by-step guide to get readers started with Impala on their Hadoop cluster. With the help of this book, readers will learn to manipulate data rapidly by writing proper SQL statements, and explore the concepts of Impala security, administration, and troubleshooting in detail to maintain the Impala cluster. The print book is 150 pages long and is competitively priced at $34.99, while the eBook is available in Kindle or PDF formats for just $17.84.

About the author:

Avkash Chauhan is a software technology veteran with more than 12 years of industry experience in various disciplines such as embedded engineering, cloud computing, big data analytics, data processing, and data visualization. He has extensive global work experience with Fortune 100 companies. He spent eight years at Microsoft before moving on to Silicon Valley to work with a big data and analytics start-up. He started his career as an embedded engineer, and during his eight years at Microsoft, he worked on Windows CE, Windows Phone, Windows Azure, and HDInsight. He spent several years working with the Windows Azure team to develop world-class cloud technology, and his last project was Apache Hadoop on Windows Azure, also known as HDInsight. He worked on the HDInsight project from its incubation at Microsoft, and helped with its early development and subsequent deployment on the cloud. For the past three years, he has been working on big data- and Hadoop-related technologies, developing applications that make Hadoop easy to use for large- and mid-market companies. He is a prolific blogger at http://cloudcelebrity.wordpress.com/ and is very active on social networking sites.

Cloudera Impala is the industry’s leading massively parallel processing (MPP) SQL query engine that runs natively in Apache Hadoop. It provides a database management system for data stored in a computer cluster that runs Apache Hadoop.

With Learning Cloudera Impala, readers get to know the various ways of installing Impala in their Hadoop cluster, along with how to use the Impala Query Language and built-in functions to work with data. The book also familiarizes readers with the various input data formats in Hadoop and how to use them with Impala. This practical guide helps readers understand how third-party applications can connect with Impala to provide data visualization and various other enhancements. Through this book, readers will learn to identify and troubleshoot problems in a variety of ways, administer and fine-tune Impala for high availability, and use the Impala shell API to interact with Impala components.

This book covers the following essential topics:

Chapter 1: Getting Started with Impala

Chapter 2: The Impala Shell Commands and Interface

Chapter 3: The Impala Query Language and Built-in Functions

Chapter 4: Impala Walkthrough with an Example

Chapter 5: Impala Administration and Performance Improvements

Chapter 6: Troubleshooting Impala

Chapter 7: Advanced Impala Concepts

This book is perfect for those who really want to take advantage of their Hadoop cluster by processing extremely large amounts of raw data in Hadoop at real-time speed. Prior knowledge of Hadoop and some exposure to Hive and MapReduce are expected. To get more details on the book, please visit: http://www.packtpub.com/using-cloudera-impala/book.


