Reader small image

You're reading from  Apache Hive Essentials

Product typeBook
Published inFeb 2015
Reading LevelIntermediate
PublisherPackt
ISBN-139781783558575
Edition1st Edition
Languages
Right arrow
Author (1)
Dayong Du
Dayong Du
author image
Dayong Du

Dayong Du has all his career dedicated to enterprise data and analytics for more than 10 years, especially on enterprise use case with open source big data technology, such as Hadoop, Hive, HBase, Spark, etc. Dayong is a big data practitioner as well as author and coach. He has published the 1st and 2nd edition of Apache Hive Essential and coached lots of people who are interested to learn and use big data technology. In addition, he is a seasonal blogger, contributor, and advisor for big data start-ups, co-founder of Toronto big data professional association.
Read more about Dayong Du

Right arrow

Installing Hive from vendor packages


Right now, many companies, such as Cloudera, MapR, IBM, and Hortonworks, have packaged Hadoop into more easily manageable distributions. Each company takes a slightly different strategy, but the consensus for all of these packages is to make Hadoop easier to use for enterprise. For example, we can easily install Hive from Cloudera Distributed Hadoop (CDH), which can be downloaded from http://www.cloudera.com/content/cloudera/en/downloads/cdh.html.

Once CDH is installed to have the Hadoop environment ready, we can add Hive to the Hadoop cluster by following a few steps:

  1. Log in to the Cloudera manager and click on the dropdown button after the cluster name to choose Add a Service.

    Cloudera manager main page

  2. In the first Add Service Wizard page, choose Hive to install.

  3. In the second Add Service Wizard page, set the dependencies for the service. Sentry is the authorization policy service for Hive.

  4. In the third Add Service Wizard page, choose the proper hosts for...

lock icon
The rest of the page is locked
Previous PageNext Page
You have been reading a chapter from
Apache Hive Essentials
Published in: Feb 2015Publisher: PacktISBN-13: 9781783558575

Author (1)

author image
Dayong Du

Dayong Du has all his career dedicated to enterprise data and analytics for more than 10 years, especially on enterprise use case with open source big data technology, such as Hadoop, Hive, HBase, Spark, etc. Dayong is a big data practitioner as well as author and coach. He has published the 1st and 2nd edition of Apache Hive Essential and coached lots of people who are interested to learn and use big data technology. In addition, he is a seasonal blogger, contributor, and advisor for big data start-ups, co-founder of Toronto big data professional association.
Read more about Dayong Du