OpenStack Sahara Essentials

Integrate, deploy, rapidly configure, and successfully manage your own big data-intensive clusters in the cloud using OpenStack Sahara

OpenStack Sahara Essentials

This ebook is included in a Mapt subscription
Omar Khedher

Integrate, deploy, rapidly configure, and successfully manage your own big data-intensive clusters in the cloud using OpenStack Sahara
$0.00
$16.00
$39.99
$29.99p/m after trial
RRP $31.99
RRP $39.99
Subscription
eBook
Print + eBook
Start 30 Day Trial
Subscribe and access every Packt eBook & Video.
 
  • 4,000+ eBooks & Videos
  • 40+ New titles a month
  • 1 Free eBook/Video to keep every month
Start Free Trial
 
Code Files
Preview in Mapt

Book Details

ISBN 139781785885969
Paperback178 pages

Book Description

The Sahara project is a module that aims to simplify the building of data processing capabilities on OpenStack.

The goal of this book is to provide a focused, fast paced guide to installing, configuring, and getting started with integrating Hadoop with OpenStack, using Sahara.

The book should explain to users how to deploy their data-intensive Hadoop and Spark clusters on top of OpenStack. It will also cover how to use the Sahara REST API, how to develop applications for Elastic Data Processing on Openstack, and setting up hadoop or spark clusters on Openstack.

Table of Contents

Chapter 1: The Essence of Big Data in the Cloud
It is all about data
OpenStack crossing big data
Summary
Chapter 2: Integrating OpenStack Sahara
Preparing the test infrastructure environment
Installing OpenStack
Integrating Sahara
Summary
Chapter 3: Using OpenStack Sahara
Planning a Hadoop deployment
Creating a Hadoop cluster
Summary
Chapter 4: Executing Jobs with Sahara
Job glossary in Sahara
Running jobs in Sahara
Summary
Chapter 5: Discovering Advanced Features with Sahara
Sahara plugins
Boosting Elastic Data Processing performance
Defining the network
Increasing data reliability
Summary
Chapter 6: Hadoop High Availability Using Sahara
HDP high-availability support
CDH high-availability support
Summary
Chapter 7: Troubleshooting
Troubleshooting OpenStack
Troubleshooting data processing
Summary

What You Will Learn

  • Integrate and Install Sahara with OpenStack environment
  • Learn Sahara architecture under the hood
  • Rapidly configure and scale Hadoop clusters on top of OpenStack
  • Explore the Sahara REST API to create, deploy and manage a Hadoop cluster
  • Learn the Elastic Processing Data (EDP) facility to execute jobs in clusters from Sahara
  • Cover other Hadoop stable plugins existing supported by Sahara
  • Discover different features provided by Sahara for Hadoop provisioning and deployment
  • Learn how to troubleshoot OpenStack Sahara issues

Authors

Table of Contents

Chapter 1: The Essence of Big Data in the Cloud
It is all about data
OpenStack crossing big data
Summary
Chapter 2: Integrating OpenStack Sahara
Preparing the test infrastructure environment
Installing OpenStack
Integrating Sahara
Summary
Chapter 3: Using OpenStack Sahara
Planning a Hadoop deployment
Creating a Hadoop cluster
Summary
Chapter 4: Executing Jobs with Sahara
Job glossary in Sahara
Running jobs in Sahara
Summary
Chapter 5: Discovering Advanced Features with Sahara
Sahara plugins
Boosting Elastic Data Processing performance
Defining the network
Increasing data reliability
Summary
Chapter 6: Hadoop High Availability Using Sahara
HDP high-availability support
CDH high-availability support
Summary
Chapter 7: Troubleshooting
Troubleshooting OpenStack
Troubleshooting data processing
Summary

Book Details

ISBN 139781785885969
Paperback178 pages
Read More

Read More Reviews