Search icon
Arrow left icon
All Products
Best Sellers
New Releases
Books
Videos
Audiobooks
Learning Hub
Newsletters
Free Learning
Arrow right icon
Mastering DynamoDB

You're reading from  Mastering DynamoDB

Product type Book
Published in Aug 2014
Publisher Packt
ISBN-13 9781783551958
Pages 236 pages
Edition 1st Edition
Languages
Concepts
Author (1):
Tanmay Deshpande Tanmay Deshpande
Profile icon Tanmay Deshpande

Integrating with AWS EMR


Hadoop and Big Data is one of the most used extract, transform, and load (ETL) tools these days. Most of the companies are using it to fetch more and more information from the data available with them. But sometimes it is found that creating and maintaining the Hadoop cluster is quite a time-consuming job, especially when you don't have much exposure to the Linux/Unix environment. Also, if you need to use Hadoop in production, you would need to hire a specialist Hadoop admin, which is an overhead in terms of cost. To solve this, AWS has introduced a hosted Hadoop as a service where you just need to provide your requirement in terms of cluster configuration (number of data nodes and the size of instances based on the size of data you want to process), additional services such as Hive, Pig, and so on, if required, and once done, on a single click of the button, you have your Hadoop cluster ready.

You can find more details about how to launch Elastic MapReduce EMR cluster...

lock icon The rest of the chapter is locked
Register for a free Packt account to unlock a world of extra content!
A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.
Unlock this book and the full library FREE for 7 days
Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of
Renews at $15.99/month. Cancel anytime}