Apache Hadoop 3 Quick Start Guide

A fast paced guide that will help you learn about Apache Hadoop 3 and its ecosystem

Apache Hadoop 3 Quick Start Guide

Hrishikesh Karambelkar

A fast paced guide that will help you learn about Apache Hadoop 3 and its ecosystem
Packt Subscription
FREE
$9.99/m after trial
eBook
$10.00
RRP $23.99
Save 58%
Print + eBook
$29.99
RRP $29.99
What do I get with a Packt subscription?
  • Exclusive monthly discount - no contract
  • Unlimited access to entire Packt library of 6500+ eBooks and Videos
  • 120 new titles added every month, on new and emerging tech
What do I get with an eBook?
  • Download this book in EPUB, PDF, MOBI formats
  • DRM FREE - read and interact with your content when you want, where you want, and how you want
  • Access this title in the subscription reader
What do I get with Print & eBook?
  • Get a paperback copy of the book delivered to you
  • Download this book in EPUB, PDF, MOBI formats
  • DRM FREE - read and interact with your content when you want, where you want, and how you want
  • Access this title in the subscription reader
What do I get with a Video?
  • Download this Video course in MP4 format
  • DRM FREE - read and interact with your content when you want, where you want, and how you want
  • Access this title in the subscription reader
$0.00
$10.00
$29.99
$9.99 p/m after trial
RRP $23.99
RRP $29.99
Subscription
eBook
Print + eBook
Start a FREE 10-day trial

Frequently bought together


Apache Hadoop 3 Quick Start Guide Book Cover
Apache Hadoop 3 Quick Start Guide
$ 23.99
$ 10.00
Internet of Things Programming Projects Book Cover
Internet of Things Programming Projects
$ 31.99
$ 10.00
Buy 2 for $20.00
Save $35.98
Add to Cart

Book Details

ISBN 139781788999830
Paperback226 pages

Book Description

Apache Hadoop is a widely used distributed data platform. It enables large datasets to be efficiently processed instead of using one large computer to store and process the data. This book will get you started with the Hadoop ecosystem, and introduce you to the main technical topics, including MapReduce, YARN, and HDFS.

The book begins with an overview of big data and Apache Hadoop. Then, you will set up a pseudo Hadoop development environment and a multi-node enterprise Hadoop cluster. You will see how the parallel programming paradigm, such as MapReduce, can solve many complex data processing problems.

The book also covers the important aspects of the big data software development lifecycle, including quality assurance and control, performance, administration, and monitoring.

You will then learn about the Hadoop ecosystem, and tools such as Kafka, Sqoop, Flume, Pig, Hive, and HBase. Finally, you will look at advanced topics, including real time streaming using Apache Storm, and data analytics using Apache Spark.

By the end of the book, you will be well versed with different configurations of the Hadoop 3 cluster.

Table of Contents

What You Will Learn

  • Store and analyze data at scale using HDFS, MapReduce and YARN
  • Install and configure Hadoop 3 in different modes
  • Use Yarn effectively to run different applications on Hadoop based platform
  • Understand and monitor how Hadoop cluster is managed
  • Consume streaming data using Storm, and then analyze it using Spark
  • Explore Apache Hadoop ecosystem components, such as Flume, Sqoop, HBase, Hive, and Kafka

Authors

Table of Contents

Book Details

ISBN 139781788999830
Paperback226 pages
Read More

Read More Reviews

Recommended for You

Internet of Things Programming Projects Book Cover
Internet of Things Programming Projects
$ 31.99
$ 10.00
Matplotlib 3.0 Cookbook Book Cover
Matplotlib 3.0 Cookbook
$ 35.99
$ 10.00
Hands-On Machine Learning with Azure Book Cover
Hands-On Machine Learning with Azure
$ 35.99
$ 10.00
Data Science Algorithms in a Week - Second Edition Book Cover
Data Science Algorithms in a Week - Second Edition
$ 31.99
$ 10.00
Hands-On Cloud Administration in Azure Book Cover
Hands-On Cloud Administration in Azure
$ 35.99
$ 10.00
TensorFlow Reinforcement Learning Quick Start Guide Book Cover
TensorFlow Reinforcement Learning Quick Start Guide
$ 19.99
$ 10.00