Building a Big Data Analytics Stack [Video]

Preview in Mapt

Building a Big Data Analytics Stack [Video]

Tomasz Lelek

Learn about Big Data tools needed to create Big Data Stack

Quick links: > What will you learn?> Table of content

Mapt Subscription
FREE
$29.99/m after trial
Video
$5.00
RRP $124.99
Save 95%
What do I get with a Mapt Pro subscription?
  • Unlimited access to all Packt’s 5,000+ eBooks and Videos
  • Early Access content, Progress Tracking, and Assessments
  • 1 Free eBook or Video to download and keep every month after trial
What do I get with an eBook?
  • Download this book in EPUB, PDF, MOBI formats
  • DRM FREE - read and interact with your content when you want, where you want, and how you want
  • Access this title in the Mapt reader
What do I get with Print & eBook?
  • Get a paperback copy of the book delivered to you
  • Download this book in EPUB, PDF, MOBI formats
  • DRM FREE - read and interact with your content when you want, where you want, and how you want
  • Access this title in the Mapt reader
What do I get with a Video?
  • Download this Video course in MP4 format
  • DRM FREE - read and interact with your content when you want, where you want, and how you want
  • Access this title in the Mapt reader
$0.00
$5.00
$29.99 p/m after trial
RRP $124.99
Subscription
Video
Start 14 Day Trial

Frequently bought together


Building a Big Data Analytics Stack [Video] Book Cover
Building a Big Data Analytics Stack [Video]
$ 124.99
$ 5.00
From 0 to 1: Hive for Processing Big Data [Video] Book Cover
From 0 to 1: Hive for Processing Big Data [Video]
$ 49.99
$ 5.00
Buy 2 for $10.00
Save $164.98
Add to Cart

Video Details

ISBN 139781787125018
Course Length1 hour and 31 minutes

Video Description

Building a Big Data ecosystem is hard. There are a variety of technologies available and every one of them has its pros and cons. When building a big data pipeline for software engineers, we need to use more low-level tools and APIs such as HBase and Apache Spark.

In this course, we’ll check out HBase, a database built by optimizing on the HDFS. Moving on, we’ll have a bit of fun with Spark MLlib. Finally, you’ll get an understanding of ETL and deploy a Hadoop project to the cloud. Building Big Data Ecosystem is hard. There are a variety of technologies available and every one of them has own pros and cons. Software Engineers we need to use more low-level tools and APIs like HBase and Apache Spark while building big data pipeline.

By the end of the course, you’ll be able to use more high-level tools that have more user-friendly, declarative APIs such as Pig and Hive.

Style and Approach

This course will give you both a knowledge-based understanding and practical hands-on experience of Hadoop 2.7. It also looks at Spark, Pig, Hive, HBase, and YARN, so you can understand how to implement these components while using Hadoop clusters.

Table of Contents

Pig and Hive
The Course Overview
Introduction to Pig
Introduction to Hive
Hive Query Language
Spark Your Engines
Writing Spark Jobs
Introducing YARN
Creating Spark Job
HBase the Hadoop Database
HBase and HDFS
Using HBase Database from Java Application
Machine Learning Toolkit
Composing Spark ML Pipelines
Build a Recommendation System Using Collaborative Filtering
AWS EMR
ETL
Introducing AWS EMR
Creating S3 and EMR Cluster
Running Jobs in Series Using EMR Java API

What You Will Learn

  • Use Pig and Hive in a non-Java way to understand the power of Hadoop
  • Explore Spark and use it to stream and batch process
  • Use HBase database from Java application
  • Find out more about the machine learning toolkit and its use with Spark
  • Know how to leverage the pros of Big Data tools

Authors

Table of Contents

Pig and Hive
The Course Overview
Introduction to Pig
Introduction to Hive
Hive Query Language
Spark Your Engines
Writing Spark Jobs
Introducing YARN
Creating Spark Job
HBase the Hadoop Database
HBase and HDFS
Using HBase Database from Java Application
Machine Learning Toolkit
Composing Spark ML Pipelines
Build a Recommendation System Using Collaborative Filtering
AWS EMR
ETL
Introducing AWS EMR
Creating S3 and EMR Cluster
Running Jobs in Series Using EMR Java API

Video Details

ISBN 139781787125018
Course Length1 hour and 31 minutes
Read More

Read More Reviews

Recommended for You

From 0 to 1: Hive for Processing Big Data [Video] Book Cover
From 0 to 1: Hive for Processing Big Data [Video]
$ 49.99
$ 5.00
LaTeX A-Z: from beginner to advanced in less than 3 hours [Video] Book Cover
LaTeX A-Z: from beginner to advanced in less than 3 hours [Video]
$ 94.99
$ 5.00
Big Data Analytics with SAS Book Cover
Big Data Analytics with SAS
$ 35.99
$ 5.00
Basic Statistics and Data Mining for Data Science [Video] Book Cover
Basic Statistics and Data Mining for Data Science [Video]
$ 124.99
$ 5.00
Full Stack Kotlin Development [Video] Book Cover
Full Stack Kotlin Development [Video]
$ 124.99
$ 5.00
Infrastructure as a Service Solutions with Azure [Video] Book Cover
Infrastructure as a Service Solutions with Azure [Video]
$ 124.99
$ 5.00