Big Data Processing using Apache Spark [Video]

Preview in Mapt

Big Data Processing using Apache Spark [Video]

Tomasz Lelek

1 customer reviews
Leverage one of the most efficient and widely adopted Big Data processing framework - Apache Spark

Quick links: > What will you learn?> Table of content

Mapt Subscription
FREE
$29.99/m after trial
Video
$106.25
RRP $124.99
Save 14%
What do I get with a Mapt Pro subscription?
  • Unlimited access to all Packt’s 5,000+ eBooks and Videos
  • Early Access content, Progress Tracking, and Assessments
  • 1 Free eBook or Video to download and keep every month after trial
What do I get with an eBook?
  • Download this book in EPUB, PDF, MOBI formats
  • DRM FREE - read and interact with your content when you want, where you want, and how you want
  • Access this title in the Mapt reader
What do I get with Print & eBook?
  • Get a paperback copy of the book delivered to you
  • Download this book in EPUB, PDF, MOBI formats
  • DRM FREE - read and interact with your content when you want, where you want, and how you want
  • Access this title in the Mapt reader
What do I get with a Video?
  • Download this Video course in MP4 format
  • DRM FREE - read and interact with your content when you want, where you want, and how you want
  • Access this title in the Mapt reader
$0.00
$106.25
$29.99 p/m after trial
RRP $124.99
Subscription
Video
Start 14 Day Trial

Frequently bought together


Big Data Processing using Apache Spark [Video] Book Cover
Big Data Processing using Apache Spark [Video]
$ 124.99
$ 106.25
From 0 to 1: Hive for Processing Big Data [Video] Book Cover
From 0 to 1: Hive for Processing Big Data [Video]
$ 49.99
$ 42.50
Buy 2 for $35.00
Save $139.98
Add to Cart

Video Details

ISBN 139781788398367
Course Length1 hour and 24 minutes

Video Description

Every year we have a big increment of data that we need to store and analyze. When we want to aggregate all data about our users and analyze that data to find insights from it, terabytes of data undergo processing. To be able to process such amounts of data, we need to use a technology that can distribute multiple computations and make them more efficient. Apache Spark is a technology that allows us to process big data leading to faster and scalable processing.

In this course, we will learn how to leverage Apache Spark to be able to process big data quickly. We will cover the basics of Spark API and its architecture in detail. In the second section of the course, we will learn about Data Mining and Data Cleaning, wherein we will look at the Input Data Structure and how Input data is loaded In the third section we will be writing actual jobs that analyze data. By the end of the course, you will have sound understanding of the Spark framework which will help you in writing the code understand the processing of big data.

Style and Approach

Filled with hands-on examples, this course will help you learn how to process big data using Apache.

Table of Contents

Writing Big Data Processing Using Apache Spark
The Course Overview
Overview of the Apache Spark and its Architecture
Start a Project Using Apache Spark, Look at build.sbt
Creating the Spark Context
Looking at API of Spark
Data Mining and Data Cleaning
Looking at the Input Data Structure
Using RDD API in the Data Mining Process
Loading Input Data
Cleaning Input Data
Writing Job Logic
Logic for Counting Words
Using RDD API Transformations and Actions to Solve a Problem
Testing Spark Job
Summary of Data Processing

What You Will Learn

  • Understand Spark API and its Architecture.
  • Know the difference between RDD and DataFrame API.
  • Learn to join big amounts of data.
  • Start a project using Apache Spark.
  • Discover how to write efficient jobs using Apache Spark.
  • Test Spark code correctly
  • Leverage Apache Spark to process big data faster.

Authors

Table of Contents

Writing Big Data Processing Using Apache Spark
The Course Overview
Overview of the Apache Spark and its Architecture
Start a Project Using Apache Spark, Look at build.sbt
Creating the Spark Context
Looking at API of Spark
Data Mining and Data Cleaning
Looking at the Input Data Structure
Using RDD API in the Data Mining Process
Loading Input Data
Cleaning Input Data
Writing Job Logic
Logic for Counting Words
Using RDD API Transformations and Actions to Solve a Problem
Testing Spark Job
Summary of Data Processing

Video Details

ISBN 139781788398367
Course Length1 hour and 24 minutes
Read More
From 1 reviews

Read More Reviews

Recommended for You

From 0 to 1: Hive for Processing Big Data [Video] Book Cover
From 0 to 1: Hive for Processing Big Data [Video]
$ 49.99
$ 42.50
Docker, Apache Mesos & DCOS: Run and manage cloud datacenter [Video] Book Cover
Docker, Apache Mesos & DCOS: Run and manage cloud datacenter [Video]
$ 194.99
$ 165.75
Practical Big Data Analytics Book Cover
Practical Big Data Analytics
$ 35.99
$ 25.20
Tensorflow Solutions for Data [Video] Book Cover
Tensorflow Solutions for Data [Video]
$ 124.99
$ 106.25
Data Visualization Solutions for Beginners [Video] Book Cover
Data Visualization Solutions for Beginners [Video]
$ 124.99
$ 106.25
Building a Big Data Analytics Stack [Video] Book Cover
Building a Big Data Analytics Stack [Video]
$ 124.99
$ 106.25