Big Data Processing using Apache Spark [Video]

Tomasz Lelek

Leverage one of the most efficient and widely adopted Big Data processing frameworks - Apache Spark

Video Details

ISBN 13: 9781788398367
Course Length: 1 hour and 24 minutes

Video Description

Every year, the volume of data we need to store and analyze grows substantially. Aggregating all the data about our users and analyzing it for insights can mean processing terabytes of data. To handle such volumes, we need a technology that can distribute computations and make them more efficient. Apache Spark is such a technology: it lets us process big data in a faster and more scalable way.

In this course, we will learn how to leverage Apache Spark to process big data quickly. In the first section, we will cover the basics of the Spark API and its architecture in detail. In the second section, we will learn about data mining and data cleaning, looking at the input data structure and how input data is loaded. In the third section, we will write actual jobs that analyze data. By the end of the course, you will have a sound understanding of the Spark framework, which will help you write code and understand how big data is processed.

Style and Approach

Filled with hands-on examples, this course will help you learn how to process big data using Apache Spark.

Table of Contents

Writing Big Data Processing Using Apache Spark
  • The Course Overview
  • Overview of the Apache Spark and its Architecture
  • Start a Project Using Apache Spark, Look at build.sbt
  • Creating the Spark Context
  • Looking at API of Spark

Data Mining and Data Cleaning
  • Looking at the Input Data Structure
  • Using RDD API in the Data Mining Process
  • Loading Input Data
  • Cleaning Input Data

Writing Job Logic
  • Logic for Counting Words
  • Using RDD API Transformations and Actions to Solve a Problem
  • Testing Spark Job
  • Summary of Data Processing
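
The cleaning and word-counting steps named in the contents above can be sketched without Spark at all. The following is a minimal plain-Python illustration, not the course's own code: the function names clean_line and count_words and the sample sentences are hypothetical, and the standard library's Counter stands in for the flatMap, map, and reduceByKey steps that a Spark RDD word-count job would typically use.

```python
import re
from collections import Counter


def clean_line(line):
    """Lowercase a line and replace anything that is not a letter, digit, or whitespace with a space."""
    return re.sub(r"[^a-z0-9\s]", " ", line.lower())


def count_words(lines):
    """Count how often each cleaned word occurs across all lines."""
    # Analogous to flatMap: every line is cleaned and split into individual words.
    words = (word for line in lines for word in clean_line(line).split())
    # Analogous to map + reduceByKey: Counter tallies one occurrence per word.
    return Counter(words)


# Hypothetical input data, standing in for lines loaded from a file.
sample = [
    "Spark is fast.",
    "Spark is scalable, and Spark is widely adopted!",
]
counts = count_words(sample)
print(counts.most_common(2))
```

In a real Spark job the same logic would run in parallel over partitions of the input, but the shape of the computation (clean, split, count per key) stays the same, which also makes it easy to test on a small in-memory sample first.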

What You Will Learn

  • Understand the Spark API and its architecture.
  • Know the difference between the RDD and DataFrame APIs.
  • Learn to join large amounts of data.
  • Start a project using Apache Spark.
  • Discover how to write efficient jobs using Apache Spark.
  • Test Spark code correctly.
  • Leverage Apache Spark to process big data faster.
