Taming Big Data with Spark Streaming and Scala - Hands On! [Video]

Preview in Mapt

Taming Big Data with Spark Streaming and Scala - Hands On! [Video]

Frank Kane

Process large amounts of data in real time using Spark Streaming
Mapt Subscription
FREE
$29.99/m after trial
Video
$10.00
RRP $79.99
Save 87%
What do I get with a Mapt Pro subscription?
  • Unlimited access to all Packt’s 5,000+ eBooks and Videos
  • Early Access content, Progress Tracking, and Assessments
  • 1 Free eBook or Video to download and keep every month after trial
What do I get with an eBook?
  • Download this book in EPUB, PDF, MOBI formats
  • DRM FREE - read and interact with your content when you want, where you want, and how you want
  • Access this title in the Mapt reader
What do I get with Print & eBook?
  • Get a paperback copy of the book delivered to you
  • Download this book in EPUB, PDF, MOBI formats
  • DRM FREE - read and interact with your content when you want, where you want, and how you want
  • Access this title in the Mapt reader
What do I get with a Video?
  • Download this Video course in MP4 format
  • DRM FREE - read and interact with your content when you want, where you want, and how you want
  • Access this title in the Mapt reader
$0.00
$10.00
$29.99 p/m after trial
RRP $79.99
Subscription
Video
Start 14 Day Trial

Frequently bought together


Taming Big Data with Spark Streaming and Scala - Hands On! [Video] Book Cover
Taming Big Data with Spark Streaming and Scala - Hands On! [Video]
$ 79.99
$ 10.00
R Troubleshooting Solutions [Video] Book Cover
R Troubleshooting Solutions [Video]
$ 124.99
$ 10.00
Buy 2 for $20.00
Save $184.98
Add to Cart

Video Details

ISBN 139781787123915
Course Length5 hours 54 minutes

Video Description

Businesses these days require constant, real-time analysis of large amounts of data, along with meaningful insights which influence business decisions. Apache Spark has emerged as the most popular tool in the Big Data market for efficient real-time analytics of Big Data. Spanning over 5 hours, this course will teach you the basics of Apache Spark and how to use Spark Streaming - a module of Apache Spark which involves handling and processing of Big Data on a real-time basis. You will learn how to create Spark applications with Scala to process streams of real-time data. Whether you want to analyze continuously incoming website traffic, analyze real-time streams of Twitter feeds or query your streaming data in real time, this course has got you covered. You will also learn how to use the MLlib module of Spark to train machine learning models with streaming data, and use those models to make real-time predictions. The course assumes some programming experience, and uses Scala to develop Spark applications. It includes a crash course in the Scala programming language in case you're new to it.

Table of Contents

Getting Started
Introduction and Getting Set Up
Stream Live Tweets with Spark Streaming!
A Crash Course in Scala
Scala Basics – Part 1
Scala Basics – Part 2
Flow Control in Scala
Functions in Scala
Data Structures in Scala
Spark Streaming Concepts
Introduction to Spark
The Resilient Distributed Dataset (RDD)
RDDs in Action – Simple Word Count Application
Introduction to Spark Streaming
Revisiting the PrintTweets Application
Windowing – Aggregating Data over Longer Time Spans
Fault Tolerance in Spark Streaming
Spark Streaming Examples with Twitter
Saving Tweets to Disk
Tracking the Average Tweet Length
Tracking the Most Popular Hashtags
Spark Streaming Examples with Clickstream / Apache Access Log Data
Tracking the Top URLs Requested
Alarming on Log Errors
Integrating Spark Streaming with Spark SQL
Intro to Structured Streaming in Spark 2
Analyzing Apache Log files with Structured Streaming
Integrating with Other Systems
Integrating with Apache Kafka
Integrating with Apache Flume
Integrating with Amazon Kinesis
Writing Custom Data Receivers
Integrating with Cassandra
Advanced Spark Streaming Examples
Stateful Information in Spark Streams
Streaming K-Means Clustering
Streaming Linear Regression
Spark Streaming in Production
Running with spark-submit
Packaging Your Code with SBT
Running on a Real Hadoop Cluster with EMR
Troubleshooting and Tuning Spark Jobs
You Made It!
Learning More

What You Will Learn

  • Process large amounts of real-time data using the Spark Streaming module
  • Create efficient Spark applications using the Scala programming language
  • Integrate Spark Streaming with various data sources
  • Integrate Spark Streaming with Spark SQL to query your data in real time
  • Train machine learning models with streaming data, and use them for making real-time predictions

Authors

Table of Contents

Getting Started
Introduction and Getting Set Up
Stream Live Tweets with Spark Streaming!
A Crash Course in Scala
Scala Basics – Part 1
Scala Basics – Part 2
Flow Control in Scala
Functions in Scala
Data Structures in Scala
Spark Streaming Concepts
Introduction to Spark
The Resilient Distributed Dataset (RDD)
RDDs in Action – Simple Word Count Application
Introduction to Spark Streaming
Revisiting the PrintTweets Application
Windowing – Aggregating Data over Longer Time Spans
Fault Tolerance in Spark Streaming
Spark Streaming Examples with Twitter
Saving Tweets to Disk
Tracking the Average Tweet Length
Tracking the Most Popular Hashtags
Spark Streaming Examples with Clickstream / Apache Access Log Data
Tracking the Top URLs Requested
Alarming on Log Errors
Integrating Spark Streaming with Spark SQL
Intro to Structured Streaming in Spark 2
Analyzing Apache Log files with Structured Streaming
Integrating with Other Systems
Integrating with Apache Kafka
Integrating with Apache Flume
Integrating with Amazon Kinesis
Writing Custom Data Receivers
Integrating with Cassandra
Advanced Spark Streaming Examples
Stateful Information in Spark Streams
Streaming K-Means Clustering
Streaming Linear Regression
Spark Streaming in Production
Running with spark-submit
Packaging Your Code with SBT
Running on a Real Hadoop Cluster with EMR
Troubleshooting and Tuning Spark Jobs
You Made It!
Learning More

Video Details

ISBN 139781787123915
Course Length5 hours 54 minutes
Read More

Read More Reviews

Recommended for You

R Troubleshooting Solutions [Video] Book Cover
R Troubleshooting Solutions [Video]
$ 124.99
$ 10.00
Machine Learning with Core ML in iOS 11 [Video] Book Cover
Machine Learning with Core ML in iOS 11 [Video]
$ 124.99
$ 10.00
Data Visualization Projects in Python [Video] Book Cover
Data Visualization Projects in Python [Video]
$ 124.99
$ 10.00
Maven Tutorial - Manage Java Dependencies in 20 Steps [Video] Book Cover
Maven Tutorial - Manage Java Dependencies in 20 Steps [Video]
$ 192.99
$ 10.00
Fun Excel Learning [Video] Book Cover
Fun Excel Learning [Video]
$ 19.99
$ 4.00
From Voices to Results - Voice of Customer Questions, Tools and Analysis Book Cover
From Voices to Results - Voice of Customer Questions, Tools and Analysis
$ 31.99
$ 10.00