Real Time Streaming using Apache Spark Streaming [Video]

Real Time Streaming using Apache Spark Streaming [Video]

Tomasz Lelek

Analyze data in real-time using the Apache Spark Streaming API
Mapt Subscription
FREE
€29.98/m after trial
Video
€121.38
RRP €142.78
Save 14%
What do I get with a Mapt Pro subscription?
  • Unlimited access to all Packt’s 5,000+ eBooks and Videos
  • Early Access content, Progress Tracking, and Assessments
  • 1 Free eBook or Video to download and keep every month after trial
What do I get with an eBook?
  • Download this book in EPUB, PDF, MOBI formats
  • DRM FREE - read and interact with your content when you want, where you want, and how you want
  • Access this title in the Mapt reader
What do I get with Print & eBook?
  • Get a paperback copy of the book delivered to you
  • Download this book in EPUB, PDF, MOBI formats
  • DRM FREE - read and interact with your content when you want, where you want, and how you want
  • Access this title in the Mapt reader
What do I get with a Video?
  • Download this Video course in MP4 format
  • DRM FREE - read and interact with your content when you want, where you want, and how you want
  • Access this title in the Mapt reader
€0.00
€121.38
€29.98p/m after trial
RRP €142.78
Subscription
Video
Start 30 Day Trial
Subscribe and access every Packt eBook & Video.
 
  • 5,000+ eBooks & Videos
  • 50+ New titles a month
  • 1 Free eBook/Video to keep every month
Start Free Trial
 
Preview in Mapt

Video Details

ISBN 139781788391528
Course Length59 minutes

Video Description

Spark is the technology that allows us to perform big data processing in the MapReduce paradigm very rapidly, due to performing the processing in memory without the need for extensive I/O operations.

Recently, the streaming approach to processing events in near real time became more widely adopted and more necessary. In this course, you will learn how to handle big amount of unbounded infinite streams of data. You will analyze data and draw conclusions from it. Furthermore, we will look at common problems when processing event streams: sorting, watermarks, deduplication, and keeping state (for example, user sessions). You will also implement streaming processing using Spark Streaming and analyze traffic on a web page in real time.

Style and Approach

This course promotes a practical approach to dealing with large amounts of online, unbounded data and drawing conclusions from it. You will implement streaming logic to handle huge amount of infinite streams of data.

Table of Contents

Understanding a Spark Streaming
The Course Overview
Introduction to Spark Streaming API
Creating a Project in Spark Streaming
Defining Data Source and Data Sink
Creating Base for Testing Spark Streaming
Implementing Stream Processing
Handling Unbounded Data
Using Event Time and Processing Time
Sorting Stream Data
Deduplicating Data
Implementing Transformations and Processing Logic
Implementing Job Processing Logic
Writing Test for Steaming Job
Creating Processing Logic That Needs to Keep State of the User Session
Summary of Stream Processing

What You Will Learn

  • Implement stream processing using Apache Spark Streaming
  • Consume events from the source (for instance, Kafka), apply logic on it, and send it to a data sink.
  • Understand how to deduplicate events when you have a system that ensures at-least-once deliver.
  • Learn to tackle common stream processing problems.
  • Create a job to analyze data in real time using the Apache Spark Streaming API.
  • Master event time and processing time
  • Single event processing and the micro-batch approach to processing events
  • Learn to sort infinite event streams

Authors

Table of Contents

Understanding a Spark Streaming
The Course Overview
Introduction to Spark Streaming API
Creating a Project in Spark Streaming
Defining Data Source and Data Sink
Creating Base for Testing Spark Streaming
Implementing Stream Processing
Handling Unbounded Data
Using Event Time and Processing Time
Sorting Stream Data
Deduplicating Data
Implementing Transformations and Processing Logic
Implementing Job Processing Logic
Writing Test for Steaming Job
Creating Processing Logic That Needs to Keep State of the User Session
Summary of Stream Processing

Video Details

ISBN 139781788391528
Course Length59 minutes
Read More

Read More Reviews

Recommended for You

Real-time Data Processing with Azure Stream Analytics [Video] Book Cover
Real-time Data Processing with Azure Stream Analytics [Video]
€ 142.78
€ 121.38
Learning Real-time Processing with Spark Streaming Book Cover
Learning Real-time Processing with Spark Streaming
€ 34.78
€ 24.36
Taming Big Data with Spark Streaming and Scala - Hands On! [Video] Book Cover
Taming Big Data with Spark Streaming and Scala - Hands On! [Video]
€ 83.98
€ 71.40
Building Real-time Communication Applications Using Twilio [Video] Book Cover
Building Real-time Communication Applications Using Twilio [Video]
€ 85.18
€ 72.42
Taming Big Data with Apache Spark and Python - Hands On! [Video] Book Cover
Taming Big Data with Apache Spark and Python - Hands On! [Video]
€ 83.98
€ 71.40
Big Data Processing using Apache Spark [Video] Book Cover
Big Data Processing using Apache Spark [Video]
€ 142.78
€ 121.38