Learning PySpark [Video]

Learning PySpark [Video]

Tomasz Drabas

Building and deploying data-intensive applications at scale using Python and Apache Spark
Video
$10.00
RRP $124.99
Save 91%
What do I get with a Mapt subscription?
  • Unlimited access to all Packt’s 6,000+ eBooks and Videos
  • 100+ new titles a month, learning paths, assessments & code files
  • 1 Free eBook or Video to download and keep every month after trial
What do I get with an eBook?
  • Download this book in EPUB, PDF, MOBI formats
  • DRM FREE - read and interact with your content when you want, where you want, and how you want
  • Access this title in the subscription reader
What do I get with Print & eBook?
  • Get a paperback copy of the book delivered to you
  • Download this book in EPUB, PDF, MOBI formats
  • DRM FREE - read and interact with your content when you want, where you want, and how you want
  • Access this title in the subscription reader
What do I get with a Video?
  • Download this Video course in MP4 format
  • DRM FREE - read and interact with your content when you want, where you want, and how you want
  • Access this title in the subscription reader
$10.00
RRP $124.99

Frequently bought together


Learning PySpark [Video] Book Cover
Learning PySpark [Video]
$ 124.99
$ 10.00
PySpark for Beginners [Video] Book Cover
PySpark for Beginners [Video]
$ 124.99
$ 10.00
Buy 2 for $20.00
Save $229.98
Add to Cart

Video Details

ISBN 13 9781788396592
Course Length 2 hours 29 minutes

Table of Contents

Video Description

Apache Spark is an open-source distributed engine for querying and processing data. In this tutorial, we provide a brief overview of Spark and its stack. This tutorial presents effective, time-saving techniques on how to leverage the power of Python and put it to use in the Spark ecosystem. You will start by getting a firm understanding of the Apache Spark architecture and how to set up a Python environment for Spark.

You'll learn about different techniques for collecting data, and distinguish between (and understand) techniques for processing data. Next, we provide an in-depth review of RDDs and contrast them with DataFrames. We provide examples of how to read data from files and from HDFS and how to specify schemas using reflection or programmatically (in the case of DataFrames). The concept of lazy execution is described and we outline various transformations and actions specific to RDDs and DataFrames.

Finally, we show you how to use SQL to interact with DataFrames. By the end of this tutorial, you will have learned how to process data using Spark DataFrames and mastered data collection techniques by distributed data processing.

Style and Approach

Filled with hands-on examples, this course will help you understand RDDs and how to work with them; you will learn about RDD actions and Spark DataFrame transformations. You will learn how to perform big data processing and use Spark DataFrames.

Video Preview

What You Will Learn

  • Learn about Apache Spark and the Spark 2.0 architecture.
  • Understand schemas for RDD, lazy executions, and transformations.
  • Explore the sorting and saving elements of RDD.
  • Build and interact with Spark DataFrames using Spark SQL
  • Create and explore various APIs to work with Spark DataFrames.
  • Learn how to change the schema of a DataFrame programmatically.
  • Explore how to aggregate, transform, and sort data with DataFrames.

Authors

Table of Contents

Video Details

ISBN 139781788396592
Course Length2 hours 29 minutes
Read More

Read More Reviews

These popular $10 titles might interest you

PySpark for Beginners [Video] Book Cover
PySpark for Beginners [Video]
$ 124.99
$ 10.00
Apache Spark with Python - Big Data with PySpark and Spark [Video] Book Cover
Apache Spark with Python - Big Data with PySpark and Spark [Video]
$ 149.99
$ 10.01
PySpark Cookbook Book Cover
PySpark Cookbook
$ 31.99
$ 10.00
Learning PySpark Book Cover
Learning PySpark
$ 35.99
$ 10.00
Real-World Machine Learning Projects with Scikit-Learn [Video] Book Cover
Real-World Machine Learning Projects with Scikit-Learn [Video]
$ 124.99
$ 10.00
Machine Learning and Tensorflow - The Google Cloud Approach [Video] Book Cover
Machine Learning and Tensorflow - The Google Cloud Approach [Video]
$ 41.99
$ 10.00