Spark for Data Analysis in Scala [Video]

Spark for Data Analysis in Scala [Video]

This video is included in a Mapt subscription
Anatolii Kmetiuk

Spark the new Data Analysis Library in Scala
$0.00
$106.25
$29.99p/m after trial
RRP $124.99
Subscription
Video
Start 30 Day Trial
Subscribe and access every Packt eBook & Video.
 
  • 4,000+ eBooks & Videos
  • 40+ New titles a month
  • 1 Free eBook/Video to keep every month
Start Free Trial
 
Preview in Mapt

Video Details

ISBN 139781787281165
Course Length2 hours and 5 minutes

Video Description

Scala has emerged as an important tool for performing various data analysis tasks efficiently. This video will help you leverage popular Scala libraries and tools to perform core data analysis tasks with ease.

This course will give you everything that you need to perform data analysis with Scala libraries. You will master loading raw datasets with Spark, and perform exploratory data analysis on them via plotting. Along the way you will learn what Spark has to offer when it comes to transforming datasets and how you can build a statistical model of a dataset with Spark.

Style and Approach

This friendly course takes you through the tools Apache Spark has to offer for a standard data science workflow. It is packed with step-by-step instructions and working examples. This comprehensive course is divided into clear bite size chunks so you can learn at your own pace and focus on the areas of most interest to you.

Table of Contents

Setting Up the Environment
The Course Overview
Downloading the Competition Dataset
Installing Spark Notebook
Loading the Data
Spark Abstractions: RDD, DataFrame
Loading CSV data into DataFrame
Exploratory Data Analysis
Different types of Widgets Supported for Spark Notebook for DataFrame Visualization
Statistical Functions Supported by Spark
Data Processing in Spark
Operations on DataFrame
Feature Transformers
Feature Selectors
Machine Learning in Spark with House Prices
Architecture
Algorithms: Linear Regression and Regression Trees

What You Will Learn

  • Learn to load your data in Spark
  • Work and plot with you data
  • Transform your data
  • Work with machine learning in Spark

Authors

Table of Contents

Setting Up the Environment
The Course Overview
Downloading the Competition Dataset
Installing Spark Notebook
Loading the Data
Spark Abstractions: RDD, DataFrame
Loading CSV data into DataFrame
Exploratory Data Analysis
Different types of Widgets Supported for Spark Notebook for DataFrame Visualization
Statistical Functions Supported by Spark
Data Processing in Spark
Operations on DataFrame
Feature Transformers
Feature Selectors
Machine Learning in Spark with House Prices
Architecture
Algorithms: Linear Regression and Regression Trees

Video Details

ISBN 139781787281165
Course Length2 hours and 5 minutes
Read More

Read More Reviews

Recommended for You

Taming Big Data with Spark Streaming and Scala – Hands On! [Video] Book Cover
Taming Big Data with Spark Streaming and Scala – Hands On! [Video]
$ 68.00
Taming Big Data with Apache Spark and Python - Hands On! [Video] Book Cover
Taming Big Data with Apache Spark and Python - Hands On! [Video]
$ 68.00
Getting Started with Haskell Data Analysis [Video] Book Cover
Getting Started with Haskell Data Analysis [Video]
$ 63.75
SharePoint for Developers: Building Hosted Add-Ins [Video] Book Cover
SharePoint for Developers: Building Hosted Add-Ins [Video]
$ 106.25