Learning Apache Spark 2

Learn about the fastest-growing open source project in the world, and find out how it revolutionizes big data analytics

Muhammad Asif Abbasi

Book Details

ISBN 13: 9781785885136
Paperback: 356 pages

Book Description

The Spark juggernaut keeps rolling, gaining more momentum each day. Spark provides key capabilities in the form of Spark SQL, Spark Streaming, Spark ML, and GraphX, all accessible via Java, Scala, Python, and R. Deploying these capabilities is crucial, whether it runs on a standalone framework or as part of an existing Hadoop installation, configured with YARN and Mesos.

The next part of the journey after installation is using the key components: APIs, clustering, machine learning APIs, data pipelines, and parallel programming. It is important to understand why each framework component is key, how widely it is used, how stable it is, and what its pertinent use cases are.

Once we understand the individual components, we will work through a couple of real-life advanced analytics examples, such as building a recommendation system and predicting customer churn.

The objective of these real-life examples is to give the reader confidence in using Spark for real-world problems.

What You Will Learn

  • Get an overview of big data analytics and its importance for organizations and data professionals
  • Delve into Spark to see how it differs from existing processing platforms
  • Understand the intricacies of various file formats and how to process them with Apache Spark
  • Learn how to deploy Spark with YARN, Mesos, or a standalone cluster manager
  • Learn the concepts of Spark SQL, SchemaRDD, and caching, and how to work with Hive and Parquet file formats
  • Understand the architecture of Spark MLlib and some of the off-the-shelf algorithms that come with Spark
  • Get introduced to the deployment and usage of SparkR
  • Walk through the importance of graph computation and the graph processing systems available in the market
  • Work through a real-world Spark example by building a recommendation engine using ALS
  • Use a telco data set to predict customer churn using Random Forests
