Apache Spark 2.x Machine Learning Cookbook: Over 100 recipes to simplify machine learning model implementations with Spark

Name: Apache Spark 2.x Machine Learning Cookbook
Brand: Packt
SKU: 9781783551606
Availability: InStock

By Mohammed Guller , Siamak Amirghodsi , Shuen Mei , Meenakshi Rajendran , Broderick Hall

~~$43.99~~ $29.99

Book Sep 2017 666 pages 1st Edition

What do you get with eBook?

Instant access to your Digital eBook purchase

Download this book in EPUB and PDF formats

Access this title in our online reader with advanced features

DRM FREE - Read whenever, wherever and however you want

Buy Now

Product Details

Publication date : Sep 22, 2017

Length 666 pages

Edition : 1st Edition

Language : English

ISBN-13 : 9781783551606

Vendor :

Apache

Category :

Data

Languages :

Scala (Intermediate)

Concepts :

Machine Learning

Tools :

Apache Spark (Intermediate)

View table of contents

Preview Book

Download Code

Key benefits

Solve the day-to-day problems of data science with Spark
This unique cookbook consists of exciting and intuitive numerical recipes
Optimize your work by acquiring, cleaning, analyzing, predicting, and visualizing your data

Description

Machine learning aims to extract knowledge from data, relying on fundamental concepts in computer science, statistics, probability, and optimization. Learning about algorithms enables a wide range of applications, from everyday tasks such as product recommendations and spam filtering to cutting edge applications such as self-driving cars and personalized medicine. You will gain hands-on experience of applying these principles using Apache Spark, a resilient cluster computing system well suited for large-scale machine learning tasks. This book begins with a quick overview of setting up the necessary IDEs to facilitate the execution of code examples that will be covered in various chapters. It also highlights some key issues developers face while working with machine learning algorithms on the Spark platform. We progress by uncovering the various Spark APIs and the implementation of ML algorithms with developing classification systems, recommendation engines, text analytics, clustering, and learning systems. Toward the final chapters, we’ll focus on building high-end applications and explain various unsupervised methodologies and challenges to tackle when implementing with big data ML systems.

What you will learn

[*]Get to know how Scala and Spark go hand-in-hand for developers when developing ML systems with Spark [*]Build a recommendation engine that scales with Spark [*]Find out how to build unsupervised clustering systems to classify data in Spark [*]Build machine learning systems with the Decision Tree and Ensemble models in Spark [*]Deal with the curse of high-dimensionality in big data using Spark [*]Implement Text analytics for Search Engines in Spark [*]Streaming Machine Learning System implementation using Spark

Core systems	Version
Spark	2.0.0
Java	1.8
IntelliJ IDEA	2016.2.4
Scala-sdk	2.11.8

Miscellaneous JARs	Version
`bliki-core`	3.0.19
`breeze-viz`	0.12
`Cloud9`	1.5.0
`Hadoop-streaming`	2.2.0
`JCommon`	1.0.23
`JFreeChart`	1.0.19
`lucene-analyzers-common`	6.0.0
`Lucene-Core`	6.0.0
`scopt`	3.3.0
`spark-streaming-flume-assembly`	2.0.0
`spark-streaming-kafka-0-8-assembly`	2.0.0

Apache Spark 2.x Machine Learning Cookbook: Over 100 recipes to simplify machine learning model implementations with Spark

What do you get with eBook?

Product Details