Machine Learning with Spark: RAW

Book and eBook expected December 2014. Pre-order now!
Machine Learning with Spark: RAW
eBook: $29.99
Formats: PDF, PacktLib, ePub and Mobi formats
save 20%!
Print + free eBook + free PacktLib access to the book: $79.98    Print cover: $49.99
save 37%!
Free Shipping!
UK, US, Europe and selected countries in Asia.
Also available on:
Table of Contents
Sample Chapters
  • Familiarise yourself with Spark and get up and running quickly
  • Design your machine learning system and prepare your raw data for input
  • Build a number of machine learning systems including a recommendation engine, classification model and clustering model

Book Details

Language : English
Paperback : 329 pages [ 235mm x 191mm ]
Release Date : December 2014
ISBN : 1783288515
ISBN 13 : 9781783288519
Author(s) : Nick Pentreath
Topics and Technologies : All Books, Big Data and Business Intelligence, Open Source, RAW books

Chapter Availability


Chapter Number Title Availability
1 Getting up and Running with Spark (aka Spark in X Minutes)
2 Designing a Machine Learning System using Spark IN THE BOOK
3 Obtaining, Processing and Preparing Data IN THE BOOK
4 Building a Recommendation Engine IN THE BOOK
5 Building a Classification Model
JULY 2014
6 Building a Regression Model
JULY 2014
7 Building a Clustering Model JULY 2014
8 Putting it all Together AUGUST 2014
9 Evaluating the Performance of the System AUGUST 2014
10 Advanced Topics SEPTEMBER 2014

Nick Pentreath

Sorry, we don't have any reviews for this title yet.

Submit Errata

Please let us know if you have found any errors not listed on this list by completing our errata submission form. Our editors will check them and add them to this list. Thank you.

Sorry, there are currently no downloads available for this title.

Frequently bought together

Machine Learning with Spark: RAW +    Machine Learning with R =
50% Off
the second eBook
Price for both: £27.35

Buy both these recommended eBooks together and get 50% off the cheapest eBook.

What you will learn from this book

  • Get up and running with Spark quickly
  • Design your machine learning system by example
  • Obtain and prepare your data for input into machine learning models
  • Build a recommendation engine, classification model, regression model and clustering model to process and evaluate your data
  • Utilise your results and model predictions in your system
  • Evaluate the performance of your system
  • Combine your results and predictions from different models into ensemble models

In Detail

Apache Spark is a fast, efficient and increasingly popular engine for processing big data. It is centralized and easily programmable and is one of the best tools for organising and analysing big data through machine learning.

This book is a practical, step-by-step guide which will walk you through using Spark to develop effective machine learning systems to sort and analyse your business data.

The book begins by familiarising you with Spark and teaching you how to get up and running with it quickly. We will move on to designing a machine learning system and you will learn how to obtain and prepare your data for input into Spark and machine learning models. Next the book will take you through building a recommendation engine, a classification model, a regression model and a clustering model, each one with different benefits and increasing your skills with Spark. The book will then cover how to put this knowledge together and how to utilise your results and model predications. Finally, you will be taught how you can evaluate the performance of your machine learning systems and how to combine your results and predictions into ensemble models along with other advanced topics including using Spark streaming in the final chapter.


This book is currently available as a RAW (Read As we Write) book. A RAW book is an ebook, and this one is priced at 20% off the usual eBook price. Once you purchase the RAW book, you can immediately download the content of the book so far, and when new chapters become available, you will be notified, and  can download the new version of the book. When the book is published, you will receive the full, finished eBook.

If you like, you can preorder the print book at the same time as you purchase the RAW book at a significant discount.

Since a RAW book is an eBook, a RAW book is non returnable and non refundable.

Local taxes may apply to your eBook purchase.


This book provides you with a practical step-by-step tutorial complete with examples, images and tips to machine learning with Spark.

Who this book is for

This book is perfect for those of you new to Spark and just starting out with machine learning but will also benefit those more familiar with machine learning in providing you with the help you need to utilize Spark for your business needs.

Code Download and Errata
Packt Anytime, Anywhere
Register Books
Print Upgrades
eBook Downloads
Video Support
Contact Us
Awards Voting Nominations Previous Winners
Judges Open Source CMS Hall Of Fame CMS Most Promising Open Source Project Open Source E-Commerce Applications Open Source JavaScript Library Open Source Graphics Software
Open Source CMS Hall Of Fame CMS Most Promising Open Source Project Open Source E-Commerce Applications Open Source JavaScript Library Open Source Graphics Software