Distributed Deep Learning with Apache Spark [Video]

More Information
  • The distributed nature of Apache Spark and how to compose it with deep learning programs
  • Adding the ND4J library for Spark distributed vector processing
  • Learn how to use recurrent neural network textual data
  • Plugging the Word2Vec algorithm into the RNN layer
  • Solve the anomaly detection problem via a deep learning approach
  • Perform automatic feature extraction by leveraging DL4J
  • Use regression with neural networks

Deep learning is a subfield of Artificial Intelligence and Machine Learning where a huge amount of data is processed in complex layers of neural networks. It has solved tons of interesting real-world problems in recent years. Distributed deep learning (DL) involves training a deep neural network in parallel across multiple machines. In this course, you will get started with implementing Deep Learning solutions easily with the help of Apache Spark.

You will begin with a short introduction on Deep Learning and Apache Spark and the principles of distributed modeling. With the help of real-world examples, you will investigate different types of neural network and work with DL libraries such as BigDL, Deeplearning4j, and the Deep Learning pipelines library to implement DL models and distributed computing on Spark. You will see how you can easily use a large dataset to implement efficient DL solutions to simplify real-world examples. You will also learn how to distribute the computationally heavy parts of DL into processes with the help of Apache Spark.

By the end of this course, you'll have gained experience in implementing Distributed Deep Learning for your models at work. Our examples will be based on real-world problems from the banking industry.

The code bundle for this course is available at https://github.com/PacktPublishing/Distributed-Deep-Learning-with-Apache-Spark

Style and Approach

This course provides step-by-step and hands-on training to help you leverage Spark and Deep Learning in Machine Learning problems. With this practical approach, you will be able to take your skills to the next level. At the end of this course, you will be able to create Deep Learning processing with DL4J and Spark for most problems you'll encounter.

  • Use Apache Spark to create fast and parallel deep learning programs that can be leveraged in multiple systems and business domains
  • Understand how to leverage deep learning with anomaly-detection problems
  • Learn about deep learning layers and how to compose them with well-known ML algorithms
Course Length 1 hours 50 minutes
ISBN 9781838553838
Date Of Publication 28 Feb 2019


Tomasz Lelek

Tomasz Lelek is a software engineer, programming mostly in Java and Scala. He has been working with the Spark and ML APIs for the past 6 years, with production experience in processing petabytes of data. He is passionate about nearly everything associated with software development and believes that we should always try to consider different solutions and approaches before attempting to solve a problem. Recently, he was also a speaker at conferences in Poland—Confitura, and JDD (Java Developers Day) and at Krakow Scala User Group. He has also conducted a live coding session at the Geecon Conference.