Machine Learning with Scala Quick Start Guide

More Information
Learn
  • Get acquainted with JVM-based machine learning libraries for Scala such as Spark ML and Deeplearning4j
  • Learn RDDs, DataFrame, and Spark SQL for analyzing structured and unstructured data
  • Understand supervised and unsupervised learning techniques with best practices and pitfalls
  • Learn classification and regression analysis with linear regression, logistic regression, Naïve Bayes, support vector machine, and tree-based ensemble techniques
  • Learn effective ways of clustering analysis with dimensionality reduction techniques
  • Learn recommender systems with collaborative filtering approach
  • Delve into deep learning and neural network architectures
About

Scala is a highly scalable integration of object-oriented nature and functional programming concepts that make it easy to build scalable and complex big data applications. This book is a handy guide for machine learning developers and data scientists who want to develop and train effective machine learning models in Scala.

The book starts with an introduction to machine learning, while covering deep learning and machine learning basics. It then explains how to use Scala-based ML libraries to solve classification and regression problems using linear regression, generalized linear regression, logistic regression, support vector machine, and Naïve Bayes algorithms.

It also covers tree-based ensemble techniques for solving both classification and regression problems. Moving ahead, it covers unsupervised learning techniques, such as dimensionality reduction, clustering, and recommender systems. Finally, it provides a brief overview of deep learning using a real-life example in Scala.

Features
  • Construct and deploy machine learning systems that learn from your data and give accurate predictions
  • Unleash the power of Spark ML along with popular machine learning algorithms to solve complex tasks in Scala.
  • Solve hands-on problems by combining popular neural network architectures such as LSTM and CNN using Scala with DeepLearning4j library
Page Count 220
Course Length 6 hours 36 minutes
ISBN 9781789345070
Date Of Publication 29 Apr 2019

Authors

Md. Rezaul Karim

Md. Rezaul Karim has more than 8 years of experience in the area of research and development with a solid knowledge of algorithms and data structures in C/C++, Java, Scala, R, and Python focusing Big Data technologies: Spark, Kafka, DC/OS, Docker, Mesos, Zeppelin, Hadoop, and MapReduce and Deep Learning technologies: TensorFlow, DeepLearning4j and H2O-Sparking Water. His research interests include Machine Learning, Deep Learning, Semantic Web/Linked Data, Big Data, and Bioinformatics.

He is a Research Scientist at Fraunhofer Institute for Applied Information Technology-FIT, Germany. He is also a Ph.D. candidate at the RWTH Aachen University, Aachen, Germany.

He holds a BS and an MS degree in Computer Engineering. Before joining the Fraunhofer-FIT, he had been working as a Researcher at the Insight Centre for Data Analytics, Ireland. Before that, he worked as a Lead Engineer with Samsung Electronics’ distributed R&D Institutes in Korea, India, Vietnam, Turkey, and Bangladesh.

Before that, he worked as a Research Assistant in the Database Lab at Kyung Hee University, Korea. He also worked as an R&D Engineer with BMTech21 Worldwide, Korea. Even before that, he worked as a Software Engineer with i2SoftTechnology, Dhaka, Bangladesh.