Hands-On Data Analysis with Scala

5 (1 reviews total)
By Rajesh Gupta
    What do you get with a Packt Subscription?

  • Instant access to this title and 7,500+ eBooks & Videos
  • Constantly updated with 100+ new titles each month
  • Breadth and depth in over 1,000+ technologies
  1. Section 1: Scala and Data Analysis Life Cycle

About this book

Efficient business decisions with an accurate sense of business data helps in delivering better performance across products and services. This book helps you to leverage the popular Scala libraries and tools for performing core data analysis tasks with ease.

The book begins with a quick overview of the building blocks of a standard data analysis process. You will learn to perform basic tasks like Extraction, Staging, Validation, Cleaning, and Shaping of datasets. You will later deep dive into the data exploration and visualization areas of the data analysis life cycle. You will make use of popular Scala libraries like Saddle, Breeze, Vegas, and PredictionIO for processing your datasets. You will learn statistical methods for deriving meaningful insights from data. You will also learn to create applications for Apache Spark 2.x on complex data analysis, in real-time. You will discover traditional machine learning techniques for doing data analysis. Furthermore, you will also be introduced to neural networks and deep learning from a data analysis standpoint.

By the end of this book, you will be capable of handling large sets of structured and unstructured data, perform exploratory analysis, and building efficient Scala applications for discovering and delivering insights

Publication date:
May 2019
Publisher
Packt
Pages
298
ISBN
9781789346114

 

Section 1: Scala and Data Analysis Life Cycle

In this section, you will gain an insight into what data is, how it is prepared and processed, and how it is analyzed and stored. This section will also get you introduced to the Scala framework and how we use Scala with data. You will learn some basic commands in Scala. You will also understand what a data pipeline is and the tasks involved in this pipeline.

This section will contain the following chapters:

About the Author

  • Rajesh Gupta

    Rajesh is a Hands-on Big Data Tech Lead and Enterprise Architect with extensive experience in the full life cycle of software development. He has successfully architected, developed and deployed highly scalable data solutions using Spark, Scala and Hadoop technology stack for several enterprises. A passionate, hands-on technologist, Rajesh has master’s degrees in Mathematics and Computer Science from BITS, Pilani (India).

    Browse publications by this author

Latest Reviews

(1 reviews total)
Es un libro excelente en lo que refiere al análisis de datos con Scala. Una pequeña sugerencia sería en las diversas propuestas de análisis visual que ofrecen habría que actualizar el notebook denomiinado spark-notebook (quedo sin soporte hace muy poco tiempo) por zeppelin (página 124), es solo una sugerencia se que podría incluirse en una fe de errata. Es de destacar que a pesar de ésto; que es una de las variadas posibilidades que ofrecen para prácticar no afecta ni la lectura ni la práctica del machine learning con scala porque ofrecen otras. Me alegro de poder haberlo adquirido y felicito a los autores por el excelente libro.
Hands-On Data Analysis with Scala
Unlock this book and the full library FREE for 7 days
Start now