About this video

This course is your guide to performing real-time data analytics and stream processing with Spark. Use different components and tools such as HDFS, HBase, and Hive to process raw data. Learn how tools such as Hive and Pig aid in this process.

In this course, you will start off by learning data analysis techniques with Hadoop using tools such as Hive. Furthermore, you will learn to apply these techniques in real-world big data applications. Also, you will delve into Spark and its related tools to perform real-time data analytics, streaming, and batch processing on your application.

Finally, you'll learn how to extend your analytics solutions to the cloud.

Please note that this course is based on Hadoop 3.0 but the code used in the course is compatible with Hadoop 3.2.

The code bundle for this video course is available at -

https://github.com/PacktPublishing/Hands-On-Big-Data-Analysis-with-Hadoop-3

Publication date:
August 2018
Publisher
Packt
Duration
1 hour 36 minutes
ISBN
9781788999908

About the Author

  • Tomasz Lelek

    Tomasz Lelek is a software engineer, programming mostly in Java and Scala. He has been working with the Spark and ML APIs for the past 6 years, with production experience in processing petabytes of data. He is passionate about nearly everything associated with software development and believes that we should always try to consider different solutions and approaches before attempting to solve a problem. Recently, he was also a speaker at conferences in Poland—Confitura, and JDD (Java Developers Day) and at Krakow Scala User Group. He has also conducted a live coding session at the Geecon Conference.

    Browse publications by this author
Book Title
Access this video and the full library for only $5/m
Access now