Switch to the store?

Hands-On Big Data Analysis with Hadoop 3 [Video]

More Information
Learn
  • Store data with HDFS and learn in detail about HBase
  • Share and access data in a SQL-like interface for HDFS
  • Analyze real-time events using Spark Streaming
  • Perform complex big data analytics using MapReduce
  • Analyze data to perform complex processing with Hive and Pig
  • Explore functional programming using Spark
  • Learn to import data using Sqoop
About

This course is your guide to performing real-time data analytics and stream processing with Spark. Use different components and tools such as HDFS, HBase, and Hive to process raw data. Learn how tools such as Hive and Pig aid in this process.

In this course, you will start off by learning data analysis techniques with Hadoop using tools such as Hive. Furthermore, you will learn to apply these techniques in real-world big data applications. Also, you will delve into Spark and its related tools to perform real-time data analytics, streaming, and batch processing on your application.

Finally, you'll learn how to extend your analytics solutions to the cloud.

Style and Approach

This course has a completely practical approach, with real-world examples which will help you to manage and analyze large-volume data with Hadoop.

Features
  • Analyze large volumes of data effectively by combining the power of big data processing tools such as Hadoop and Spark Streaming
  • Work with different kinds of data and perform real-life data operations
  • Explore best use cases, identify problem areas, and solve them with the best open-source tools
Course Length 1 hour 44 minutes
ISBN9781788999908
Date Of Publication 28 Aug 2018
Learning Spark’s Key Concepts – Spark Context, Driver, and RDD
Spark API – Functional Programming Using Spark
Spark Transformations and Actions
Writing MapReduce Jobs Using Apache Spark

Authors

Tomasz Lelek

Tomasz Lelek is a Software Engineer, programming mostly in Java, Scala. He has worked with ML algorithms for the past 5 years, with production experience in processing petabytes of data.
He is passionate about nearly everything associated with software development and believes that we should always try to consider different solutions and approaches before solving a problem. Recently he was a speaker at conferences in Poland, Confitura and JDD (Java Developers Day), and also at Krakow Scala User Group. He has also conducted a live coding session at Geecon Conference.

He is a co-founder of www.initlearn.com, an e-learning platform that was built with the Java language.

He has also written articles about everything related to the Java world: http://www.baeldung.com/.