Building a Big Data Analytics Stack [Video]

More Information
Learn
  • Use Pig and Hive in a non-Java way to understand the power of Hadoop
  • Explore Spark and use it to stream and batch process
  • Use the HBase database from a Java application
  • Find out more about the machine learning toolkit and its use with Spark
  • Know how to leverage the strengths of Big Data tools
About

Building a Big Data ecosystem is hard. A variety of technologies is available, and each has its pros and cons. As software engineers building a Big Data pipeline, we need to use lower-level tools and APIs such as HBase and Apache Spark.

In this course, we’ll check out HBase, a database built on top of HDFS. Moving on, we’ll have a bit of fun with Spark MLlib. Finally, you’ll get an understanding of ETL and deploy a Hadoop project to the cloud.

By the end of the course, you’ll also be able to use higher-level tools with more user-friendly, declarative APIs, such as Pig and Hive.

Style and Approach

This course will give you both a knowledge-based understanding and practical hands-on experience of Hadoop 2.7. It also looks at Spark, Pig, Hive, HBase, and YARN, so you can understand how to implement these components while using Hadoop clusters.

Features
  • Understand ETL and deploy a Hadoop project to the cloud
  • Learn to maintain a Hadoop cluster running HDFS and MapReduce in this hands-on course
  • A practical course that includes real-world examples of developing YARN and MapReduce
Course Length 1 hour 31 minutes
ISBN 9781787125018
Date Of Publication 27 Nov 2017

Authors

Tomasz Lelek

Tomasz Lelek is a software engineer who programs mostly in Java and Scala. He has worked with the core Java language for the past six years. He has developed multiple production Java software projects that work in a reactive way. He is passionate about nearly everything associated with software development and believes that we should always try to consider different solutions and approaches before solving a problem. Recently, he was a speaker at conferences in Poland, at JDD (Java Developers Day), and at Krakow Scala User Group. He has also conducted a live coding session at Geecon Conference. He is a co-founder of initLearn, an e-learning platform that was built with the Java language. He has also written articles about everything related to the Java world.