Getting Started with Hadoop 2.x [Video]

More Information
  • Get in-depth knowledge of the Hadoop 2.7 architecture
  • See how to implement your hypothesis/algorithms on big data
  • Understand the Hadoop 2.x Architecture
  • Discover the process to set up an HDFS cluster along with formatting and data transfer in between your local storage and the Hadoop filesystem
  • Get to know all about the Hadoop UI
  • Create Map-reduce jobs

Hadoop emerged in response to the proliferation of masses and masses of data collected by organizations, offering a strong solution to store, process, and analyze what has commonly become known as Big Data. It comprises a comprehensive stack of components designed to enable these tasks on a distributed scale, across multiple servers and thousands of machines.

This course introduces you to the powerful system synonymous with Big Data, demonstrating how to create an instance and leverage Hadoop ecosystem's many components to store, process, manage, and query massive data sets with confidence.

The video course opens with an introduction to the world of Hadoop, where we discuss Nodes, Data Sets, and operations such as map and reduce. The second section deals HDFS, Hadoop's file-system used to store data. Further on, you’ll discover the differences between jobs and tasks, and get to know about the Hadoop UI. After this, we turn our attention to storing data in HDFS and Data Transformations. Lastly, we will learn how to implement an algorithm in Hadoop map-reduce way and analyze the overall performance.

Style and Approach

This book gives you an in-depth understanding of the basics of Hadoop balanced with tutorials that put the theory into practice. The focus of this course is giving you both the theoretical understanding and the practical hands-on examples of Hadoop 2.7.

  • Get a better understanding of how to set up a HDFS cluster between local storage and the Hadoop filesystem
  • Run your own Hadoop clusters on your own machine or in the cloud
  • Implement the best practices for Hadoop development
Course Length 1 hour 59 minutes
ISBN 9781787122550
Date Of Publication 29 Apr 2017


A K M Zahiduzzaman

A K M Zahiduzzaman is a software engineer with NewsCred Dhaka. He is a software developer and technology enthusiast. He was a Ruby on Rails developer, but now working on NodeJS and angularJS and python.He is also working with a much wider vision as a technology company. The next goal is introducing SOA within the current applications to scale development via microservices.

Zahiduzzaman has a lot of experience with Spark and is passionate about it. He is also a guitarist and has a band too. He was also a speaker for an international event in Dhaka. He is very enthusiastic and love to share his knowledge.