Instant MapReduce Patterns - Hadoop Essentials How-to

More Information
Learn
  • Write and run a simple MapReduce program
  • Understand the workings of Hadoop and how to write a custom formatter
  • Calculate analytics, cross-correlation, and set operations using Hadoop
  • Write simple Hadoop programs to perform searches
  • Join data by writing Hadoop programs
  • Perform graph operations and clustering
About

MapReduce is a programming model for processing large datasets, and Hadoop is an open source implementation of it. More and more data is becoming available, and it hides many insights that might hold the key to success or failure. With MapReduce, you can write code that processes and analyzes this data.

Instant MapReduce Patterns - Hadoop Essentials How-to is a concise introduction to Hadoop and programming with MapReduce. It aims to get you started and give you an overall feel for programming with Hadoop, so that you have a well-grounded foundation for understanding and solving your MapReduce problems as needed.

Instant MapReduce Patterns - Hadoop Essentials How-to starts with the configuration of Hadoop before moving on to writing simple examples and discussing MapReduce programming patterns.

We will start simply by installing Hadoop and writing a word count program. We will then work through seven styles of MapReduce programs: analytics, set operations, cross-correlation, search, graph, joins, and clustering. For each case, you will learn the pattern and create a representative example program. The book also provides you with additional pointers to further enhance your Hadoop skills.
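To give a feel for the word count program mentioned above, here is a minimal sketch of the map, shuffle, and reduce steps in plain Python. It does not use Hadoop or its API and is not taken from the book; the function names (`map_phase`, `shuffle`, `reduce_phase`, `word_count`) are hypothetical and chosen only for illustration.

```python
from collections import defaultdict


def map_phase(line):
    """Map step: emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle(pairs):
    """Shuffle step: group all values by key.

    In a real MapReduce framework this grouping happens between the
    map and reduce steps; here it is simulated in memory.
    """
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(word, counts):
    """Reduce step: sum the counts collected for a single word."""
    return word, sum(counts)


def word_count(lines):
    """Run the three steps in sequence over an iterable of text lines."""
    pairs = [pair for line in lines for pair in map_phase(line)]
    return dict(reduce_phase(word, counts)
                for word, counts in shuffle(pairs).items())


if __name__ == "__main__":
    sample = ["Hadoop runs MapReduce", "MapReduce counts words"]
    print(word_count(sample))
```

In an actual Hadoop job, the map and reduce steps would run in parallel across many machines and the framework would handle the grouping; this single-process version only shows how the data flows between the steps.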

Features
  • Learn something new in an Instant! A short, fast, focused guide delivering immediate results.
  • Learn how to install, configure, and run Hadoop jobs
  • Seven recipes, each describing a particular style of MapReduce program, to give you a good understanding of how to program with MapReduce
  • A concise introduction to Hadoop and common MapReduce patterns
Page Count 60
Course Length 1 hour 48 minutes
ISBN 9781782167716
Date Of Publication 21 May 2013

Authors

Srinath Perera

Srinath Perera is a senior software architect at WSO2 Inc., where he oversees the overall WSO2 platform architecture with the CTO. He also serves as a research scientist at Lanka Software Foundation and is a visiting faculty member at the Department of Computer Science and Engineering, University of Moratuwa. He is a co-founder of the Apache Axis2 open source project, has been involved with the Apache Web Services project since 2002, and is a member of the Apache Software Foundation and the Apache Web Services project PMC. He is also a committer on the Apache open source projects Axis, Axis2, and Geronimo. He received his Ph.D. and M.Sc. in Computer Sciences from Indiana University, Bloomington, USA, and his Bachelor of Science in Computer Science and Engineering from the University of Moratuwa, Sri Lanka. He has authored many technical and peer-reviewed research articles; more details can be found on his website. He is also a frequent speaker at technical venues. He has worked with large-scale distributed systems for a long time and works closely with Big Data technologies like Hadoop and Cassandra daily. He also teaches a graduate class on parallel programming at the University of Moratuwa, which is primarily based on Hadoop.