Instant MapReduce Patterns – Hadoop Essentials How-to [Instant]


This title is available as an eBook only
Instant MapReduce Patterns – Hadoop Essentials How-to [Instant]
eBook: $19.99
Formats: PDF, PacktLib, ePub and Mobi formats
$17.00
save 15%!
Print & eBook also available on:
Learn in an Instant - Short, Fast, Focused
Overview
Table of Contents
Author
Support
Sample Chapters
  • Learn something new in an Instant! A short, fast, focused guide delivering immediate results.
  • Learn how to install, configure, and run Hadoop jobs
  • Seven recipes, each describing a particular style of the MapReduce program to give you a good understanding of how to program with MapReduce
  • A concise introduction to Hadoop and common MapReduce patterns

Book Details

Language : English
eBook : 60 pages
Release Date : May 2013
ISBN : 1782167706
ISBN 13 : 9781782167709
Author(s) : Srinath Perera
Topics and Technologies : All Books, Big Data and Business Intelligence, Data, Instant, Cloud, Open Source

Table of Contents

Preface
Instant MapReduce Patterns – Hadoop Essentials How-to
  • Instant MapReduce Patterns – Hadoop Essentials How-to
    • Writing a word count application using Java (Simple)
    • Writing a word count application with MapReduce and running it (Simple)
    • Installing Hadoop in a distributed setup and running a word count application (Simple)
    • Writing a formatter (Intermediate)
    • Analytics – drawing a frequency distribution with MapReduce (Intermediate)
    • Relational operations – join two datasets with MapReduce (Advanced)
    • Set operations with MapReduce (Intermediate)
    • Cross correlation with MapReduce (Intermediate)
    • Simple search with MapReduce (Intermediate)
    • Simple graph operations with MapReduce (Advanced)
    • Kmeans with MapReduce (Advanced)

Srinath Perera

Srinath Perera is a senior software architect at WSO2 Inc., where he overlooks the overall WSO2 platform architecture with the CTO. He also serves as a research scientist at Lanka Software Foundation and teaches as a visiting faculty at Department of Computer Science and Engineering, University of Moratuwa. He is a co-founder of Apache Axis2 open source project, and he has been involved with the Apache Web Service project since 2002 and is a member of Apache Software foundation and Apache Web Service project PMC. He is also a committer of Apache open source projects Axis, Axis2, and Geronimo. He received his Ph.D. and M.Sc. in Computer Sciences from Indiana University, Bloomington, USA and received his Bachelor of Science in Computer Science and Engineering degree from the University of Moratuwa, Sri Lanka. He has authored many technical and peer reviewed research articles, and more details can be found on his website. He is also a frequent speaker at technical venues. He has worked with large-scale distributed systems for a long time. He closely works with Big Data technologies like Hadoop and Cassandra daily. He also teaches a parallel programming graduate class at University of Moratuwa, which is primarily based on Hadoop.
Sorry, we don't have any reviews for this title yet.

Code Downloads

Download the code and support files for this book.


Submit Errata

Please let us know if you have found any errors not listed on this list by completing our errata submission form. Our editors will check them and add them to this list. Thank you.


Errata

- 1 submitted: last submission 07 Jul 2014

Errata Type: Support Query  Page: 9

The hadoop-microbook.jar file mentioned on page 9 is actually referring to the microbook folder present in the src folder of the code bundle. 

Sorry, there are currently no downloads available for this title.

Frequently bought together

Instant MapReduce Patterns – Hadoop Essentials How-to [Instant] +    MCTS: Microsoft Silverlight 4 Development (70-506) Certification Guide =
50% Off
the second eBook
Price for both: $40.60

Buy both these recommended eBooks together and get 50% off the cheapest eBook.

What you will learn from this book

  • Write and run a simple MapReduce program
  • Understand the workings of Hadoop and how to write a custom formatter
  • Calculate analytics, cross-correlation, and set operations using Hadoop
  • Write simple Hadoop programs to perform searches
  • Join data by writing Hadoop programs
  • Perform graph operations and clustering

In Detail

MapReduce is a technology that enables users to process large datasets and Hadoop is an implementation of MapReduce. We are beginning to see more and more data becoming available, and this hides many insights that might hold key to success or failure. However, MapReduce has the ability to analyze this data and write code to process it.

Instant MapReduce Patterns – Hadoop Essentials How-to is a concise introduction to Hadoop and programming with MapReduce. It is aimed to get you started and give you an overall feel for programming with Hadoop so that you will have a well-grounded foundation to understand and solve all of your MapReduce problems as needed.

Instant MapReduce Patterns – Hadoop Essentials How-to will start with the configuration of Hadoop before moving on to writing simple examples and discussing MapReduce programming patterns.

We will start simply by installing Hadoop and writing a word count program. After which, we will deal with the seven styles of MapReduce programs: analytics, set operations, cross correlation, search, graph, Joins, and clustering. For each case, you will learn the pattern and create a representative example program. The book also provides you with additional pointers to further enhance your Hadoop skills.

Approach

Filled with practical, step-by-step instructions and clear explanations for the most important and useful tasks. This is a Packt Instant How-to guide, which provides concise and clear recipes for getting started with Hadoop.

Who this book is for

This book is for big data enthusiasts and would-be Hadoop programmers. It is also meant for Java programmers who either have not worked with Hadoop at all, or who know Hadoop and MapReduce but are not sure how to deepen their understanding.

Code Download and Errata
Packt Anytime, Anywhere
Register Books
Print Upgrades
eBook Downloads
Video Support
Contact Us
Awards Voting Nominations Previous Winners
Judges Open Source CMS Hall Of Fame CMS Most Promising Open Source Project Open Source E-Commerce Applications Open Source JavaScript Library Open Source Graphics Software
Resources
Open Source CMS Hall Of Fame CMS Most Promising Open Source Project Open Source E-Commerce Applications Open Source JavaScript Library Open Source Graphics Software