Search icon
Arrow left icon
All Products
Best Sellers
New Releases
Books
Videos
Audiobooks
Learning Hub
Newsletters
Free Learning
Arrow right icon
Mastering Hadoop 3

You're reading from  Mastering Hadoop 3

Product type Book
Published in Feb 2019
Publisher Packt
ISBN-13 9781788620444
Pages 544 pages
Edition 1st Edition
Languages
Authors (2):
Chanchal Singh Chanchal Singh
Profile icon Chanchal Singh
Manish Kumar Manish Kumar
Profile icon Manish Kumar
View More author details

Table of Contents (23) Chapters

Title Page
Dedication
About Packt
Foreword
Contributors
Preface
Journey to Hadoop 3 Deep Dive into the Hadoop Distributed File System YARN Resource Management in Hadoop Internals of MapReduce SQL on Hadoop Real-Time Processing Engines Widely Used Hadoop Ecosystem Components Designing Applications in Hadoop Real-Time Stream Processing in Hadoop Machine Learning in Hadoop Hadoop in the Cloud Hadoop Cluster Profiling Who Can Do What in Hadoop Network and Data Security Monitoring Hadoop Other Books You May Enjoy Index

Optimizing MapReduce


The MapReduce framework provides a massive advantage for improving performance for large datasets as we can add more nodes to get more performance. The resources such as node, memory, and disk require significant investment, thus only adding the node should not be a parameter for performance optimization. Sometimes, adding more nodes does not help in getting more performance as the application performance could be something else, such as code optimization, unwanted data transfer, and so on. In this section, we will discuss some of the best practices to optimize the MapReduce application. 

The performance of the application is measured by the overall processing time taken by the application. MapReduce processes data in parallel and thus it already provides a performance advantage over your MapReduce application. The following factors play important roles in optimizing MapReduce performance.

 

 

Hardware configuration

Hardware setup is the first step in the Hadoop installation...

lock icon The rest of the chapter is locked
Register for a free Packt account to unlock a world of extra content!
A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.
Unlock this book and the full library FREE for 7 days
Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of
Renews at $15.99/month. Cancel anytime}