Hadoop Essentials

More Information
  • Get introduced to Hadoop, big data, and the pillars of Hadoop such as HDFS, MapReduce, and YARN
  • Understand different use cases of Hadoop along with big data analytics and real-time analysis in Hadoop
  • Explore the Hadoop ecosystem tools and effectively use them for faster development and maintenance of a Hadoop project
  • Demonstrate YARN's capacity for database processing
  • Work with Hive, HBase, and Pig with Hadoop to easily figure out your big data problems
  • Gain insights into widely used tools such as Sqoop, Flume, Storm, and Spark using practical examples

This book jumps into the world of Hadoop ecosystem components and its tools in a simplified manner, and provides you with the skills to utilize them effectively for faster and effective development of Hadoop projects.

Starting with the concepts of Hadoop YARN, MapReduce, HDFS, and other Hadoop ecosystem components, you will soon learn many exciting topics such as MapReduce patterns, data management, and real-time data analysis using Hadoop. You will also get acquainted with many Hadoop ecosystem components tools such as Hive, HBase, Pig, Sqoop, Flume, Storm, and Spark.

By the end of the book, you will be confident to begin working with Hadoop straightaway and implement the knowledge gained in all your real-world scenarios.

  • Get to grips with different Hadoop ecosystem tools that can help you achieve scalability, performance, maintainability, and efficiency in your projects
  • Understand the different paradigms of Hadoop and get the most out of it to engage the power of your data
  • This is a fast-paced reference guide covering the key components and functionalities of Hadoop
Page Count 194
Course Length 5 hours 49 minutes
ISBN 9781784396688
Date Of Publication 28 Apr 2015


Shiva Achari

Shiva Achari has over 8 years of extensive industry experience and is currently working as a Big Data Architect consultant with companies such as Oracle and Teradata. Over the years, he has architected, designed, and developed multiple innovative and high-performance large-scale solutions, such as distributed systems, data centers, big data management tools, SaaS cloud applications, Internet applications, and Data Analytics solutions.

He is also experienced in designing big data and analytics applications, such as ingestion, cleansing, transformation, correlation of different sources, data mining, and user experience in Hadoop, Cassandra, Solr, Storm, R, and Tableau.

He specializes in developing solutions for the big data domain and possesses sound hands-on experience on projects migrating to the Hadoop world, new developments, product consulting, and POC. He also has hands-on expertise in technologies such as Hadoop, Yarn, Sqoop, Hive, Pig, Flume, Solr, Lucene, Elasticsearch, Zookeeper, Storm, Redis, Cassandra, HBase, MongoDB, Talend, R, Mahout, Tableau, Java, and J2EE.

He has been involved in reviewing Mastering Hadoop, Packt Publishing.

Shiva has expertise in requirement analysis, estimations, technology evaluation, and system architecture along with domain experience in telecoms, Internet applications, document management, healthcare, and media.

Currently, he is supporting presales activities such as writing technical proposals (RFP), providing technical consultation to customers, and managing deliveries of big data practice groups in Teradata.

He is active on his LinkedIn page at https://www.linkedin.com/in/shivaachari/.