HBase Design Patterns

More Information
  • Install and configure a Hadoop cluster and HBase
  • Write Java code to read and write HBase
  • Explore Phoenix open source project to talk to HBase in SQL
  • Store single entities, generate keys, use lists, maps, and sets
  • Utilize UUID for generic key generation to store data and deal with large files
  • Use denormalization to optimize performance
  • Represent one-to-many and many-to-many relationships and deal with transactions
  • Troubleshoot and optimize your application

With the increasing use of NoSQL in general and HBase in particular, knowing how to build practical applications depends on the application of design patterns. These patterns, distilled from extensive practical experience of multiple demanding projects, guarantee the correctness and scalability of the HBase application. They are also generally applicable to most NoSQL databases.

Starting with the basics, this book will show you how to install HBase in different node settings. You will then be introduced to key generation and management and the storage of large files in HBase. Moving on, this book will delve into the principles of using time-based data in HBase, and show you some cases on denormalization of data while working with HBase. Finally, you will learn how to translate the familiar SQL design practices into the NoSQL world. With this concise guide, you will get a better idea of typical storage patterns, application design templates, HBase explorer in multiple scenarios with minimum effort, and reading data from multiple region servers.

  • Design HBase schemas for the most demanding functional and scalability requirements
  • Optimize HBase's handling of single entities, time series, large files, and complex events by utilizing design patterns
  • Written in an easy-to-follow style, and incorporating plenty of examples, and numerous hints and tips.
Page Count 150
Course Length 4 hours 30 minutes
ISBN 9781783981045
Date Of Publication 23 Dec 2014


Mark Kerzner

Mark Kerzner holds degrees in law, math, and computer science. He has been designing software for many years and Hadoop-based systems since 2008. He is a cofounder of Elephant Scale LLC, a big data training and consulting firm, as well as the co-author of the open source book Hadoop Illuminated. He has authored other books and patents as well. He knows about 10 languages and is a Mensa member.

Sujee Maniyam

Sujee Maniyam has been developing software for 15 years. He is a hands-on expert of Hadoop, NoSQL, and cloud technologies. He is a founder and the Principal at Elephant Scale (http://elephantscale.com/), where he consults and teaches big data technologies. He has authored a few open source projects and has contributed to the Hadoop project. He is an author of the open source book Hadoop Illuminated(http://hadoopilluminated.com/).

He is the founder of the Big Data Gurus meetup in San Jose, CA. He has presented at various meetups and conferences.

You can find him on LinkedIn at http://www.linkedin.com/in/sujeemaniyam or read more about him at http://sujee.net.