Reader small image

You're reading from  Mastering Apache Storm

Product typeBook
Published inAug 2017
Reading LevelExpert
Publisher
ISBN-139781787125636
Edition1st Edition
Languages
Right arrow
Author (1)
Ankit Jain
Ankit Jain
author image
Ankit Jain

Ankit Jain holds a bachelor's degree in computer science and engineering. He has 6 years, experience in designing and architecting solutions for the big data domain and has been involved with several complex engagements. His technical strengths include Hadoop, Storm, S4, HBase, Hive, Sqoop, Flume, Elasticsearch, machine learning, Kafka, Spring, Java, and J2EE. He also shares his thoughts on his personal blog. You can follow him on Twitter at @mynameisanky. He spends most of his time reading books and playing with different technologies. When not at work, he spends time with his family and friends watching movies and playing games.
Read more about Ankit Jain

Right arrow

Integrating Storm with HBase


As explained in earlier chapters, Storm is meant for real-time data processing. However, in most cases, you will need to store the processed data in a data store so that you can use the stored data for further batch analysis and execute the batch analysis query on the data stored. This section explains how you can store the data processed by Storm in HBase.

Before going to the implementation, I want to give a little overview of what HBase is. HBase is a NoSQL, multidimensional, sparse, horizontally scalable database that is modeled after GoogleBigTable. HBase is built on top of Hadoop, which means it relies on Hadoop and integrates with the MapReduce framework very well. Hadoop provides the following benefits to HBase:

  • A distributed data store that runs on top of the commodity hardware
  • Fault tolerance

We will assume that you have HBase installed and running on your system. You can refer to the article on HBase installation at https://hbase.apache.org/cygwin.html...

lock icon
The rest of the page is locked
Previous PageNext Page
You have been reading a chapter from
Mastering Apache Storm
Published in: Aug 2017Publisher: ISBN-13: 9781787125636

Author (1)

author image
Ankit Jain

Ankit Jain holds a bachelor's degree in computer science and engineering. He has 6 years, experience in designing and architecting solutions for the big data domain and has been involved with several complex engagements. His technical strengths include Hadoop, Storm, S4, HBase, Hive, Sqoop, Flume, Elasticsearch, machine learning, Kafka, Spring, Java, and J2EE. He also shares his thoughts on his personal blog. You can follow him on Twitter at @mynameisanky. He spends most of his time reading books and playing with different technologies. When not at work, he spends time with his family and friends watching movies and playing games.
Read more about Ankit Jain