Handling streamed data is a very important aspect of a Data Lake. In the Data Lake architecture discussed in this book, the handling of streamed data is the responsibility of the messaging layer. In this chapter, we will go into detail on this layer and will also discuss the technology that we have chosen to be a part of this layer doing the actual work.
We have chosen Apache Kafka as the fitting technology to be used in messaging layer. This chapter delves deep into this technology and it's architecture in regards to Data Lake.