Reader small image

You're reading from  Data Lakehouse in Action

Product typeBook
Published inMar 2022
Reading LevelBeginner
PublisherPackt
ISBN-139781801815932
Edition1st Edition
Languages
Tools
Concepts
Right arrow
Author (1)
Pradeep Menon
Pradeep Menon
author image
Pradeep Menon

Pradeep Menon is a seasoned data analytics professional with more than 18 years of experience in data and AI. Pradeep can balance business and technical aspects of any engagement and cross-pollinate complex concepts across many industries and scenarios. Currently, Pradeep works as a data and AI strategist at Microsoft. In this role, he is responsible for driving big data and AI adoption for Microsoft’s strategic customers across Asia. Pradeep is also a distinguished speaker and blogger and has given numerous keynotes on cloud technologies, data, and AI.
Read more about Pradeep Menon

Right arrow

Summary

This chapter covered data ingestion and processing. We started by exploring the different patterns for batch data ingestion: ETL and ELT.

Then, we delved into the different components of the ELTL pattern, which is used to ingest and process batch data in a data lakehouse. Then, we discussed how to push or pull data into a raw data store. Finally, we discussed the pivotal role that the raw data store layer plays in data ingestion and processing.

Next, we delved into distributed computing and how it is used for processing batch data at scale.

After discussing batch data ingestion and processing, we discussed patterns for ingesting and processing stream data. Then, we discussed how to ingest stream data by publishing it to a topic and subscribing to it for processing. Finally, we learned how to micro batch the streams and exercise actions on a micro batch or a specific event of interest.

Finally, we brought all the concepts we'd discussed together and weaved...

lock icon
The rest of the page is locked
Previous PageNext Page
You have been reading a chapter from
Data Lakehouse in Action
Published in: Mar 2022Publisher: PacktISBN-13: 9781801815932

Author (1)

author image
Pradeep Menon

Pradeep Menon is a seasoned data analytics professional with more than 18 years of experience in data and AI. Pradeep can balance business and technical aspects of any engagement and cross-pollinate complex concepts across many industries and scenarios. Currently, Pradeep works as a data and AI strategist at Microsoft. In this role, he is responsible for driving big data and AI adoption for Microsoft’s strategic customers across Asia. Pradeep is also a distinguished speaker and blogger and has given numerous keynotes on cloud technologies, data, and AI.
Read more about Pradeep Menon