Reader small image

You're reading from  Data Lakehouse in Action

Product typeBook
Published inMar 2022
Reading LevelBeginner
PublisherPackt
ISBN-139781801815932
Edition1st Edition
Languages
Tools
Concepts
Right arrow
Author (1)
Pradeep Menon
Pradeep Menon
author image
Pradeep Menon

Pradeep Menon is a seasoned data analytics professional with more than 18 years of experience in data and AI. Pradeep can balance business and technical aspects of any engagement and cross-pollinate complex concepts across many industries and scenarios. Currently, Pradeep works as a data and AI strategist at Microsoft. In this role, he is responsible for driving big data and AI adoption for Microsoft’s strategic customers across Asia. Pradeep is also a distinguished speaker and blogger and has given numerous keynotes on cloud technologies, data, and AI.
Read more about Pradeep Menon

Right arrow

Preface

Digital transformation is a reality. All organizations, big or small, have to embrace this reality to be relevant in the future. Data is at the core of this, and data analytics is the catalyst for this transformation. Therefore, an agile, scalable, and robust data architecture for analytics is pivotal for forging data as a strategic asset.

However, very few organizations can successfully harness their data estate for analytics. Many of them grapple with obsolete enterprise data warehouse architectural patterns or have jumped onto the data lake bandwagon without a proper architectural framework. Also, the new trending term "Data Lakehouse" focuses on various vendors' product-centric views rather than an architectural paradigm. This book views the concept of Data Lakehouse through an architectural lens.

This book is a comprehensive framework for developing a modern data analytics architecture. While writing this book, I have focused on architectural constructs of a Data Lakehouse. The book covers different layers and components of architecture. It explores how these different layers interoperate to form a robust, scalable, and modular architecture that can be deployed on any platform.

By the end of this book, you will understand the need for a new data architecture pattern called Data Lakehouse, the details of the different layers and components of a Data Lakehouse architecture, and the methods required to deploy this architecture in a cloud computing platform and scale it to achieve the macro-patterns of Data Mesh and Hub-spoke.

Who this book is for

This book is for people who want to understand how to architect modern analytics. This book targets anyone who wants to become well-versed with modern data architecture patterns to enable large-scale analytics. It explains concepts in a non-technical and straightforward manner. The book's target audience includes data architects, big data engineers, data strategists and practitioners, data stewards, and cloud computing practitioners.

What this book covers

Chapter 1, Introducing the Evolution of Data Analytics Patterns, provides an overview of the evolution of the data architecture patterns for analytics.

Chapter 2, The Data Lakehouse Architecture Overview, provides an overview of the various components that form the Data Lakehouse architecture pattern.

Chapter 3, Ingesting and Processing Data in a Data Lakehouse, deep dives into the methods of ingesting and processing data in a batch and streaming data in a Data Lakehouse.

Chapter 4, Storing and Serving Data in a Data Lakehouse, discusses the types of datastores of a data lake and various methods of serving data from a Data Lakehouse.

Chapter 5, Deriving Insights from a Data Lakehouse, discusses the ways in which business intelligence, artificial intelligence, and data exploration can be carried out.

Chapter 6, Applying Data Governance in a Data Lakehouse, discusses how data can be governed, how to implement and maintain data quality, and how data needs to be cataloged.

Chapter 7, Applying Data Security in a Data Lakehouse, discusses various components used to secure the Data Lakehouse and ways to provide proper access to the right users.

Chapter 8, Implementing a Data Lakehouse on Microsoft Azure, focuses on implementing a Data Lakehouse on the Microsoft Azure cloud computing platform.

Chapter 9, Scaling the Data Lakehouse Architecture, discusses how Data Lakehouses can be scaled to realize the macro-architecture patterns of Data Mesh and Hub-spoke.

Download the color images

We also provide a PDF file that has color images of the screenshots/diagrams used in this book. You can download it here: https://static.packt-cdn.com/downloads/9781801815932_ColorImages.pdf.

Conventions used

Bold: Indicates a new term, an important word, or words that you see on screen. For instance, words in menus or dialog boxes appear in bold. Here is an example: "The two types of metadata that need to be cataloged include Functional and Technical."

Get in touch

Feedback from our readers is always welcome.

General feedback: If you have questions about any aspect of this book, mention the book title in the subject of your message and email us at customercare@packtpub.com.

Errata: Although we have taken every care to ensure the accuracy of our content, mistakes do happen. If you have found a mistake in this book, we would be grateful if you would report this to us. Please visit www.packtpub.com/support/errata, selecting your book, clicking on the Errata Submission Form link, and entering the details.

Piracy: If you come across any illegal copies of our works in any form on the internet, we would be grateful if you would provide us with the location address or website name. Please contact us at copyright@packt.com with a link to the material.

If you are interested in becoming an author: If there is a topic that you have expertise in and you are interested in either writing or contributing to a book, please visit authors.packtpub.com.

Share your thoughts

Once you've read Data Lakehouse in Action, we'd love to hear your thoughts! Please click https://packt.link/r/1-801-81593-3 to go straight to the Amazon review page for this book and share your feedback.

Your review is important to us and the tech community and will help us make sure we're delivering excellent quality content.

lock icon
The rest of the chapter is locked
You have been reading a chapter from
Data Lakehouse in Action
Published in: Mar 2022Publisher: PacktISBN-13: 9781801815932
Register for a free Packt account to unlock a world of extra content!
A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.
undefined
Unlock this book and the full library FREE for 7 days
Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of
Renews at $15.99/month. Cancel anytime

Author (1)

author image
Pradeep Menon

Pradeep Menon is a seasoned data analytics professional with more than 18 years of experience in data and AI. Pradeep can balance business and technical aspects of any engagement and cross-pollinate complex concepts across many industries and scenarios. Currently, Pradeep works as a data and AI strategist at Microsoft. In this role, he is responsible for driving big data and AI adoption for Microsoft’s strategic customers across Asia. Pradeep is also a distinguished speaker and blogger and has given numerous keynotes on cloud technologies, data, and AI.
Read more about Pradeep Menon