Reader small image

You're reading from  Data Lakehouse in Action

Product typeBook
Published inMar 2022
Reading LevelBeginner
PublisherPackt
ISBN-139781801815932
Edition1st Edition
Languages
Tools
Concepts
Right arrow
Author (1)
Pradeep Menon
Pradeep Menon
author image
Pradeep Menon

Pradeep Menon is a seasoned data analytics professional with more than 18 years of experience in data and AI. Pradeep can balance business and technical aspects of any engagement and cross-pollinate complex concepts across many industries and scenarios. Currently, Pradeep works as a data and AI strategist at Microsoft. In this role, he is responsible for driving big data and AI adoption for Microsoft’s strategic customers across Asia. Pradeep is also a distinguished speaker and blogger and has given numerous keynotes on cloud technologies, data, and AI.
Read more about Pradeep Menon

Right arrow

Chapter 5: Deriving Insights from a Data Lakehouse

A lot of ground has been covered so far. The previous chapters covered the methods of ingesting, processing, storing, and serving data in a data lakehouse. Transforming the underlying data into insights is the core aim of any data analytics platform, so this chapter will focus on how to do this. We will also explore the different kinds of data analytics that can be employed in this process.

First, we'll discuss some of the business requirements relating to data analytics. Then, we'll explore different kinds of data analytics and how different stakeholders can use them. After that, we will dive into how these capabilities are enabled using the three components of the data analytics layer: the analytics sandbox, the business intelligence service, and the artificial intelligence service. We will cover different types of descriptive and advanced data analytics. We will also focus on the methods of enabling these analytics...

Discussing the themes of analytics capabilities

Data analytics is the process through which data is transformed into insights. Before getting into the technical components that enable this transformation, let's discuss themes of analytics capabilities that an organization requires. The analytical capabilities are targeted at two types of personas:

  • Technical users: Technical users are stakeholders with the technical skills to directly engage with the underlying data structures. They have skills in SQL, programming, data science, and data engineering. A typical technical user could be a data analyst, data engineer, data scientist, or data architect.
  • Functional users: Functional users are stakeholders with expertise in the specific functional domain, but their focus on technology is limited. They are subject matter experts, and they understand how the business or a particular domain functions. A typical functional user could be a business analyst or manager.

The...

Enabling analytics capabilities in a data lakehouse

The previous section defined the different types of analytics that need to be fulfilled by a data lakehouse. Now, let's focus on how a data lakehouse enables these capabilities. Recall that in Chapter 2, The Data Lakehouse Architecture Overview, we defined the logical architecture of a data lakehouse. One of the layers of the architecture was the data analytics layer, which interacts with the data lake layer and the data serving layer. The following figure illustrates this interaction between the layers of the data lakehouse architecture:

Figure 5.4 – The interaction between the data lakehouse layers

The three components of the data analytics layer are as follows:

  • Analytical sandbox service
  • Business intelligence service
  • AI/ML service

The following figure maps the required analytics capabilities to the components that fulfill them:

Figure 5.5 –...

Summary

This chapter covered the data analytics layer, which is a vital element in a data lakehouse. We explained that transforming data into insights is the core aim of any data analytics platform, and we looked at how this can be achieved in detail.

We started by exploring some of the different business requirements of data analytics. Then, we covered different users and how they interact with data analytics. We showed how descriptive and advanced analytics form the broad categories of analytics that organizations typically require. We then discussed the characteristics of these categories in detail. Next, we drilled down into different analytics capabilities within these categories. After that, we discussed five analytics capabilities that are important for fulfilling the analytical needs of an organization.

We then mapped the three components of the data analytics layer with their analytics capabilities. Finally, the chapter discussed the sub-components required in each of...

lock icon
The rest of the chapter is locked
You have been reading a chapter from
Data Lakehouse in Action
Published in: Mar 2022Publisher: PacktISBN-13: 9781801815932
Register for a free Packt account to unlock a world of extra content!
A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.
undefined
Unlock this book and the full library FREE for 7 days
Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of
Renews at $15.99/month. Cancel anytime

Author (1)

author image
Pradeep Menon

Pradeep Menon is a seasoned data analytics professional with more than 18 years of experience in data and AI. Pradeep can balance business and technical aspects of any engagement and cross-pollinate complex concepts across many industries and scenarios. Currently, Pradeep works as a data and AI strategist at Microsoft. In this role, he is responsible for driving big data and AI adoption for Microsoft’s strategic customers across Asia. Pradeep is also a distinguished speaker and blogger and has given numerous keynotes on cloud technologies, data, and AI.
Read more about Pradeep Menon