Data Lake Development with Big Data


Product type: Book
Published: Nov 2015
ISBN-13: 9781785888083
Pages: 164
Edition: 1st

Chapter 4. Data Discovery and Consumption

In the previous chapters, we discussed the Data Intake and Data Management tiers. We saw that, during intake, data is ingested from disparate sources and stored in the Raw Zone. The Data Management Tier then profiles and validates the data; integrates, cleanses, standardizes, and enriches it; and places the result in the Data Hub Zone.

Let us now look at how this data can be discovered, packaged, and provisioned so that downstream systems can consume it. Data Consumption comprises Data Discovery and Data Provisioning. This chapter covers the following topics:

  • The process of enabling discovery in the Data Lake

  • The various Data Discovery functionalities

  • The important aspects of Data Provisioning, such as data publication and subscription

  • The architectural guidance on choosing Big Data tools and technologies for Data Discovery and Data Provisioning
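To make the split between Data Discovery and Data Provisioning a little more concrete, here is a minimal Python sketch. It is not taken from the book and does not represent any particular Big Data tool: `DatasetEntry`, `DataCatalog`, the tag names, and the `/datahub/...` paths are all hypothetical names chosen for illustration. It assumes that discovery amounts to searching catalog metadata and that provisioning amounts to notifying subscribers when a dataset is published.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class DatasetEntry:
    """Hypothetical metadata record for a dataset in the Data Hub Zone."""
    name: str
    location: str
    tags: List[str] = field(default_factory=list)


class DataCatalog:
    """Toy in-memory catalog (an assumption, not a real product's API)."""

    def __init__(self) -> None:
        self._entries: Dict[str, DatasetEntry] = {}
        self._subscribers: Dict[str, List[Callable[[DatasetEntry], None]]] = {}

    def subscribe(self, tag: str, callback: Callable[[DatasetEntry], None]) -> None:
        # Provisioning: a consumer registers interest in a tag.
        self._subscribers.setdefault(tag, []).append(callback)

    def publish(self, entry: DatasetEntry) -> None:
        # Provisioning: register the dataset, then notify the
        # subscribers of each of its tags.
        self._entries[entry.name] = entry
        for tag in entry.tags:
            for callback in self._subscribers.get(tag, []):
                callback(entry)

    def discover(self, tag: str) -> List[DatasetEntry]:
        # Discovery: search the catalog metadata by tag.
        return [e for e in self._entries.values() if tag in e.tags]


catalog = DataCatalog()
received: List[DatasetEntry] = []
catalog.subscribe("sales", received.append)
catalog.publish(DatasetEntry("orders_2015", "/datahub/orders_2015", ["sales", "orders"]))
catalog.publish(DatasetEntry("web_logs", "/datahub/web_logs", ["clickstream"]))
```

In this sketch, the subscriber to the `sales` tag is notified only about `orders_2015`, while `discover` lets any consumer find published datasets after the fact. A real Data Lake would back both operations with a persistent metadata store rather than in-memory dictionaries.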

The following figure represents the end-state architecture of...
