Summary
This chapter explained the Data Consumption tier and we discussed in detail the Data Discovery and Data Provisioning zones of the Data Consumption tier. We started with understanding the various processes such as data classification, relation extraction, and indexing the data, that can be applied on the Raw and Data Hub zones of the Data Lake to enable discovery of the data. After the Data Discovery is enabled, we understood the key functionalities that can be implemented to perform Data Discovery.
We have also discussed Data Provisioning in detail, understanding the various functionalities that can be provided to data consumers while requesting for data to be provisioned. In the subsequent sections, we took a look at the various Big Data tools and technologies that can be used to perform Data Discovery and Data Provisioning to help you in decision making and arrive at the set of technologies that can be used for specific use cases, by giving an overview of where these tools can be...