Architectural guidance
As evidenced in the previous sections, there are a plethora of options available for Data Consumption; choosing the right tool depends primarily on the use case you are attempting to implement using the Data Lake. We also see that the market is flooded with umpteen tools that make decision making very difficult.
Data Discovery
We have seen, in the previous sections, that Data Lake exposes a queryable interface to data consumers to discover the data. Simple visualizations such as a histogram or tag cloud can provide an intuitive understanding of the data. The following figure depicts the key aspects that are to be considered while choosing the right tools and technologies for Data Discovery:
Big Data tools and technologies
The following section takes you through an indicative list of Big Data tools and technologies that can be used for your specific use case.
Elasticsearch
Elasticsearch is a scalable search engine that...