Data flow
- Data is ingested into the bronze layer by the Azure Data Factory pipelines of the Azure data lake and stored in its raw format.
- Azure Data Factory pipelines or a combination of Azure Data Factory and Azure Databricks cleans and formats the data and moves it to the silver layer of the data lake.
- The data in the silver layer can be directly consumed by machine learning algorithms via Azure Databricks or Azure Machine Learning.
- For advanced analytics that needs analytical cubes or a star schema of fact and dimension tables (https://learn.microsoft.com/en-us/power-bi/guidance/star-schema), data from the silver layer is further transformed using Azure Data Factory and Azure Databricks before being moved to the gold layer.
- Machine learning models trained on data from the silver layer are hosted on an AKS cluster for inferencing by applications.
- Power BI reads data from the gold or silver layer to build visual dashboards.
- Data that needs to be shared...