You're reading from Cloud Scale Analytics with Azure Data Services

Product type Book

Published in Jul 2021

Publisher Packt

ISBN-13 9781800562936

Pages 520 pages

Edition 1st Edition

Languages

Concepts

Data Streaming

Author (1):

Patrik Borosch

Table of Contents (20) Chapters

Preface

1. Section 1: Data Warehousing and Considerations Regarding Cloud Computing

2. Chapter 1: Balancing the Benefits of Data Lakes Over Data Warehouses

3. Chapter 2: Connecting Requirements and Technology

4. Section 2: The Storage Layer

5. Chapter 3: Understanding the Data Lake Storage Layer

6. Chapter 4: Understanding Synapse SQL Pools and SQL Options

7. Section 3: Cloud-Scale Data Integration and Data Transformation

8. Chapter 5: Integrating Data into Your Modern Data Warehouse

9. Chapter 6: Using Synapse Spark Pools

10. Chapter 7: Using Databricks Spark Clusters

11. Chapter 8: Streaming Data into Your MDWH

12. Chapter 9: Integrating Azure Cognitive Services and Machine Learning

13. Chapter 10: Loading the Presentation Layer

14. Section 4: Data Presentation, Dashboarding, and Distribution

15. Chapter 11: Developing and Maintaining the Presentation Layer

16. Chapter 12: Distributing Data

17. Chapter 13: Introducing Industry Data Models

18. Chapter 14: Establishing Data Governance

19. Other Books You May Enjoy

Summary

In this chapter, you have examined Synapse pipelines/Azure Data Factory. You have learned how to create a data movement pipeline using a wizard, as well as from scratch in the authoring environment. You have seen the orchestration capabilities with the many different activities provided.

You have further implemented your first mapping flow to create transformations your data is going through before it lands in your Data Lake Storage. You have examined wrangling flows and learned the difference between the two data flow components.

We have also examined the IRs and their differences and talked about managed virtual networks and managed private endpoints.

Finally, we have integrated our Data Factory with Azure DevOps and have established source control over our artifacts.

In the next chapter, we are going to dive into another option to transform and process data using one of the main compute components in our modern data warehouse: the Spark engine.