Search icon
Subscription
0
Cart icon
Close icon
You have no products in your basket yet
Arrow left icon
All Products
Best Sellers
New Releases
Books
Videos
Audiobooks
Learning Hub
Newsletters
Free Learning
Arrow right icon
Limitless Analytics with Azure Synapse

You're reading from  Limitless Analytics with Azure Synapse

Product type Book
Published in Jun 2021
Publisher Packt
ISBN-13 9781800205659
Pages 392 pages
Edition 1st Edition
Languages
Author (1):
Prashant Kumar Mishra Prashant Kumar Mishra
Profile icon Prashant Kumar Mishra

Table of Contents (20) Chapters

Preface 1. Section 1: The Basics and Key Concepts
2. Chapter 1: Introduction to Azure Synapse 3. Chapter 2: Considerations for Your Compute Environment 4. Section 2: Data Ingestion and Orchestration
5. Chapter 3: Bringing Your Data to Azure Synapse 6. Chapter 4: Using Synapse Pipelines to Orchestrate Your Data 7. Chapter 5: Using Synapse Link with Azure Cosmos DB 8. Section 3: Azure Synapse for Data Scientists and Business Analysts
9. Chapter 6: Working with T-SQL in Azure Synapse 10. Chapter 7: Working with R, Python, Scala, .NET, and Spark SQL in Azure Synapse 11. Chapter 8: Integrating a Power BI Workspace with Azure Synapse 12. Chapter 9: Perform Real-Time Analytics on Streaming Data 13. Chapter 10: Generate Powerful Insights on Azure Synapse Using Azure ML 14. Section 4: Best Practices
15. Chapter 11: Performing Backup and Restore in Azure Synapse Analytics 16. Chapter 12: Securing Data on Azure Synapse 17. Chapter 13: Managing and Monitoring Synapse Workloads 18. Chapter 14: Coding Best Practices 19. Other Books You May Enjoy

Understanding Azure Data Lake

A data lake is a storage repository that allows you to store your data in native format without having to first structure the data at any scale.

Azure Data Lake Storage provides secure, scalable, cost-effective storage for big data analytics. There are two generations of Azure Data Lake, Gen1 and Gen2; however, we will focus on Gen2 only throughout this chapter. Azure Data Lake Gen2 converges the capabilities of Azure Data Lake Gen1 with the capabilities of Azure Blob Storage with the addition of a Hierarchical Namespace to Blob Storage. Because of Azure Blob Storage's capabilities, you get a high availability/disaster recovery solutions for your data lake at a low cost.

The new Azure Blob File System (ABFS) driver is available within Azure HDInsight, Azure Databricks, and Azure Synapse Analytics, which can be used to access the data in a similar way to Hadoop Distributed File System (HDFS).

To use Data Lake Storage Gen2's capabilities, you need to create a storage account that has a hierarchical namespace. You can go through the following steps to create your Azure Data Lake Storage Gen2 account:

  1. Log in to the Azure portal: https://portal.azure.com.
  2. Click on the + Create a Resource link and select Storage account from the list of all available resources.
  3. Select the Resource group where you want to create your storage account. If you don't have a Resource group created, click on the Create new link below the drop-down list.
  4. Fill in the fields for Storage account name and Location.  
  5. Select Standard or Premium Performance as per your business need. If you are new to Data Lake, then it would be better to begin with Standard.
  6. Select an appropriate value for Account kind and Replication as per the business need. Again, the recommendation would be to leave the default selected values in these fields if you are performing this operation just for your learning purposes:
    Figure 1.10 – Creating Azure Data Lake Gen2 in Azure

    Figure 1.10 – Creating Azure Data Lake Gen2 in Azure

  7. For now, we can skip the Networking and Data protection tabs and move directly to the Advanced tab.
  8. Click on the Enabled radio button for the Hierarchical namespace property under the Advanced tab:
    Figure 1.11 – Enabling Hierarchical namespace for Data Lake Storage Gen2 on the Advanced tab

    Figure 1.11 – Enabling Hierarchical namespace for Data Lake Storage Gen2 on the Advanced tab

  9. Leave the default values for all other fields and click on Review + create.
  10. After reviewing all the details, click on Create and your Azure Data Lake Gen2 account will be created in a couple of minutes.

Now that you have already created your Azure Data Lake Gen2 account, you can use this account with Azure Synapse Analytics. We will learn how to read data from Data Lake in later chapters, but for now, we will learn about Azure Synapse Studio, and how it provides a unified experience when working with various resources under one roof.

You have been reading a chapter from
Limitless Analytics with Azure Synapse
Published in: Jun 2021 Publisher: Packt ISBN-13: 9781800205659
Register for a free Packt account to unlock a world of extra content!
A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.
Unlock this book and the full library FREE for 7 days
Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of
Renews at $15.99/month. Cancel anytime}