Limitless Analytics with Azure Synapse

Product type: Book
Published: Jun 2021
Publisher: Packt
ISBN-13: 9781800205659
Pages: 392
Edition: 1st Edition
Author: Prashant Kumar Mishra

Table of Contents (20 Chapters)

Preface
Section 1: The Basics and Key Concepts
Chapter 1: Introduction to Azure Synapse
Chapter 2: Considerations for Your Compute Environment
Section 2: Data Ingestion and Orchestration
Chapter 3: Bringing Your Data to Azure Synapse
Chapter 4: Using Synapse Pipelines to Orchestrate Your Data
Chapter 5: Using Synapse Link with Azure Cosmos DB
Section 3: Azure Synapse for Data Scientists and Business Analysts
Chapter 6: Working with T-SQL in Azure Synapse
Chapter 7: Working with R, Python, Scala, .NET, and Spark SQL in Azure Synapse
Chapter 8: Integrating a Power BI Workspace with Azure Synapse
Chapter 9: Perform Real-Time Analytics on Streaming Data
Chapter 10: Generate Powerful Insights on Azure Synapse Using Azure ML
Section 4: Best Practices
Chapter 11: Performing Backup and Restore in Azure Synapse Analytics
Chapter 12: Securing Data on Azure Synapse
Chapter 13: Managing and Monitoring Synapse Workloads
Chapter 14: Coding Best Practices
Other Books You May Enjoy

Implementing best practices for a Synapse Spark pool

As with Synapse SQL pools, it is important to keep our Spark pool healthy. In this section, we are going to learn how to optimize the cluster configuration for a particular workload. We will also learn various techniques for improving Apache Spark performance.

Configuring the Auto-pause setting

Platform-as-a-Service (PaaS) has some major advantages over an on-premises environment, and the Auto-pause setting is one of the most useful features it offers. If you run a Spark cluster in an on-premises environment, you pay for the provisioned capacity even though you may only need the cluster for a couple of hours a day. Synapse, however, gives you the option to configure the Auto-pause setting so that a cluster pauses automatically when it is not in use. Upon entering a value for the Number of minutes idle field within the Auto-pause setting, the Spark pool will go to a Pause state automatically...
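To make the idea concrete, here is a minimal Python sketch, not an official Synapse API call. The helper `auto_pause_fragment` is hypothetical. It builds a settings fragment whose field names (`autoPause`, `enabled`, `delayInMinutes`) are assumed to mirror the Spark pool resource definition, so verify them against the current Azure documentation before use. The second helper, `billed_hours_per_day`, is also hypothetical and is only a rough estimate of how many hours a pool stays up per day when it pauses after each idle window. It ignores startup time and actual pricing.

```python
import json


def auto_pause_fragment(idle_minutes: int, enabled: bool = True) -> dict:
    """Build an auto-pause settings fragment (field names are assumptions)."""
    if idle_minutes <= 0:
        raise ValueError("idle_minutes must be positive")
    return {"autoPause": {"enabled": enabled, "delayInMinutes": idle_minutes}}


def billed_hours_per_day(active_hours: float, sessions: int, idle_minutes: int) -> float:
    """Rough estimate: active time plus one idle window per session, capped at 24."""
    return min(24.0, active_hours + sessions * idle_minutes / 60)


# Example: pause after 15 idle minutes; two work sessions totalling 2 hours a day.
fragment = auto_pause_fragment(idle_minutes=15)
print(json.dumps(fragment, indent=2))
print(billed_hours_per_day(active_hours=2, sessions=2, idle_minutes=15))  # 2.5
```

Under these assumptions, the pool runs for roughly 2.5 hours a day instead of 24, which is the saving the Auto-pause setting is meant to provide.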
