Optimizing Spectrum queries for provisioned clusters
Amazon Redshift Spectrum allows you to extend your Amazon Redshift data warehouse to use SQL queries on data that is stored in Amazon S3. Optimizing Amazon Redshift Spectrum queries allows optimal throughputs for the SQL queries, as well as saving on the costs associated with them. In this recipe, we will demonstrate techniques to get insights into the performance of Spectrum-based queries and optimize them.
Getting ready
To complete this recipe, you will need:
- An IAM user with access to Amazon Redshift and Amazon S3
- An Amazon Redshift data warehouse deployed in the AWS Region eu-west-1
- Amazon Redshift data warehouse master user credentials
- Access to any SQL interface, such as a SQL client or Amazon Redshift Query Editor V2
- An IAM role attached to an Amazon Redshift data warehouse that can access Amazon S3; we will refer to it in the recipes with
[Your-Redshift_Role] - An AWS account...