Getting Started with Amazon Redshift

Start by learning the fundamentals and then progress to creating and managing your own Redshift cluster. This guide walks you step-by-step through the world of big data, cloud computing, and scalable data warehousing.

Getting Started with Amazon Redshift

Starting
Stefan Bauer

Start by learning the fundamentals and then progress to creating and managing your own Redshift cluster. This guide walks you step-by-step through the world of big data, cloud computing, and scalable data warehousing.
$10.00
$29.99
RRP $14.99
RRP $29.99
eBook
Print + eBook
$12.99 p/month

Get Access

Get Unlimited Access to every Packt eBook and Video course

Enjoy full and instant access to over 3000 books and videos – you’ll find everything you need to stay ahead of the curve and make sure you can always get the job done.

+ Collection
Free Sample

Book Details

ISBN 139781782178088
Paperback154 pages

About This Book

  • Step-by-step instructions to create and manage your Redshift cluster
  • Understand the technology behind the database engine, as you learn about compression, block level storage, and column stores
  • Learn the implementation and database design considerations you will need to understand to successfully implement your own Amazon Redshift cluster

Who This Book Is For

This book is for CIOs, enterprise architects, developers, and anyone else who needs to get familiar with RedShift. The CIO will gain an understanding of what their technical staff is working on; the technical implementation personnel will get an in-depth view of the technology, and what it will take to implement their own solutions.

Table of Contents

Chapter 1: Overview
Pricing
Configuration options
Data storage
Considerations for your environment
Summary
Chapter 2: Transition to Redshift
Cluster configurations
Cluster creation
Cluster details
SQL Workbench and other query tools
Unsupported features
Command line
The PSQL command line
Summary
Chapter 3: Loading Your Data to Redshift
Datatypes
Schemas
Connecting to S3
The copy command
Load troubleshooting
ETL products
Performance monitoring
Indexing strategies
Sort keys
Distribution keys
Summary
Chapter 4: Managing Your Data
Backup and recovery
Resize
Table maintenance
Workload Management (WLM)
Compression
Streaming data
Query optimizer
Summary
Chapter 5: Querying Data
SQL syntax considerations
Query performance monitoring
Explain plans
Working with tables
Summary
Chapter 6: Best Practices
Security
Cluster configuration
Database maintenance
Cluster operation
Database design
Monitoring
Data processing
Summary

What You Will Learn

  • Provision your own Redshift cluster
  • Connect third party tools and software to your Redshift cluster
  • Load data using Redshift's copy tools and manage your data files
  • Manage queries through the implementation of Workload Management
  • Gain an understanding of the underlying architecture, everything from a detailed view of compression, to column store block-level storage
  • Resize your cluster and use Redshift scalability to expand and grow with ease as your data storage needs grow

In Detail

Amazon Redshift is a fast, fully managed, petabyte-scale data warehouse service. It provides an excellent approach to analyzing all your data using your existing business intelligence tools.

Getting Started with Amazon Redshift is an easy-to-read, descriptive guide that breaks down the complex topics of data warehousing and Amazon Redshift. You will learn the fundamentals of Redshift technology and how to implement your own Redshift cluster, through practical, real-world examples. This exciting new technology is a powerful tool in your arsenal of data management and this book is a must-have to implement and manage your next enterprise Data Warehouse.

Packed with detailed descriptions, diagrams, and explanations, Getting Started with Amazon Redshift will bring you along, regardless of your current level of understanding, to a point where you will feel comfortable with running your own Redshift cluster. The author's own experiences will give you an understanding of what you will need to consider when working with your own data. You will also learn about how compression has been implemented and what that means relative to a column store database structure. As you progress, you will gain an understanding of monitoring techniques, performance considerations, and what it will take to successfully run your Amazon Redshift cluster on a day-to-day basis. There truly is something in this book for everyone who is interested in learning about this technology.

Authors

Table of Contents

Chapter 1: Overview
Pricing
Configuration options
Data storage
Considerations for your environment
Summary
Chapter 2: Transition to Redshift
Cluster configurations
Cluster creation
Cluster details
SQL Workbench and other query tools
Unsupported features
Command line
The PSQL command line
Summary
Chapter 3: Loading Your Data to Redshift
Datatypes
Schemas
Connecting to S3
The copy command
Load troubleshooting
ETL products
Performance monitoring
Indexing strategies
Sort keys
Distribution keys
Summary
Chapter 4: Managing Your Data
Backup and recovery
Resize
Table maintenance
Workload Management (WLM)
Compression
Streaming data
Query optimizer
Summary
Chapter 5: Querying Data
SQL syntax considerations
Query performance monitoring
Explain plans
Working with tables
Summary
Chapter 6: Best Practices
Security
Cluster configuration
Database maintenance
Cluster operation
Database design
Monitoring
Data processing
Summary

Book Details

ISBN 139781782178088
Paperback154 pages
Read More