Reader small image

You're reading from  Serverless Machine Learning with Amazon Redshift ML

Product typeBook
Published inAug 2023
Reading LevelBeginner
PublisherPackt
ISBN-139781804619285
Edition1st Edition
Languages
Right arrow
Authors (4):
Debu Panda
Debu Panda
author image
Debu Panda

Debu Panda, a Senior Manager, Product Management at AWS, is an industry leader in analytics, application platform, and database technologies, and has more than 25 years of experience in the IT world. Debu has published numerous articles on analytics, enterprise Java, and databases and has presented at multiple conferences such as re:Invent, Oracle Open World, and Java One. He is lead author of the EJB 3 in Action (Manning Publications 2007, 2014) and Middleware Management (Packt, 2009).
Read more about Debu Panda

Phil Bates
Phil Bates
author image
Phil Bates

Phil Bates is a Senior Analytics Specialist Solutions Architect at AWS. He has more than 25 years of experience implementing large-scale data warehouse solutions. He is passionate about helping customers through their cloud journey and leveraging the power of ML within their data warehouse.
Read more about Phil Bates

Bhanu Pittampally
Bhanu Pittampally
author image
Bhanu Pittampally

Bhanu Pittampally is Analytics Specialist Solutions Architect at Amazon Web Services. His background is in data and analytics and is in the field for over 16 years. He currently lives in Frisco, TX with his wife Kavitha and daughters Vibha and Medha.
Read more about Bhanu Pittampally

Sumeet Joshi
Sumeet Joshi
author image
Sumeet Joshi

Sumeet Joshi is an Analytics Specialist Solutions Architect based out of New York. He specializes in building large-scale data warehousing solutions. He has over 17 years of experience in the data warehousing and analytical space.
Read more about Sumeet Joshi

View More author details
Right arrow

Preface

Amazon Redshift Serverless enables organizations to run petabyte-scale cloud data warehouses quickly and in a cost-effective way, enabling data science professionals to efficiently deploy cloud data warehouses and leverage easy-to-use tools to train models and run predictions. This practical guide will help developers and data professionals working with Amazon Redshift data warehouses to put their SQL knowledge to work for training and deploying machine learning models.

The book begins by helping you to explore the inner workings of Redshift Serverless as well as the foundations of data analytics and types of data machine learning. With the help of step-by-step explanations of essential concepts and practical examples, you’ll then learn to build your own classification and regression models. As you advance, you’ll find out how to deploy various types of machine learning projects using familiar SQL code, before delving into Redshift ML. In the concluding chapters, you’ll discover best practices for implementing serverless architecture with Redshift.

By the end of this book, you’ll be able to configure and deploy Amazon Redshift Serverless, train and deploy machine learning models using Amazon Redshift ML, and run inference queries at scale.

Who this book is for

Data scientists and machine learning developers working with Amazon Redshift who want to explore its machine-learning capabilities will find this definitive guide helpful. A basic understanding of

machine learning techniques and working knowledge of Amazon Redshift is needed to make the most of this book.

What this book covers

Chapter 1, Introduction to Amazon Redshift Serverless, presents an overview of Amazon Redshift and Redshift Serverless, walking you through how to get started in just a few minutes and connect using Redshift Query Editor v2. You will create a sample database and run queries using the Notebook feature.

Chapter 2, Data Loading and Analytics on Redshift Serverless, helps you learn different mechanisms to efficiently load data into Redshift Serverless.

Chapter 3, Applying Machine Learning in Your Data Warehouse, introduces machine learning and common use cases to apply to your data warehouse.

Chapter 4, Leveraging Amazon Redshift Machine Learning, builds on Chapter 3. Here, we dive into Amazon Redshift ML, learning how it works and how to leverage it to solve use cases.

Chapter 5, Building Your First Machine Learning Model, sees you get hands-on with Amazon Redshift ML and build your first model using simple CREATE MODEL syntax.

Chapter 6, Building Classification Models, covers classification problems and the algorithms you can use in Amazon Redshift ML to solve these problems and learn how to create a model with user guidance.

Chapter 7, Building Regression Models, helps you identify whether a problem involves regression and explores the different methods available in Amazon Redshift ML for training and building regression models.

Chapter 8, Building Unsupervised Models with K-Means Clustering, shows you how to build machine learning models with unlabeled data and make predictions at the observation level using K-means clustering.

Chapter 9, Deep Learning with Redshift ML, covers the use of deep learning in Amazon Redshift ML using the MLP model type for data that is not linearly separable.

Chapter 10, Creating Custom ML Model with XGBoost, shows you how to use the Auto Off option of Amazon Redshift ML to prepare data in order to build a custom model.

Chapter 11, Bring Your Own Models for In-Database Inference, goes beyond Redshift ML models. Up to this point in the book, we will have run inference queries only on models built directly in Amazon Redshift ML. This chapter shows how you can leverage models built outside of Amazon Redshift ML and execute inference queries inside Amazon Redshift ML.

Chapter 12, Time-Series Forecasting in Your Data Warehouse, dives into forecasting and time-series data using the integration of Amazon Forecast with Amazon Redshift ML.

Chapter 13, Operationalizing and Optimizing Amazon Redshift ML Models, concludes the book by showing techniques to refresh your model, create versions of your models, and optimize your Amazon Redshift ML models.

To get the most out of this book

You will need access to an AWS account to perform code examples in this book. You will need either to have administrator access or to work with an administrator to create a Redshift Serverless data warehouse and the IAM user, roles, and policies used in this book.

Software/hardware covered in the book

Operating system requirements

The AWS CLI (optional)

Windows, macOS, or Linux

If you are using the digital version of this book, we advise you to type the code yourself or access the code from the book’s GitHub repository (a link is available in the next section). Doing so will help you avoid any potential errors related to the copying and pasting of code.

Download the example code files

You can download the example code files for this book from GitHub at https://github.com/PacktPublishing/Serverless-Machine-Learning-with-Amazon-Redshift. If there’s an update to the code, it will be updated in the GitHub repository.

We also have other code bundles from our rich catalog of books and videos available at https://github.com/PacktPublishing/. Check them out!

Conventions used

There are a number of text conventions used throughout this book.

Code in text: Indicates code words in text, database table names, folder names, filenames, file extensions, pathnames, dummy URLs, user input, and Twitter handles. Here is an example: “Mount the downloaded WebStorm-10*.dmg disk image file as another disk in your system.”

A block of code is set as follows:

cnt = client.execute_statement(Database='dev',
    Sql='Select count(1) from chapter2.orders;',
    WorkgroupName=REDSHIFT_WORKGROUP)
query_id = cnt["Id"]

When we wish to draw your attention to a particular part of a code block, the relevant lines or items are set in bold:

SHOW MODEL chapter5_buildfirstmodel.customer_churn_model;

Any command-line input or output is written as follows:

$ pip install pandas

Bold: Indicates a new term, an important word, or words that you see onscreen. For instance, words in menus or dialog boxes appear in bold. Here is an example: “Select System info from the Administration panel.”

Tips or important notes

Appear like this.

Get in touch

Feedback from our readers is always welcome.

General feedback: If you have questions about any aspect of this book, email us at customercare@packtpub.com and mention the book title in the subject of your message.

Errata: Although we have taken every care to ensure the accuracy of our content, mistakes do happen. If you have found a mistake in this book, we would be grateful if you would report this to us. Please visit www.packtpub.com/support/errata and fill in the form.

Piracy: If you come across any illegal copies of our works in any form on the internet, we would be grateful if you would provide us with the location address or website name. Please contact us at copyright@packt.com with a link to the material.

If you are interested in becoming an author: If there is a topic that you have expertise in and you are interested in either writing or contributing to a book, please visit authors.packtpub.com.

Share Your Thoughts

Once you’ve read Serverless Machine Learning with Amazon Redshift ML, we’d love to hear your thoughts! Please click here to go straight to the Amazon review page for this book and share your feedback.

Your review is important to us and the tech community and will help us make sure we’re delivering excellent quality content.

Download a free PDF copy of this book

Thanks for purchasing this book!

Do you like to read on the go but are unable to carry your print books everywhere? Is your eBook purchase not compatible with the device of your choice?

Don’t worry, now with every Packt book you get a DRM-free PDF version of that book at no cost.

Read anywhere, any place, on any device. Search, copy, and paste code from your favorite technical books directly into your application.

The perks don’t stop there, you can get exclusive access to discounts, newsletters, and great free content in your inbox daily

Follow these simple steps to get the benefits:

  1. Scan the QR code or visit the link below

https://packt.link/free-ebook/978-1-80461-928-5

  1. Submit your proof of purchase
  2. That’s it! We’ll send your free PDF and other benefits to your email directly
lock icon
The rest of the chapter is locked
You have been reading a chapter from
Serverless Machine Learning with Amazon Redshift ML
Published in: Aug 2023Publisher: PacktISBN-13: 9781804619285
Register for a free Packt account to unlock a world of extra content!
A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.
undefined
Unlock this book and the full library FREE for 7 days
Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of
Renews at $15.99/month. Cancel anytime

Authors (4)

author image
Debu Panda

Debu Panda, a Senior Manager, Product Management at AWS, is an industry leader in analytics, application platform, and database technologies, and has more than 25 years of experience in the IT world. Debu has published numerous articles on analytics, enterprise Java, and databases and has presented at multiple conferences such as re:Invent, Oracle Open World, and Java One. He is lead author of the EJB 3 in Action (Manning Publications 2007, 2014) and Middleware Management (Packt, 2009).
Read more about Debu Panda

author image
Phil Bates

Phil Bates is a Senior Analytics Specialist Solutions Architect at AWS. He has more than 25 years of experience implementing large-scale data warehouse solutions. He is passionate about helping customers through their cloud journey and leveraging the power of ML within their data warehouse.
Read more about Phil Bates

author image
Bhanu Pittampally

Bhanu Pittampally is Analytics Specialist Solutions Architect at Amazon Web Services. His background is in data and analytics and is in the field for over 16 years. He currently lives in Frisco, TX with his wife Kavitha and daughters Vibha and Medha.
Read more about Bhanu Pittampally

author image
Sumeet Joshi

Sumeet Joshi is an Analytics Specialist Solutions Architect based out of New York. He specializes in building large-scale data warehousing solutions. He has over 17 years of experience in the data warehousing and analytical space.
Read more about Sumeet Joshi