Reader small image

You're reading from  Automated Machine Learning with Microsoft Azure

Product typeBook
Published inApr 2021
PublisherPackt
ISBN-139781800565319
Edition1st Edition
Right arrow
Author (1)
Dennis Michael Sawyers
Dennis Michael Sawyers
author image
Dennis Michael Sawyers

Dennis Michael Sawyers is a senior cloud solutions architect (CSA) at Microsoft, specializing in data and AI. In his role as a CSA, he helps Fortune 500 companies leverage Microsoft Azure cloud technology to build top-class machine learning and AI solutions. Prior to his role at Microsoft, he was a data scientist at Ford Motor Company in Global Data Insight and Analytics (GDIA) and a researcher in anomaly detection at the highly regarded Carnegie Mellon Auton Lab. He received a master's degree in data analytics from Carnegie Mellon's Heinz College and a bachelor's degree from the University of Michigan. More than anything, Dennis is passionate about democratizing AI solutions through automated machine learning technology.
Read more about Dennis Michael Sawyers

Right arrow

Preface

Automated Machine Learning with Microsoft Azure will help you build high-performing, accurate machine learning models in record time. It allows anyone to easily harness the power of artificial intelligence and increase the productivity and profitability of their business. With a series of clicks on a graphical user interface (GUI), novices and seasoned data scientists alike can easily train and deploy machine learning solutions to production.

This book will teach you how to use Azure AutoML with both the GUI and the Azure Machine Learning Python SDK in a careful, step-by-step fashion. First, you'll learn how to prepare data, train models, and register them to your Azure Machine Learning workspace. Then, you'll learn how to take those models and use them to create both automated batch solutions using machine learning pipelines and real-time scoring solutions using Azure Kubernetes Service (AKS).

By the time you finish Automated Machine Learning with Microsoft Azure, you will be able to use AutoML on your own data to not only train regression, classification, and forecasting models but also use them to solve a wide variety of business problems. You'll be able to show your business partners exactly how your machine learning models make predictions through automatically generated charts and graphs, earning their trust and respect.

Who this book is for

Data scientists, aspiring data scientists, machine learning engineers, and anyone interested in applying artificial intelligence or machine learning in their business will find this book useful. You need to have beginner-level knowledge of artificial intelligence and a technical background in computer science, statistics, or information technology before getting started with this machine learning book. Having a background in Python will help you implement this book's more advanced features, but even data analysts and SQL experts will be able to train machine learning models after finishing this book.

What this book covers

Chapter 1, Introducing AutoML, begins by explaining the current state of data science and artificial intelligence in industry and why so many companies are having such a hard time extracting value from data. It explains how data scientists work, why their processes are inherently slow, and why they need to be made quicker. Finally, it introduces AutoML as the solution to achieve the return on investment required by industry.

Chapter 2, Getting Started with Azure Machine Learning Service, goes into depth in explaining the different artifacts of Azure Machine Learning and how they integrate to form end-to-end machine learning solutions. You'll learn about datastores, datasets, compute instances, compute clusters, environments, and experiments, and how you use them to create machine learning solutions on Azure.

Chapter 3, Training Your First AutoML Model, will have you create your first AutoML model using publicly available Titanic data. You will use the Azure Machine Learning Studio GUI to upload your data into your workspace, create a dataset, and run an AutoML classification job to predict Titanic survivors. Lastly, you'll use AutoML's explainability features to see which factors were most vital to predicting survival.

Chapter 4, Building an AutoML Regression Solution, will help you train an AutoML regression model using the Azure Machine Learning SDK in Python. You'll learn how to access Jupyter notebooks within Azure Machine Learning, use compute clusters for remote training on the cloud, and create an AutoML model that predicts a number. By the end of this chapter, you will be able to replicate this work for any regression problem you have in the future.

Chapter 5, Building an AutoML Classification Solution, will help you train an AutoML classification model using the Azure Machine Learning SDK in Python in two ways. First, you'll train a binary classification model to predict one of two categories. Then, you will train a multiclass classification model to predict one of three categories. By the end of this chapter, you'll be an expert in training all types of classification models with AutoML.

Chapter 6, Building an AutoML Forecasting Solution, looks at forecasting, one of the most common machine learning problems and one of the hardest to master. In this chapter, you'll learn how to code a forecasting solution with AutoML, making use of advanced forecasting-specific algorithms and features. You'll learn the ins and outs of forecasting and be able to avoid many of the common mistakes people make while forecasting.

Chapter 7, Using the Many Models Solution Accelerator, expands on how the Many Models Solution Accelerator (MMSA) is a cutting-edge Azure technology that lets companies train hundreds of thousands of models quickly and easily. Here, you will learn how to access the MMSA and adapt it to your own problems. This is a powerful code-only solution aimed at seasoned data scientists, but even novices will be able to use it by the end of this chapter.

Chapter 8, Choosing Real-Time versus Batch Scoring, explores how real-time solutions and batch solutions represent the two ways to score machine learning models. This chapter delves into common business scenarios and explains how you should choose which type of solution to create. The end of this chapter features a quiz that will test your ability to match business problems to the correct type of solution, saving you time and money.

Chapter 9, Implementing a Batch Scoring Solution, emphasizes how machine learning pipelines are Azure Machine Learning's batch scoring solution of choice. Machine learning pipelines are containerized code where, once you create them, you can easily rerun and schedule them on an automated basis. This chapter has you use the AutoML models you created in earlier chapters to create powerful batch scoring solutions that run on a schedule of your choice.

Chapter 10, Creating End-to-End AutoML Solutions, emphasizes how Azure Data Factory (ADF) is a code-free data orchestration tool that integrates easily with machine learning pipelines. In this chapter, you'll learn how to seamlessly move data into and out of Azure, and how to integrate that flow with your scoring pipelines. By the end of this chapter, you will understand how ADF and AMLS combine to create the ultimate data science experience.

Chapter 11, Implementing a Real-Time Scoring Solution, teaches you how to create real-time scoring endpoints hosted on AKS and Azure Container Instances (ACI). You'll learn how to deploy AutoML models to an endpoint with a single click from the Azure Machine Learning Studio GUI as well as through Python code in a Jupyter notebook, completing your AutoML training.

Chapter 12, Realizing Business Value with AutoML, focuses on how creating an end-to-end solution is just the first step in realizing business value; you'll also need to gain end user trust. This chapter focuses on how to gain this trust through architectural diagrams, model interpretability, and presenting results in an intuitive, easy-to-understand manner. You'll learn how to become and be seen as a trusted, reliable partner to your business.

To get the most out of this book

You will need to have the following requirements:

In order to use Automated Machine Learning with Microsoft Azure, you will need a working internet connection. We recommend either Microsoft Edge or Google Chrome to have the best experience with the Azure portal. Furthermore, you will be required to create an Azure account (at no cost) if you do not already have one.

If you are using the digital version of this book, we advise you to type the code yourself or access the code via the GitHub repository (link available in the next section). Doing so will help you avoid any potential errors related to the copying and pasting of code.

As you work through the book, please feel free to try AutoML with your own data. It helps greatly in your learning experience to solve problems that interest you. At the end of each chapter, try adapting your own datasets to the example code.

Download the example code files

You can download the example code files for this book from GitHub at https://github.com/PacktPublishing/Automated-Machine-Learning-with-Microsoft-Azure. In case there's an update to the code, it will be updated on the existing GitHub repository.

We also have other code bundles from our rich catalog of books and videos available at https://github.com/PacktPublishing/. Check them out!

Download the color images

We also provide a PDF file that has color images of the screenshots/diagrams used in this book. You can download it here: https://static.packt-cdn.com/downloads/9781800565319_ColorImages.pdf.

Conventions used

There are a number of text conventions used throughout this book.

Code in text: Indicates code words in text, database table names, folder names, filenames, file extensions, pathnames, dummy URLs, user input, and Twitter handles. Here is an example: "You have another helper function here, get_forecasting_output."

A block of code is set as follows:

from azureml.core import Workspace, Dataset, Datastore
from azureml.core import Experiment
from azureml.core.compute import ComputeTarget
from azureml.train.automl import AutoMLConfig
from azureml.train.automl.run import AutoMLRun
from azureml.widgets import RunDetails

Any command-line input or output is written as follows:

from azureml.pipeline.core import PipelineRun
experiment = Experiment(ws, 'your-experiment_name')
pipeline_run = PipelineRun(experiment, 'your-pipeline-run-id')

Bold: Indicates a new term, an important word, or words that you see onscreen. For example, words in menus or dialog boxes appear in the text like this. Here is an example: "Go to Experiments under Assets in Azure Machine Learning Studio, click your experiment name, select your run ID, click the Models tab, select the highest-performing algorithm, and click the Metrics tab."

Tips or important notes

Appear like this.

Get in touch

Feedback from our readers is always welcome.

General feedback: If you have questions about any aspect of this book, mention the book title in the subject of your message and email us at customercare@packtpub.com.

Errata: Although we have taken every care to ensure the accuracy of our content, mistakes do happen. If you have found a mistake in this book, we would be grateful if you would report this to us. Please visit www.packtpub.com/support/errata, selecting your book, clicking on the Errata Submission Form link, and entering the details.

Piracy: If you come across any illegal copies of our works in any form on the Internet, we would be grateful if you would provide us with the location address or website name. Please contact us at copyright@packt.com with a link to the material.

If you are interested in becoming an author: If there is a topic that you have expertise in and you are interested in either writing or contributing to a book, please visit authors.packtpub.com.

Reviews

Please leave a review. Once you have read and used this book, why not leave a review on the site that you purchased it from? Potential readers can then see and use your unbiased opinion to make purchase decisions, we at Packt can understand what you think about our products, and our authors can see your feedback on their book. Thank you!

For more information about Packt, please visit packt.com.

lock icon
The rest of the chapter is locked
You have been reading a chapter from
Automated Machine Learning with Microsoft Azure
Published in: Apr 2021Publisher: PacktISBN-13: 9781800565319
Register for a free Packt account to unlock a world of extra content!
A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.
undefined
Unlock this book and the full library FREE for 7 days
Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of
Renews at $15.99/month. Cancel anytime

Author (1)

author image
Dennis Michael Sawyers

Dennis Michael Sawyers is a senior cloud solutions architect (CSA) at Microsoft, specializing in data and AI. In his role as a CSA, he helps Fortune 500 companies leverage Microsoft Azure cloud technology to build top-class machine learning and AI solutions. Prior to his role at Microsoft, he was a data scientist at Ford Motor Company in Global Data Insight and Analytics (GDIA) and a researcher in anomaly detection at the highly regarded Carnegie Mellon Auton Lab. He received a master's degree in data analytics from Carnegie Mellon's Heinz College and a bachelor's degree from the University of Michigan. More than anything, Dennis is passionate about democratizing AI solutions through automated machine learning technology.
Read more about Dennis Michael Sawyers