You're reading from Automated Machine Learning with Microsoft Azure Build highly accurate and scalable end-to-end AI solutions with Azure AutoML

Product type Paperback

Published in Apr 2021

Last Updated in Feb 2025

Publisher Packt

ISBN-13 9781800565319

Length 340 pages

Edition 1st Edition

Languages

Python

Tools

Azure

Concepts

Machine Learning

Authors (2):

Dennis Michael Sawyers

Dennis Sawyers

View More author details

Table of Contents (17) Chapters

Preface

1. Section 1: AutoML Explained – Why, What, and How

2. Chapter 1: Introducing AutoML FREE CHAPTER

3. Chapter 2: Getting Started with Azure Machine Learning Service

4. Chapter 3: Training Your First AutoML Model

5. Section 2: AutoML for Regression, Classification, and Forecasting – A Step-by-Step Guide

6. Chapter 4: Building an AutoML Regression Solution

7. Chapter 5: Building an AutoML Classification Solution

8. Chapter 6: Building an AutoML Forecasting Solution

9. Chapter 7: Using the Many Models Solution Accelerator

10. Section 3: AutoML in Production – Automating Real-Time and Batch Scoring Solutions

11. Chapter 8: Choosing Real-Time versus Batch Scoring

12. Chapter 9: Implementing a Batch Scoring Solution

13. Chapter 10: Creating End-to-End AutoML Solutions

14. Chapter 11: Implementing a Real-Time Scoring Solution

15. Chapter 12: Realizing Business Value with AutoML

16. Other Books You May Enjoy

Creating a parallel scoring pipeline

Standard ML pipelines work just fine for the majority of ML use cases, but when you need to score a large amount of data at once, you need a more powerful solution. That's where ParallelRunStep comes in. ParallelRunStep is Azure's answer to scoring big data in batch. When you use ParallelRunStep, you leverage all of the cores on your compute cluster simultaneously.

Say you have a compute cluster consisting of eight Standard_DS3_v2 virtual machines. Each Standard_DS3_v2 node has four cores, so you can perform 32 parallel scoring processes at once. This parallelization essentially lets you score data many times faster than if you used a single machine. Furthermore, it can easily scale vertically (increasing the size of each virtual machine in the cluster) and horizontally (increasing the node count).

This section will allow you to become a big data scientist who can score large batches of data. Here, you will again be using simulated...