You're reading from Automated Machine Learning with Microsoft Azure

Product typeBook

Published inApr 2021

PublisherPackt

ISBN-139781800565319

Edition1st Edition

Tools

Azure Functions

Concepts

Machine Learning

Author (1)

Dennis Michael Sawyers

Chapter 7: Using the Many Models Solution Accelerator

Now that you have experienced building regression, classification, and forecasting models with AutoML, it's time for you to learn how to deploy and utilize those models in actual business scenarios. Before you tackle this, however, we will first introduce you to a new, very powerful solution, that is, the Many Models Solution Accelerator (MMSA).

The MMSA lets you build hundreds to thousands of machine learning (ML) models at once and easily scales to hundreds of thousands of models. It's an advanced technology at the cutting edge of ML. Not only can you build hundreds of thousands of models, but you can also use the MMSA to easily deploy them into production.

In this chapter, you will begin by installing the accelerator and understanding the various use cases to which it applies. You will then run the three sections of the accelerator notebook-by-notebook: prepping data, training models, and forecasting new data...

Technical requirements

Within this chapter, you will log in to your Azure Machine Learning studio (AMLS), open up a Jupyter notebook on a compute instance, and install the MMSA from its location on GitHub. You will then run all three pieces of the MMSA sequentially, prepping the data, training the models remotely, and forecasting the data. As such, you need to have an Azure account, a compute instance for writing Python code, and a compute cluster for remote training. The full list of requirements is as follows:

Access to the internet.
A web browser, preferably Google Chrome or Microsoft Edge Chromium.
A Microsoft Azure account.
You should have created an AMLS workspace.
You should have created a compute instance in Chapter 2, Getting Started with Azure Machine Learning Service.
You should have created the compute cluster in Chapter 2, Getting Started with Azure Machine Learning Service.
You should understand how to navigate to the Jupyter environment...

Installing the many models solution accelerator

The MMSA was built by Microsoft in 2019 to address the needs of a growing number of customers who wanted to train hundreds of thousands of similar ML models simultaneously. This is particularly important for product demand forecasting, where you are trying to make forecasts for many different products at many different locations.

The impetus for the accelerator is model accuracy. While you could train a single model to predict product demand across all of your product lines and all of your stores, you will find that training individual models for each combination of product and store tends to yield superior performance. This is because a multitude of factors are dependent on both your algorithm and your data. It can be very difficult for some algorithms to find meaningful patterns when you're dealing with hundreds of thousands of different products distributed across the globe.

Additionally, the same columns can have different...

Prepping data for many models

While training thousands of ML models simultaneously sounds complicated, the MMSA makes it easy. The example included in the notebooks uses the OJ Sales data you used in Chapter 6, Building an AutoML Forecasting Solution. You will prepare the data simply by opening and running 01_Data_Preparation.ipynb. By reading the instructions carefully step by step and working through the notebook slowly, you will be able to understand what each section is about.

Once you're able to understand what each section is doing and you have the OJ Sales data loaded, you will be able to load the new dataset into your Jupyter notebook. This way, by the end of this section, you will be able to load your own data into Azure, modify it for the MMSA, and master the ability to use this powerful solution.

Prepping the sample OJ dataset

To understand how the first notebook works, follow these instructions in order:

Open 01_Data_Preparation.ipynb.
Run all...

Training many models simultaneously

Like prepping data for many models, training many models is simply a matter of navigating to the correct notebook and running the cells. There's no custom code required, and you are simply required to change a few settings.

Like prepping data, you will first run the notebook step by step to carefully understand how it works. Once you have that understanding, you will then create a new notebook with code that uses the datasets you made from the sample data. This will benefit you tremendously, as you will understand exactly which parts of the code you need to change to facilitate your own projects.

Training the sample OJ dataset

To train many models using the OJ data and to understand the underlying process, follow these instructions step by step:

From the solution-accelerator-many-models folder, click on the Automated_ML folder.
From the Automated_ML folder, click on the 02_AutoML_Training_Pipeline folder.
Open 02_AutoML_Training_Pipeline...

Scoring new data for many models

Scoring new data with the MMSA is a fairly straightforward task. Once you have your models trained, simply navigate to the correct notebook, change your variables to match your training notebook, and click the run button. As there are very few settings to alter compared to the training notebook, it's even easier to use with your own code.

In this section, like the others, first you will run the out-of-the-box scoring notebook with OJ Sales. Then, you will create a new notebook to score the sample data.

Scoring OJ sales data with the MMSA

To score OJ Sales data with the multiple models you've trained, follow these steps:

From the solution-accelerator-many-models folder, open the Automated_ML folder.
From the Automated_ML folder, open the 03_AutoML_Forecasting_Pipeline folder.
Open 03_AutoML_Forecasting_Pipeline.ipynb.
Run all of the cells in section 1.0. These cells set up your AMLS workspace, compute cluster...

Improving your many models results

Now that you have adapted all three of the notebooks to run your own code, you should be feeling pretty confident in your ability to use the MMSA. Still, it's pretty easy to get stuck. Many models is a complicated framework and small errors in your data can lead to errors.

Additionally, sometimes it's really hard to know what your data will look like when you are dealing with thousands of files you wish to train. Here is some good advice to follow in order to ensure you do not come to an impasse when using your own data with the MMSA:

Before using the accelerator, always try creating a single model first with your entire dataset. Check the performance of your model. Only use the MMSA if the single model's performance is subpar compared to your expectations or in a situation where obtaining the best accuracy is mission-critical for your project. Sometimes, the trade-off between complexity and performance isn't worth...

Summary

Advanced solutions like the MMSA are at the bleeding edge of ML and AI. It is a truly state-of-the-art technology and now it's another tool in your belt.

You've not only run all three notebooks on the OJ Sales data, but you have also converted the code to take in other datasets and understand how it works. Prepping data, training models, and forecasting the future using the MMSA are all things you have done and could do again. You may already have a use case to which you can apply it, or you may have to wait a few more years until your company is ready, but you are prepared.

Chapter 8, Choosing Real-Time versus Batch Scoring, continues your journey at the forefront of the ML world. Once you build a model in AutoML, the next step is to deploy it, and there are two options: batch versus real-time scoring. You will learn when to use batch scoring, when to use real-time scoring, and the main differences between the two. Mastering these concepts is key to successfully...

The rest of the chapter is locked

You have been reading a chapter from

Automated Machine Learning with Microsoft Azure

Published in: Apr 2021Publisher: PacktISBN-13: 9781800565319

A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.

undefined

Unlock this book and the full library FREE for 7 days

Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of

Start free trial

Renews at $15.99/month. Cancel anytime

Author (1)

Dennis Michael Sawyers

Dennis Michael Sawyers is a senior cloud solutions architect (CSA) at Microsoft, specializing in data and AI. In his role as a CSA, he helps Fortune 500 companies leverage Microsoft Azure cloud technology to build top-class machine learning and AI solutions. Prior to his role at Microsoft, he was a data scientist at Ford Motor Company in Global Data Insight and Analytics (GDIA) and a researcher in anomaly detection at the highly regarded Carnegie Mellon Auton Lab. He received a master's degree in data analytics from Carnegie Mellon's Heinz College and a bachelor's degree from the University of Michigan. More than anything, Dennis is passionate about democratizing AI solutions through automated machine learning technology.
Read more about Dennis Michael Sawyers

Other recommended products

Related to this chapter

Azure Data Factory Cookbook

With the help of well-structured and practical recipes, this book will teach you how to integrate data from the cloud and on-premise. You’ll learn how to transform, clean, and consolidate data into a single data platform and get to grips with using ADF as the main ETL and orchestration tool for your data warehouse or data platform project.

BookDec 2020382 pages

Automated Machine Learning

This guide will help you to explore automated machine learning (AutoML), a rapidly growing subfield of machine learning. You’ll learn how you can use AutoML to fully automate the machine learning process even if you’re not an expert, and in turn increase your productivity drastically.

BookFeb 2021312 pages

Mastering Azure Machine Learning

This book will help you learn how to build a scalable end-to-end machine learning pipeline in Azure from experimentation and training to optimization and deployment. By the end of this book, you will learn to build complex distributed systems and scalable cloud infrastructure using powerful machine learning algorithms to compute insights.

BookApr 2020436 pages

Engineering MLOps

Get to grips with ML lifecycle management and MLOps implementation for your organization. This book will give you comprehensive insights into MLOps coupled with real-world examples in Azure that will teach you how to write programs, train robust and scalable ML models, and build ML pipelines to train, deploy, and monitor models securely in production.

BookApr 2021370 pages

Limitless Analytics with Azure Synapse

This book helps you understand the basic concepts and techniques of using Azure Synapse step-by-step. You'll gradually gain the skills you need to work with data and develop analytics solutions using the Azure analytics platform even with no prior knowledge of Azure.

BookJun 2021392 pages

Hands-On Data Warehousing with Azure Data Factory

Azure Data Factory (ADF) is a Microsoft Azure PaaS solution which supports data movement between many on premises and cloud data sources. This book covers custom tailored tutorials to help you develop , maintain and troubleshoot data movement processes and environments using Azure Data Factory V2 and SQL Server Integration Services 2017

BookMay 2018284 pages

Cloud Analytics with Microsoft Azure

Cloud Analytics with Microsoft Azure is an end-to-end guide to processing and analyzing big data using a range of Microsoft Azure features. This book covers everything you need to build your own data warehouse and learn numerous techniques to gain useful insights by analyzing big data.

BookNov 2019242 pages

Hands-On Machine Learning with Azure

This book will teach you how advanced machine learning can be performed in the cloud in a very cheap way. You will learn more about Azure ML processes as an enterprise-ready methodology. By the end of this book, you will implement machine learning and artificial intelligence concepts in your model to solve real-world problems.

BookOct 2018340 pages

Personalised recommendations for you

Based on your interests and search pattern

Et al.

Ever wonder why speech recognition systems don't understand the Scottish accent, or what would happen if an astronaut only ate mac 'n' cheese, or other spurious reflections you'd have at a bar? We did, then collated those deliberations into absurd research articles with fake figures and methodologies inspired by even more fictionally absurd studies.

BookAug 2023230 pages5

Generative AI with LangChain

This book is a comprehensive introduction to LLMs and LangChain, demystifying the basic mechanics of LangChain, its functionalities, and the myriad of applications it can be integrated into.

BookDec 2023360 pages4

Generative AI with LangChain

This book is a comprehensive introduction to LLMs and LangChain, demystifying the basic mechanics of LangChain, its functionalities, and the myriad of applications it can be integrated into.

BookDec 2023360 pages5

Generative AI with LangChain

This book is a comprehensive introduction to LLMs and LangChain, demystifying the basic mechanics of LangChain, its functionalities, and the myriad of applications it can be integrated into.

BookDec 2023360 pages1

Generative AI with LangChain

This book is a comprehensive introduction to LLMs and LangChain, demystifying the basic mechanics of LangChain, its functionalities, and the myriad of applications it can be integrated into.

BookDec 2023360 pages5

Mastering Tableau 2023

This book is a comprehensive resource to mastering your Tableau skills and becoming a BI expert. As you progress, you will learn how to build advanced dashboards and improve your storytelling to derive key business insight, as well as make you well-versed with advanced functionalities of Tableau in the business intelligence domain.

BookAug 2023684 pages

Building AI Applications with ChatGPT APIs

This guide covers all ChatGPT API features for effortless creation of robust AI powered apps. With its help, you’ll be able to leverage ChatGPT’s cutting-edge NLP models to take your app development skills to the next level. You’ll also work on ten exciting projects that will give you the practical know-how that you can apply to your existing applications.

BookSep 2023258 pages5

Building AI Applications with ChatGPT APIs

This guide covers all ChatGPT API features for effortless creation of robust AI powered apps. With its help, you’ll be able to leverage ChatGPT’s cutting-edge NLP models to take your app development skills to the next level. You’ll also work on ten exciting projects that will give you the practical know-how that you can apply to your existing applications.

BookSep 2023258 pages2

Data Engineering with AWS

Embark on a journey to master data engineering pipelines on AWS! Our book offers a hands-on experience of AWS services for ingesting, transforming, and consuming data. Whether you're an absolute beginner or someone with basic data engineering experience, this guide is an indispensable resource.

BookOct 2023636 pages5

Modern Data Architecture on AWS

Every organization wants an agile, performant, and cost-effective data platform that meets all their current and future business needs. Purpose-built AWS analytics services and their features play a big part in building such a modern data platform. This book brings to you all the design and architectural patterns that’ll help you achieve this goal.

BookAug 2023420 pages5

Practical Guide to Applied Conformal Prediction in Python

Discover the power of Conformal Prediction with the "Practical Guide to Applied Conformal Prediction in Python." Master the latest techniques to quantify uncertainty in machine learning and computer vision models, and seamlessly apply them to your industry applications.

BookDec 2023240 pages

TinyML Cookbook

With over 70 project-based recipes, the TinyML Cookbook is a practical guide that will help you to get the most out of your microcontrollers. It provides a comprehensive understanding of the theoretical foundations while giving you hands-on experience training ML models for deployment on Arduino Nano 33 BLE Sense, Raspberry Pi Pico, and SparkFun RedBoard Artemis Nano microcontrollers.

BookNov 2023664 pages