You're reading from Azure Data Scientist Associate Certification Guide

Product typeBook

Published inDec 2021

Reading LevelBeginner

PublisherPackt

ISBN-139781800565005

Edition1st Edition

Languages

Python

Tools

Azure Functions

Concepts

Machine Learning

Authors (2):

Andreas Botsikas

Michael Hlobil

View More author details

Chapter 9: Optimizing the ML Model

In this chapter, you will learn about two techniques you can use to discover the optimal model for your dataset. You will start by exploring the HyperDrive package of the AzureML SDK. This package allows you to fine-tune the model's performance by tweaking the parameters it exposes, a process also known as hyperparameter tuning. You will then explore the Automated ML (AutoML) package of the AzureML SDK, which allows you to automate the model selection, training, and optimization process through code.

In this chapter, we are going to cover the following main topics:

Hyperparameter tuning using HyperDrive
Running AutoML experiments with code

Technical requirements

You will need to have access to an Azure subscription. Within that subscription, you will need a resource group named packt-azureml-rg. You will need to have either a Contributor or Owner Access control (IAM) role on the resource group level. Within that resource group, you should have already deployed a machine learning resource named packt-learning-mlw, as described in Chapter 2, Deploying Azure Machine Learning Workspace Resources.

You will also need to have a basic understanding of the Python language. The code snippets target Python version 3.6 or newer. You should also be familiar with working in the notebook experience within AzureML Studio, something that was covered in Chapter 8, Experimenting with Python Code.

This chapter assumes you have registered the scikit-learn diabetes dataset in your AzureML workspace and that you have created a compute cluster named cpu-sm-cluster, as described in the sections Defining datastores, Working with datasets...

Hyperparameter tuning using HyperDrive

In Chapter 8, Experimenting with Python Code, you trained a LassoLars model that was accepting the alpha parameter. In order to avoid overfitting to the training dataset, the LassoLars model uses a technique called regularization, which basically introduces a penalty term within the optimization formula of the model. You can think of this technique as if the linear regression that we are trying to fit consists of a normal linear function that is being fitted with the least-squares function plus this penalty term. The alpha parameter specifies how important this penalty term is, something that directly impacts the training outcome. Parameters that affect the training process are referred to as being hyperparameters. To understand better what a hyperparameter is, we are going to explore the hyperparameters of a decision tree. In a decision tree classifier model, like the DecisionTreeClassifier class located in the scikit-learn library, you can define...

Running AutoML experiments with code

So far, in this chapter, you were fine-tuning a LassoLars model, performing a hyperparameter tuning process to identify the best value for the alpha parameter based on the training data. In this section, you will use AutoML in the AzureML SDK to automatically select the best combination of data preprocessing, model, and hyperparameter settings for your training dataset.

To configure an AutoML experiment through the AzureML SDK, you will need to configure an AutoMLConfig object. You will need to define the Task type, the Metric, the Training data, and the Compute budget you want to invest. The output of this process is a list of models from which you can select the best run and the best model associated with that run, as shown in Figure 9.11:

Figure 9.11 – AutoML process

Depending on the type of problem you are trying to model, you must select the task parameter, selecting either classification, regression, or...

Summary

In this chapter, you explored the most-used approaches in optimizing a specific model to perform well against a dataset and how you can even automate the process of model selection. You started by performing parallelized hyperparameter tuning using the HyperDriveConfig class to optimize the alpha parameter of the LassoLars model you have been training against the diabetes dataset. Then, you automated the model selection, using AutoML to detect the best combination of algorithms and parameters that predicts the target column of the diabetes dataset.

In the next chapter, you will build on top of this knowledge, learning how to use the AzureML SDK to interpret the model results.

Questions

You want to get the best model trained by an AutoML run. Which code is correct?
a. model = run.get_output()[0]
b. model = run.get_output()[1]
c. model = run.get_outputs()[0]
d. model = run.get_outputs()[1]
You want to run a forecasting AutoML experiment on top of data you receive from a sensor. You receive one record every day from the sensor. You want to be able to predict the values for 5 days. Which of the following parameters should you pass to the ForecastingParameters class?
a. forecast_horizon = 5 * 1
b. forecast_horizon = 5 * 24
c. forecast_horizon = 5 * 12

The HyperDriveConfig class: https://docs.microsoft.com/en-us/python/api/azureml-train-core/azureml.train.hyperdrive.hyperdriveconfig?view=azure-ml-py
The AutoMLConfig class: https://docs.microsoft.com/en-us/Python/api/azureml-train-automl-client/azureml.train.automl.automlconfig.automlconfig
Data featurization in automated machine learning: https://docs.microsoft.com/en-us/azure/machine-learning/how-to-configure-auto-features
Auto-train a forecast model: https://docs.microsoft.com/en-us/azure/machine-learning/how-to-auto-train-forecast
Reference to the diabetes dataset that was loaded from the scikit-learn library: https://scikit-learn.org/stable/modules/generated/sklearn.datasets.load_diabetes.html

The rest of the chapter is locked

You have been reading a chapter from

Azure Data Scientist Associate Certification Guide

Published in: Dec 2021Publisher: PacktISBN-13: 9781800565005

A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.

undefined

Unlock this book and the full library FREE for 7 days

Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of

Start free trial

Renews at $15.99/month. Cancel anytime

Authors (2)

Andreas Botsikas

Andreas Botsikas is an experienced advisor working in the software industry. He has worked in the finance sector, leading highly efficient DevOps teams, and architecting and building high-volume transactional systems. He then traveled the world, building AI-infused solutions with a group of engineers and data scientists. Currently, he works as a trusted advisor for customers onboarding into Azure, de-risking and accelerating their cloud journey. He is a strong engineering professional with a Doctor of Philosophy (Ph.D.) in resource optimization with artificial intelligence from the National Technical University of Athens.
Read more about Andreas Botsikas

Michael Hlobil

Michael Hlobil is an experienced architect focused on quickly understanding customers' business needs, with over 25 years of experience in IT pitfalls and successful projects, and is dedicated to creating solutions based on the Microsoft Platform. He has an MBA in Computer Science and Economics (from the Technical University and the University of Vienna) and an MSc (from the ESBA) in Systemic Coaching. He was working on advanced analytics projects in the last decade, including massive parallel systems and Machine Learning systems. He enjoys working with customers and supporting the journey to the cloud.
Read more about Michael Hlobil

Personalised recommendations for you

Based on your interests and search pattern

Et al.

Ever wonder why speech recognition systems don't understand the Scottish accent, or what would happen if an astronaut only ate mac 'n' cheese, or other spurious reflections you'd have at a bar? We did, then collated those deliberations into absurd research articles with fake figures and methodologies inspired by even more fictionally absurd studies.

BookAug 2023230 pages5

Generative AI with LangChain

This book is a comprehensive introduction to LLMs and LangChain, demystifying the basic mechanics of LangChain, its functionalities, and the myriad of applications it can be integrated into.

BookDec 2023360 pages4

Generative AI with LangChain

This book is a comprehensive introduction to LLMs and LangChain, demystifying the basic mechanics of LangChain, its functionalities, and the myriad of applications it can be integrated into.

BookDec 2023360 pages5

Generative AI with LangChain

This book is a comprehensive introduction to LLMs and LangChain, demystifying the basic mechanics of LangChain, its functionalities, and the myriad of applications it can be integrated into.

BookDec 2023360 pages1

Generative AI with LangChain

This book is a comprehensive introduction to LLMs and LangChain, demystifying the basic mechanics of LangChain, its functionalities, and the myriad of applications it can be integrated into.

BookDec 2023360 pages5

Mastering Tableau 2023

This book is a comprehensive resource to mastering your Tableau skills and becoming a BI expert. As you progress, you will learn how to build advanced dashboards and improve your storytelling to derive key business insight, as well as make you well-versed with advanced functionalities of Tableau in the business intelligence domain.

BookAug 2023684 pages

Building AI Applications with ChatGPT APIs

This guide covers all ChatGPT API features for effortless creation of robust AI powered apps. With its help, you’ll be able to leverage ChatGPT’s cutting-edge NLP models to take your app development skills to the next level. You’ll also work on ten exciting projects that will give you the practical know-how that you can apply to your existing applications.

BookSep 2023258 pages5

Building AI Applications with ChatGPT APIs

This guide covers all ChatGPT API features for effortless creation of robust AI powered apps. With its help, you’ll be able to leverage ChatGPT’s cutting-edge NLP models to take your app development skills to the next level. You’ll also work on ten exciting projects that will give you the practical know-how that you can apply to your existing applications.

BookSep 2023258 pages2

Data Engineering with AWS

Embark on a journey to master data engineering pipelines on AWS! Our book offers a hands-on experience of AWS services for ingesting, transforming, and consuming data. Whether you're an absolute beginner or someone with basic data engineering experience, this guide is an indispensable resource.

BookOct 2023636 pages5

Modern Data Architecture on AWS

Every organization wants an agile, performant, and cost-effective data platform that meets all their current and future business needs. Purpose-built AWS analytics services and their features play a big part in building such a modern data platform. This book brings to you all the design and architectural patterns that’ll help you achieve this goal.

BookAug 2023420 pages5

Practical Guide to Applied Conformal Prediction in Python

Discover the power of Conformal Prediction with the "Practical Guide to Applied Conformal Prediction in Python." Master the latest techniques to quantify uncertainty in machine learning and computer vision models, and seamlessly apply them to your industry applications.

BookDec 2023240 pages

TinyML Cookbook

With over 70 project-based recipes, the TinyML Cookbook is a practical guide that will help you to get the most out of your microcontrollers. It provides a comprehensive understanding of the theoretical foundations while giving you hands-on experience training ML models for deployment on Arduino Nano 33 BLE Sense, Raspberry Pi Pico, and SparkFun RedBoard Artemis Nano microcontrollers.

BookNov 2023664 pages

You're reading from Azure Data Scientist Associate Certification Guide

Chapter 9: Optimizing the ML Model

Technical requirements

Hyperparameter tuning using HyperDrive

Running AutoML experiments with code

Summary

Questions

Further reading

Unlock this book and the full library FREE for 7 days

Authors (2)

Et al.

Generative AI with LangChain

This book is a comprehensive introduction to LLMs and LangChain, demystifying the basic mechanics of LangChain, its functionalities, and the myriad of applications it can be integrated into.

Generative AI with LangChain

This book is a comprehensive introduction to LLMs and LangChain, demystifying the basic mechanics of LangChain, its functionalities, and the myriad of applications it can be integrated into.

Generative AI with LangChain

This book is a comprehensive introduction to LLMs and LangChain, demystifying the basic mechanics of LangChain, its functionalities, and the myriad of applications it can be integrated into.

Generative AI with LangChain

This book is a comprehensive introduction to LLMs and LangChain, demystifying the basic mechanics of LangChain, its functionalities, and the myriad of applications it can be integrated into.

Mastering Tableau 2023

Building AI Applications with ChatGPT APIs

Building AI Applications with ChatGPT APIs

Data Engineering with AWS

Embark on a journey to master data engineering pipelines on AWS! Our book offers a hands-on experience of AWS services for ingesting, transforming, and consuming data. Whether you're an absolute beginner or someone with basic data engineering experience, this guide is an indispensable resource.

Modern Data Architecture on AWS

Practical Guide to Applied Conformal Prediction in Python

Discover the power of Conformal Prediction with the "Practical Guide to Applied Conformal Prediction in Python." Master the latest techniques to quantify uncertainty in machine learning and computer vision models, and seamlessly apply them to your industry applications.

TinyML Cookbook