Model Optimizations – Hyperparameter Tuning and NAS

We have now become quite familiar with some of the Vertex AI offerings related to managing data, training no-code and low-code models, and launching large-scale custom model training jobs (with metadata tracking and monitoring capabilities). As ML practitioners, we know that it is highly unlikely that the first model we train will be the best one for a given use case and dataset. To find the best model (the most accurate and least biased one), we often apply model optimization techniques. Hyperparameter Tuning (HPT) and Neural Architecture Search (NAS) are two such techniques. In this chapter, we will learn how to configure and launch model optimization experiments using Vertex AI on Google Cloud.

In this chapter, we will first learn about the importance of model optimization techniques such as HPT and then learn how to quickly set up and launch HPT jobs within Google Vertex...

Technical requirements

The code examples shown in this chapter can be found in the following GitHub repo: https://github.com/PacktPublishing/The-Definitive-Guide-to-Google-Vertex-AI/tree/main/Chapter09

What is HPT and why is it important?

Hyperparameter tuning, or HPT for short, is a model optimization technique commonly used across ML projects. In this section, we will learn what hyperparameters are, why tuning them matters, and the different methods of finding the best hyperparameters for an ML algorithm.

What are hyperparameters?

When we train an ML system, we work with three kinds of values – input data, model parameters, and model hyperparameters. Input data refers to the training and test data associated with the problem we are solving. Model parameters are the variables that are adjusted during the training process to fit the training data. Model hyperparameters, on the other hand, are variables that govern the training process itself and are fixed before we start to train the model. For example, learning rate, optimizer, batch size, number of hidden layers in a neural network...
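
To make the distinction concrete, here is a minimal, illustrative sketch (not taken from the book's repository) using a small Keras model: the values we fix up front are hyperparameters, while the layer weights that fit() adjusts are the model parameters. The toy data and the specific values below are assumptions chosen only for illustration:

import numpy as np
import tensorflow as tf

# Hyperparameters: fixed *before* training and govern the training process.
LEARNING_RATE = 0.001   # step size used by the optimizer
BATCH_SIZE = 32         # number of examples per gradient update
HIDDEN_UNITS = 64       # width of the hidden layer
EPOCHS = 5

# Toy binary-classification data, just to make the example runnable.
x_train = np.random.rand(1000, 20).astype("float32")
y_train = (x_train.sum(axis=1) > 10).astype("float32")

model = tf.keras.Sequential([
    tf.keras.Input(shape=(20,)),
    tf.keras.layers.Dense(HIDDEN_UNITS, activation="relu"),
    tf.keras.layers.Dense(1, activation="sigmoid"),
])
model.compile(
    optimizer=tf.keras.optimizers.Adam(learning_rate=LEARNING_RATE),
    loss="binary_crossentropy",
    metrics=["accuracy"],
)

# Parameters: the layer weights and biases, adjusted automatically by fit().
model.fit(x_train, y_train, batch_size=BATCH_SIZE, epochs=EPOCHS, verbose=0)
print("Trainable parameters learned during training:", model.count_params())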

Setting up HPT jobs on Vertex AI

In this section, we will learn how to set up HPT jobs with Vertex AI. We will use the same neural network model experiment from Chapter 7, Training Fully Custom ML Models with Vertex AI, and optimize its hyperparameters to get the best model settings.

The first step is to create a new Jupyter Notebook in Vertex AI Workbench and import useful libraries:

import numpy as np
import glob
import matplotlib.pyplot as plt
import os
from datetime import datetime

# Vertex AI SDK and its hyperparameter tuning helpers
from google.cloud import aiplatform
from google.cloud.aiplatform import hyperparameter_tuning as hpt

# Timestamp used to give jobs and artifacts unique names
TIMESTAMP = datetime.now().strftime("%Y%m%d%H%M%S")
%matplotlib inline

Next, we set up project configurations:

PROJECT_ID='************'
REGION='us-west2'
SERVICE_ACCOUNT='417xxxxxxxxx7-compute@developer.gserviceaccount.com'
BUCKET_URI='gs://my-training-artifacts'

Then, we initialize the Vertex AI SDK:

aiplatform.init(project=PROJECT_ID...
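
Since the initialization call is truncated in this excerpt, the following is a hedged sketch of how the rest of this setup typically looks with the Vertex AI SDK. The staging bucket usage, container image URI, metric name, and parameter ranges below are placeholders and may differ from the book's actual code:

# Sketch of a typical initialization and HPT job definition; the values
# (container image, metric/parameter names, trial counts) are placeholders.
aiplatform.init(
    project=PROJECT_ID,
    location=REGION,
    staging_bucket=BUCKET_URI,
)

# Hyperparameters to tune and their search ranges.
parameter_spec = {
    "learning_rate": hpt.DoubleParameterSpec(min=1e-4, max=1e-1, scale="log"),
    "batch_size": hpt.DiscreteParameterSpec(values=[16, 32, 64, 128], scale="linear"),
}

# The metric the training script reports (via the hypertune library) and its goal.
metric_spec = {"accuracy": "maximize"}

# The underlying training job; the worker pool spec points at a training
# container that accepts --learning_rate and --batch_size as arguments.
custom_job = aiplatform.CustomJob(
    display_name="hpt-base-training-job",
    worker_pool_specs=[{
        "machine_spec": {"machine_type": "n1-standard-4"},
        "replica_count": 1,
        "container_spec": {"image_uri": "gcr.io/my-project/trainer:latest"},
    }],
    staging_bucket=BUCKET_URI,
)

hpt_job = aiplatform.HyperparameterTuningJob(
    display_name="hpt-job-" + TIMESTAMP,
    custom_job=custom_job,
    metric_spec=metric_spec,
    parameter_spec=parameter_spec,
    max_trial_count=20,
    parallel_trial_count=4,
)
hpt_job.run(service_account=SERVICE_ACCOUNT)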

What is NAS and how is it different from HPT?

Artificial Neural Networks (ANNs) are widely used today for solving complex ML problems. Most of the time, these network architectures are hand-designed by ML experts, and hand-designed architectures are not always optimal. Neural Architecture Search (NAS) is a technique that automates the design of neural network architectures and often produces networks that outperform hand-designed ones.

Although both HPT and NAS are model optimization techniques, they differ in how they work. HPT assumes a given architecture and focuses on optimizing the hyperparameters that lead to the best model – for example, the learning rate, optimizer, batch size, and activation function. NAS, on the other hand, focuses on optimizing architecture-specific parameters (in a way, it automates the process of designing a neural network architecture). NAS optimizes parameters such as the number of layers, number of units, types...
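
To make the contrast concrete, the following purely illustrative sketch (plain Python, not the Vertex AI NAS API) shows the kind of search space each technique explores; the specific knobs and values are hypothetical:

# HPT: training-process knobs for a *fixed* architecture.
hpt_search_space = {
    "learning_rate": [1e-4, 1e-3, 1e-2],
    "batch_size": [16, 32, 64],
    "optimizer": ["adam", "sgd"],
}

# NAS: the architecture itself – depth, layer widths, operation types, connections.
nas_search_space = {
    "num_layers": list(range(2, 10)),
    "units_per_layer": [32, 64, 128, 256],
    "layer_type": ["conv3x3", "conv5x5", "depthwise_conv", "identity"],
    "skip_connections": [True, False],
}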

NAS on Vertex AI overview

Vertex AI NAS is an optimization offering that can be leveraged to find the best neural network architecture for a given ML use case. NAS-based optimization searches for the best network in terms of accuracy, but the search can also be augmented with other constraints such as latency, memory, or a custom metric, as per the requirements. The search space of possible neural network architectures can be very large – Vertex AI NAS supports search spaces of up to around 10^20 candidate architectures. In the past few years, NAS has successfully generated some state-of-the-art computer vision architectures, including NASNet, MnasNet, EfficientNet, SpineNet, and NAS-FPN.

Although it may seem complex, Vertex AI NAS is quite flexible and easy to use. Beginners can leverage prebuilt search spaces, trainer scripts, and Jupyter notebooks to start exploring Vertex AI NAS on a custom dataset, while experts can develop custom trainer scripts, custom search...

Summary

In this chapter, we discussed the importance of applying model optimization techniques to get the best performance for our application. We learned about two model optimization methods – HPT and NAS, with their similarities and differences. We also learned how to set up and launch large-scale HPT jobs on Vertex AI with code examples. Additionally, we discussed some best practices to get the best out of both HPT and NAS.

After reading this chapter, you should have a fair understanding of the term “model optimization” and its importance while developing ML applications. Additionally, you should now be confident about quickly setting up small to large-scale hyperparameter tuning experiments with the help of Vertex AI tooling on Google Cloud. You should also have a fair understanding of NAS, its differences from HPT, and the best practices for setting up a NAS job.

Now that we understand the importance and common methods of model optimization techniques...
