
You're reading from Practical Deep Learning at Scale with MLflow

Product type: Book
Published in: Jul 2022
Publisher: Packt
ISBN-13: 9781803241333
Edition: 1st Edition
Author: Yong Liu

Yong Liu has been working in big data science, machine learning, and optimization since his doctoral student years at the University of Illinois at Urbana-Champaign (UIUC) and later as a senior research scientist and principal investigator at the National Center for Supercomputing Applications (NCSA), where he led data science R&D projects funded by the National Science Foundation and Microsoft Research. He then joined Microsoft and AI/ML start-ups in the industry. He has shipped ML and DL models to production and has been a speaker at the Spark/Data+AI summit and NLP summit. He has recently published peer-reviewed papers on deep learning, linked data, and knowledge-infused learning at various ACM/IEEE conferences and journals.

Chapter 2: Getting Started with MLflow for Deep Learning

One of MLflow's key capabilities is Machine Learning (ML) experiment management. This is critical because data science requires reproducibility and traceability, so that a Deep Learning (DL) model can be reproduced with the same data, code, and execution environment. This chapter will help us get started with implementing DL experiment management quickly. We will learn about MLflow's experiment management concepts and capabilities, set up an MLflow development environment, and complete our first DL experiment using MLflow. By the end of this chapter, we will have a working MLflow tracking server showing our first DL experiment results.

In this chapter, we're going to cover the following main topics:

  • Setting up MLflow
  • Implementing our first MLflow logging-enabled DL experiment
  • Exploring MLflow's components and usage patterns

Technical requirements

To complete the experiment in this chapter, we will need the following tools, libraries, and GitHub repositories installed or checked out on our computer:

Setting up MLflow

MLflow is an open source tool written primarily in Python, with over 10,000 stars on its GitHub repository (https://github.com/mlflow/mlflow). The benefits of using MLflow are numerous, but we can illustrate one with the following scenario: suppose you are starting a new ML project and evaluating different algorithms and model parameters. Within a few days, you run hundreds of experiments with many code changes across different ML/DL libraries, producing models with different parameters and accuracies. You need to compare which model works best and also allow your team members to reproduce the results for model review. Do you prepare a spreadsheet and write down the model names, parameters, accuracies, and model locations? How would someone else rerun your code, or use your trained model with a different evaluation dataset? This can quickly become unmanageable when you have lots of iterations...
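As a minimal sketch of a local setup (the `mlflow` PyPI package name and the default port 5000 are MLflow's standard ones; adjust versions and ports to your environment):

```shell
# Install MLflow into the current Python environment.
pip install mlflow

# Verify the installation.
mlflow --version

# To browse experiments in a local tracking UI, you would then run:
#     mlflow ui --port 5000
# and open http://localhost:5000; runs are stored under ./mlruns by default.
```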

Implementing our first DL experiment with MLflow autologging

Let's use the DL sentiment classifier we built in Chapter 1, Deep Learning Life Cycle and MLOps Challenges, and add MLflow autologging to it to explore MLflow's tracking capabilities:

  1. First, we need to import the MLflow module:
    import mlflow

This will provide MLflow Application Programming Interfaces (APIs) for logging and loading models.

  2. Just before we run the training code, we need to set the active experiment for the current running code using mlflow.set_experiment:
    EXPERIMENT_NAME = "dl_model_chapter02"
    mlflow.set_experiment(EXPERIMENT_NAME)
    experiment = mlflow.get_experiment_by_name(EXPERIMENT_NAME)
    print("experiment_id:", experiment.experiment_id)

This sets an experiment named dl_model_chapter02 to be the current active experiment. If this experiment does not exist in your current tracking server, it will be created automatically.

Environment Variable

...

Exploring MLflow's components and usage patterns

Let's use the working example implemented in the previous section to explore MLflow's central concepts, components, and usage patterns. These include experiments, runs, metadata about experiments, artifacts for experiments, models, and code.

Exploring experiments and runs in MLflow

An experiment is a first-class entity in the MLflow APIs. This makes sense, as data scientists and ML engineers need to run many experiments in order to build a working model that meets the requirements. However, the idea of an experiment goes beyond the model development stage and extends to the entire life cycle of ML/DL development and deployment. This means that when we retrain a model, or train a production version of it, we need to treat those runs as production-quality experiments. This unified view of experiments builds a bridge between the offline and online production environments. Each experiment consists...

Summary

In this chapter, we learned how to set up MLflow to work with either a local or a remote MLflow tracking server. We then implemented our first DL model with MLflow autologging enabled. This allowed us to explore MLflow in a hands-on way and understand a few central concepts and foundational components, such as experiments, runs, metadata about experiments and runs, code tracking, model logging, and model flavors. The knowledge and first-round experience gained in this chapter will help us learn MLflow's tracking APIs in more depth in the next chapter.

Further reading

To further your knowledge, you can consult the following resources and documentation:

