Reader small image

You're reading from  Practical Deep Learning at Scale with MLflow

Product typeBook
Published inJul 2022
PublisherPackt
ISBN-139781803241333
Edition1st Edition
Right arrow
Author (1)
Yong Liu
Yong Liu
author image
Yong Liu

Yong Liu has been working in big data science, machine learning, and optimization since his doctoral student years at the University of Illinois at Urbana-Champaign (UIUC) and later as a senior research scientist and principal investigator at the National Center for Supercomputing Applications (NCSA), where he led data science R&D projects funded by the National Science Foundation and Microsoft Research. He then joined Microsoft and AI/ML start-ups in the industry. He has shipped ML and DL models to production and has been a speaker at the Spark/Data+AI summit and NLP summit. He has recently published peer-reviewed papers on deep learning, linked data, and knowledge-infused learning at various ACM/IEEE conferences and journals.
Read more about Yong Liu

Right arrow

Summary

In this chapter, we took a deep dive into how we can track code and data versions in an MLflow experiment run. We started by reviewing the different types of notebooks: Jupyter notebooks, Databricks notebooks, and VS Code notebooks. We compared them and recommended that VS Code should be used to author a notebook due to its IDE support, as well as its Python styling, autocompletion, and many more rich features.

Then, after reviewing the limitations of existing ML pipeline API frameworks, we discussed how to create a multi-step DL pipeline using MLflow's MLproject framework. We showed a step-by-step approach to creating a three-step DL pipeline using MLproject and how to implement a pipeline function to orchestrate the necessary tasks. We also provided a Python implementation template to help you implement each pipeline task. When running a pipeline with MLflow, we can track the entire pipeline's progress with a parent run_id, and then use a child run_id for each...

lock icon
The rest of the page is locked
Previous PageNext Page
You have been reading a chapter from
Practical Deep Learning at Scale with MLflow
Published in: Jul 2022Publisher: PacktISBN-13: 9781803241333

Author (1)

author image
Yong Liu

Yong Liu has been working in big data science, machine learning, and optimization since his doctoral student years at the University of Illinois at Urbana-Champaign (UIUC) and later as a senior research scientist and principal investigator at the National Center for Supercomputing Applications (NCSA), where he led data science R&D projects funded by the National Science Foundation and Microsoft Research. He then joined Microsoft and AI/ML start-ups in the industry. He has shipped ML and DL models to production and has been a speaker at the Spark/Data+AI summit and NLP summit. He has recently published peer-reviewed papers on deep learning, linked data, and knowledge-infused learning at various ACM/IEEE conferences and journals.
Read more about Yong Liu