Reader small image

You're reading from  Essential PySpark for Scalable Data Analytics

Product typeBook
Published inOct 2021
Reading LevelBeginner
PublisherPackt
ISBN-139781800568877
Edition1st Edition
Languages
Tools
Concepts
Right arrow
Author (1)
Sreeram Nudurupati
Sreeram Nudurupati
author image
Sreeram Nudurupati

Sreeram Nudurupati is a data analytics professional with years of experience in designing and optimizing data analytics pipelines at scale. He has a history of helping enterprises, as well as digital natives, build optimized analytics pipelines by using the knowledge of the organization, infrastructure environment, and current technologies.
Read more about Sreeram Nudurupati

Right arrow

Summary

In this chapter, you were introduced to the end-to-end ML life cycle and the various steps involved in it. MLflow is a complete, end-to-end ML life cycle management tool. The MLflow Tracking component was presented, which is useful for streaming the ML experimentation process and helps you track all its attributes, including the data version, ML code, model parameters and metrics, and any other arbitrary artifacts. MLflow Model was introduced as a standards-based model format that provides model portability and reproducibility. MLflow Model Registry was also explored, which is a central model repository that supports the entire life cycle of a newly created model, from staging to production to archival. Model serving mechanisms, such as using batch and online processes, were also introduced. Finally, continuous delivery for ML was introduced. It is used to streamline the entire ML life cycle and automate the model life cycle using Model Registry features, such as the ability...

lock icon
The rest of the page is locked
Previous PageNext Chapter
You have been reading a chapter from
Essential PySpark for Scalable Data Analytics
Published in: Oct 2021Publisher: PacktISBN-13: 9781800568877

Author (1)

author image
Sreeram Nudurupati

Sreeram Nudurupati is a data analytics professional with years of experience in designing and optimizing data analytics pipelines at scale. He has a history of helping enterprises, as well as digital natives, build optimized analytics pipelines by using the knowledge of the organization, infrastructure environment, and current technologies.
Read more about Sreeram Nudurupati