Reader small image

You're reading from  Machine Learning Engineering with MLflow

Product typeBook
Published inAug 2021
PublisherPackt
ISBN-139781800560796
Edition1st Edition
Tools
Right arrow
Author (1)
Natu Lauchande
Natu Lauchande
author image
Natu Lauchande

Natu Lauchande is a principal data engineer in the fintech space currently tackling problems at the intersection of machine learning, data engineering, and distributed systems. He has worked in diverse industries, including biomedical/pharma research, cloud, fintech, and e-commerce/mobile. Along the way, he had the opportunity to be granted a patent (as co-inventor) in distributed systems, publish in a top academic journal, and contribute to open source software. He has also been very active as a speaker at machine learning/tech conferences and meetups.
Read more about Natu Lauchande

Right arrow

Setting up a batch inference job

The code required for this section is in the pystock-inference-api folder. The MLflow infrastructure is provided in the Docker image accompanying the code as shown in the following figure:

Figure 9.1 – Layout of a batch scoring deployment

If you have direct access to the artifacts, you can do the following. The code is available under the pystock-inference-batch directory. In order to set up a batch inference job, we will follow these steps:

  1. Import the dependencies of your batch job; among the relevant dependencies we include pandas, mlflow, and xgboost:
    import pandas as pd
    import mlflow
    import xgboost as xgb
    import mlflow.xgboost
    import mlflow.pyfunc
  2. We will next load start_run by calling mlflow.start_run and load the data from the input.csv scoring input file:
    if __name__ == "__main__":
        with mlflow.start_run(run_name="batch_scoring") as run:
        ...
lock icon
The rest of the page is locked
Previous PageNext Page
You have been reading a chapter from
Machine Learning Engineering with MLflow
Published in: Aug 2021Publisher: PacktISBN-13: 9781800560796

Author (1)

author image
Natu Lauchande

Natu Lauchande is a principal data engineer in the fintech space currently tackling problems at the intersection of machine learning, data engineering, and distributed systems. He has worked in diverse industries, including biomedical/pharma research, cloud, fintech, and e-commerce/mobile. Along the way, he had the opportunity to be granted a patent (as co-inventor) in distributed systems, publish in a top academic journal, and contribute to open source software. He has also been very active as a speaker at machine learning/tech conferences and meetups.
Read more about Natu Lauchande