You're reading from Transformers for Natural Language Processing and Computer Vision - Third Edition

Product typeBook

Published inFeb 2024

Reading LevelN/a

PublisherPackt

ISBN-139781805128724

Edition3rd Edition

Languages

Python

Tools

PyTorch

Concepts

Deep Learning

Author (1)

Denis Rothman

Hugging Face AutoTrain: Training Vision Models without Coding

Training a machine learning model doesn’t require a college degree anymore. Somebody with no AI or programming knowledge can train a transformer model.

Hugging Face’s AutoTrain requires no coding. You just need to upload your data. AutoTrain will then automatically process the data and choose and train one or several models for your project. You can then deploy the trained model in a few clicks.

You can find similar services on Google AI Platform AutoML, Amazon SageMaker Autopilot, Microsoft Azure Machine Learning, IBM Watson Studio, and a growing number of other platforms.

In less than a few years, anybody can compete with somebody with a Ph.D. in AI who spent years in college. With automated machine learning platforms, a powerful copilot such as ChatGPT, and a credit card, anybody can compete with an AI engineer.

This chapter will show how anybody can train a vision transformer with absolutely...

Goal and scope of this chapter

AI systems have become more automated but also more complex. This chapter aims to point out the expertise a non-AI user may require in some cases from an AI expert to get the job done.

This chapter describes the main steps of how to implement Hugging Fae AutoTrain for an image classification task on CIFAR-10 transportation images.

The scope remains to implement an automated training process on a cloud platform.

While going through the process, we will point out some, not all, of the issues that may come up to show the role of an AI professional in the ever-changing workplace.

Before we begin, we will go through some key things to note:

Through hard work and creativity, Hugging Face managed to make transformers accessible. They also deserve credit for anticipating transformer technology’s huge success years before it reached mainstream users with ChatGPT.
The goal of this chapter is not to oppose college graduate...

Getting started

The Hugging Face platform is continuously upgraded and modified to adapt to the AI market. The interfaces are constantly evolving through competition and user feedback. As such, this section describes the process and provides the links to get the job done. Focus on the processes. The interface will continuously evolve. Modern AI requires adaptability and continuous learning.

Make sure to read Hugging Face’s auto-training conditions and the cost of this service before proceeding to activate anything on the platform.

To get started, first, go to Hugging Face’s AutoTrain platform: https://huggingface.co/autotrain.

You will see that Hugging Face insists on creating “powerful models without code.”

Make sure to follow the instructions that will also evolve but are worth the time it takes. Make sure to obtain your Hugging Face token.

The AI professional sees the opportunity to save time and energy. The non-AI professional...

Uploading the dataset

First, click on Upload Training File(s) as shown in Figure 18.2:

A close-up of a cloud Description automatically generated

Figure 18.2: Uploading the training dataset

The interface may evolve, but you will need to upload data. You will need to read the documentation on the Hugging Face platform carefully to prepare your data. The interface evolves constantly to follow the cutting-edge AI market. Choose your method but remain focused on the task: loading data.

You will need to follow Hugging Face’s procedures for data formatting: https://huggingface.co/docs/autotrain/image_classification.

Make sure to read the upgrades regularly. Again, it is worthwhile!

In this case, we are loading CIFAR-10 images as shown in the Figure 18.3 excerpt:

A collage of different images of airplanes and cars Description automatically generated

Figure 18.3: Excerpt of CIFAR-10 transportation images

The CIFAR-10 images in this chapter are from Learning Multiple Layers of Features from Tiny Images, Alex Krizhevsky, 2009:https://www.cs.toronto.edu/~kriz/learning-features-2009-TR...

Training models with AutoTrain

When we activate the training process, Hugging Face first downloads our data from the Hugging Face Hub, where we uploaded our dataset.

Hugging Face’s AutoTrain interface provides a rapid, seamless, and intuitive approach to training text and vision models. Just click on Start Training when you are ready, as shown in Figure 18.8:

A blue rectangle with white text Description automatically generated

Figure 18.8: AutoTrain: an intuitive approach

Hugging Face AutoTrain’s module acts as an overall controller for your training process. For more, read Hugging Face’s documentation, which evolves constantly as the platform makes progress: https://huggingface.co/docs/autotrain/main/en/index#who-should-use-autotrain.

Once a model is trained, we can train other ones to create a trained model set of potential candidates for our project.

Reminder: Evaluate the cost of training before running a training process. Budget management remains a critical factor in training AI models.

...

Deploying a model

Make sure to read the latest continuously evolving documentation and explore the latest Hugging Face interfaces. The interface may change, but you will have to create a model card for your model.

We can go to our profile and choose the model card of one of our trained models to access the settings of the model and decide how to deploy it, as shown in Figure 18.11:

A screenshot of a computer Description automatically generated

Figure 18.11: Model card of a trained model

We can choose a deployment method: deploy on a platform such as AWS, for example, to create an endpoint or run the model directly in transformers. In this chapter, we will run the models by using transformers directly.

We first need to make each model public by going to the model card, then Settings, making the model public, as shown in Figure 18.12:

Figure 18.12: Making a Hugging Face model public

Clicking on Make public will make it public (Figure 18.13):

A close up of a text Description automatically generated

Figure 18.13: Model visibility status

We will now use the...

Running our models for inference

The trained models can now perform image classification with validation images. In this section, we will run several trained models.

Open the Hugging_Face_AutoTrain.ipynb that we will use in this section to:

Retrieve a relatively easy image and a challenging one.
Classify the validation images.
Analyze the difficulty of image classification.
Investigate the configuration of the trained models.

We will begin by retrieving the validation images.

Retrieving validation images

The notebook first imports IPython for media rendering:

from IPython.display import Image     #This is used for rendering images in the notebook

The first image is relatively easy to classify: generate_an_image_of_a_car_in_space.jpg. This image, which we will now download, was classified in Chapter 16, Beyond Text: Vision Transformers in the Dawn of Revolutionary AI, by a vision transformer:

#Development access to delete...

Summary

This chapter brought us to the frontier of AI, where automation rages. Competitors around the world are struggling to make AI accessible to mainstream users. OpenAI’s ChatGPT opened the door to a flood of automated tasks.

Hugging Face has successfully deployed numerous automated functions on their platform, made transformers easy to run, and provided many other productive functions. Hugging Face AutoTrain provides a no-coding service to train and deploy AI models.

We began by creating an image classification project with a Hugging Face Space. We uploaded a CIFAR-10 dataset with transportation images.

We then continued by running the models with a difficult and an easy image of cars. We first built an API function to query the models and an output processing function to display the scores produced.

The models explored were ViT, Swin, BEiT, ConvNext, and ResNet. The chapter notebook displayed each model’s configuration, accompanied by a brief model...

Questions

Hugging Face AutoTrain can train every vision transformer on the market. (True/False)
Datasets are always easy to create. (True/False)
A no-coding AI system requires no machine learning knowledge. (True/False)
Hugging Face AutoTrain can classify any image submitted. (True/False)
Even a well-prepared dataset can be insufficient. (True/False)
Creating a validation set of images to test a vision transformer is useful. (True/False)
Even a well-trained model can lead to overfitting. (True/False)
It may take months to optimize a vision transformer. (True/False)
An automated service can sometimes work well with no coding. (True/False)
An AI professional will always be necessary for complex AI issues. (True/False)

References

Hugging Face AutoTrain: https://huggingface.co/autotrain
ViT on Hugging Face: https://huggingface.co/docs/transformers/v4.27.1/model_doc/vit
Swin on Hugging Face: https://huggingface.co/docs/transformers/main/model_doc/swin
BEiT on Hugging Face: https://huggingface.co/docs/transformers/main/model_doc/beit
ConvNext: https://huggingface.co/docs/transformers/main/model_doc/convnext
Hugging Face ResNet: https://huggingface.co/docs/transformers/model_doc/resnet

Denis Rothman graduated from Sorbonne University and Paris-Diderot University, designing one of the very first word2matrix patented embedding and patented AI conversational agents. He began his career authoring one of the first AI cognitive Natural Language Processing (NLP) chatbots applied as an automated language teacher for Moet et Chandon and other companies. He authored an AI resource optimizer for IBM and apparel producers. He then authored an Advanced Planning and Scheduling (APS) solution used worldwide.
Read more about Denis Rothman

Personalised recommendations for you

Based on your interests and search pattern

Et al.

Ever wonder why speech recognition systems don't understand the Scottish accent, or what would happen if an astronaut only ate mac 'n' cheese, or other spurious reflections you'd have at a bar? We did, then collated those deliberations into absurd research articles with fake figures and methodologies inspired by even more fictionally absurd studies.

BookAug 2023230 pages5

Generative AI with LangChain

This book is a comprehensive introduction to LLMs and LangChain, demystifying the basic mechanics of LangChain, its functionalities, and the myriad of applications it can be integrated into.

BookDec 2023360 pages4

Generative AI with LangChain

This book is a comprehensive introduction to LLMs and LangChain, demystifying the basic mechanics of LangChain, its functionalities, and the myriad of applications it can be integrated into.

BookDec 2023360 pages5

Generative AI with LangChain

This book is a comprehensive introduction to LLMs and LangChain, demystifying the basic mechanics of LangChain, its functionalities, and the myriad of applications it can be integrated into.

BookDec 2023360 pages1

Generative AI with LangChain

This book is a comprehensive introduction to LLMs and LangChain, demystifying the basic mechanics of LangChain, its functionalities, and the myriad of applications it can be integrated into.

BookDec 2023360 pages5

Mastering Tableau 2023

This book is a comprehensive resource to mastering your Tableau skills and becoming a BI expert. As you progress, you will learn how to build advanced dashboards and improve your storytelling to derive key business insight, as well as make you well-versed with advanced functionalities of Tableau in the business intelligence domain.

BookAug 2023684 pages

Building AI Applications with ChatGPT APIs

This guide covers all ChatGPT API features for effortless creation of robust AI powered apps. With its help, you’ll be able to leverage ChatGPT’s cutting-edge NLP models to take your app development skills to the next level. You’ll also work on ten exciting projects that will give you the practical know-how that you can apply to your existing applications.

BookSep 2023258 pages5

Building AI Applications with ChatGPT APIs

This guide covers all ChatGPT API features for effortless creation of robust AI powered apps. With its help, you’ll be able to leverage ChatGPT’s cutting-edge NLP models to take your app development skills to the next level. You’ll also work on ten exciting projects that will give you the practical know-how that you can apply to your existing applications.

BookSep 2023258 pages2

Data Engineering with AWS

Embark on a journey to master data engineering pipelines on AWS! Our book offers a hands-on experience of AWS services for ingesting, transforming, and consuming data. Whether you're an absolute beginner or someone with basic data engineering experience, this guide is an indispensable resource.

BookOct 2023636 pages5

Modern Data Architecture on AWS

Every organization wants an agile, performant, and cost-effective data platform that meets all their current and future business needs. Purpose-built AWS analytics services and their features play a big part in building such a modern data platform. This book brings to you all the design and architectural patterns that’ll help you achieve this goal.

BookAug 2023420 pages5

Practical Guide to Applied Conformal Prediction in Python

Discover the power of Conformal Prediction with the "Practical Guide to Applied Conformal Prediction in Python." Master the latest techniques to quantify uncertainty in machine learning and computer vision models, and seamlessly apply them to your industry applications.

BookDec 2023240 pages

TinyML Cookbook

With over 70 project-based recipes, the TinyML Cookbook is a practical guide that will help you to get the most out of your microcontrollers. It provides a comprehensive understanding of the theoretical foundations while giving you hands-on experience training ML models for deployment on Arduino Nano 33 BLE Sense, Raspberry Pi Pico, and SparkFun RedBoard Artemis Nano microcontrollers.

BookNov 2023664 pages

You're reading from Transformers for Natural Language Processing and Computer Vision - Third Edition

Hugging Face AutoTrain: Training Vision Models without Coding

Goal and scope of this chapter

Getting started

Uploading the dataset

Training models with AutoTrain

Deploying a model

Running our models for inference

Retrieving validation images

Summary

Questions

References

Further reading

Join our community on Discord

Author (1)