
You're reading from Transformers for Natural Language Processing and Computer Vision - Third Edition

Product type: Book
Published in: Feb 2024
Publisher: Packt
ISBN-13: 9781805128724
Edition: 3rd Edition
Author (1)

Denis Rothman

Denis Rothman graduated from Sorbonne University and Paris-Diderot University, designing one of the very first word2matrix patented embedding and patented AI conversational agents. He began his career authoring one of the first AI cognitive Natural Language Processing (NLP) chatbots applied as an automated language teacher for Moet et Chandon and other companies. He authored an AI resource optimizer for IBM and apparel producers. He then authored an Advanced Planning and Scheduling (APS) solution used worldwide.

Fine-tuning GPT-3 for completion (generative)

OpenAI (at the time of writing this book) has a service to fine-tune the following original GPT-3 models: davinci, curie, babbage, and ada. They are original models and, as such, have no suffixes. GPT-4 models are not available for fine-tuning at the time of writing. However, if GPT-4 models become available for fine-tuning, the same or a similar process as for GPT-3 will apply.

A fine-tuned model can perform data exploration, classification, question answering, and other NLP tasks like the original models. As such, the fine-tuned model might produce acceptable or inaccurate results. Quality control remains essential. Make sure to go through OpenAI's documentation before beginning a project: https://platform.openai.com/docs/guides/fine-tuning/

This section aims to implement the fine-tuning process of a model in a notebook, cell by cell, so you can apply fine-tuning to your specific domain. Fine-tuning GPT-3 models...
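Before a fine-tuning job can be submitted, the training data must be expressed as JSONL: one JSON object per line with "prompt" and "completion" fields. The following is a minimal sketch of that preparation step; the sentences and the file name are hypothetical stand-ins, not the book's actual Critique of Pure Reason dataset.

```python
import json
import os
import tempfile

# Hypothetical prompt/completion pairs standing in for the pre-processed dataset.
examples = [
    {"prompt": "Pure reason is",
     "completion": " the faculty that supplies the principles of knowledge a priori."},
    {"prompt": "A judgment is analytic when",
     "completion": " its predicate is contained in its subject."},
]

def write_jsonl(records, path):
    """Write records to a JSONL file: one JSON object per line."""
    with open(path, "w", encoding="utf-8") as f:
        for record in records:
            f.write(json.dumps(record) + "\n")

def read_jsonl(path):
    """Read a JSONL file back into a list of dictionaries."""
    with open(path, encoding="utf-8") as f:
        return [json.loads(line) for line in f]

# Hypothetical training-file path.
path = os.path.join(tempfile.gettempdir(), "kant_prepared.jsonl")
write_jsonl(examples, path)
loaded = read_jsonl(path)
print(len(loaded))  # prints 2
```

In the actual workflow, a file in this format is uploaded to OpenAI and referenced when creating the fine-tuning job; OpenAI's data-preparation tooling can also validate and convert raw data into this layout.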

Fine-tuned for classification (discriminative)

The miracle of generative models, such as GPTs, is that they can perform a classification task with the right prompts! In this section, we will fine-tune babbage-002 to classify baseball and hockey text inputs. You will see that you can fine-tune an original OpenAI model for a wide range of tasks. Your imagination will be the limit!

Open Fine-tuned_classification.ipynb in the chapter directory of the GitHub repository. The structure of the notebook is the same as that of the Fine_tuning_GPT_3.ipynb notebook we just created. The main section titles are identical. Fine_tuning_GPT_3.ipynb was created for completion tasks, with text as a prompt and completion, although you can modify it for any other NLP task. Fine-tuned_classification.ipynb is designed to classify baseball and hockey texts. You can adapt this notebook to other NLP tasks once you have explored it.

The dataset is designed for classification tasks, but the process is the same as the one we went...
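To make a generative model act as a classifier, each training example pairs a text prompt with a short label completion. A minimal sketch of that formatting is below; the separator string, example texts, and label tokens are illustrative conventions, not requirements fixed by the API.

```python
import json

# A separator marks where the prompt ends, so the model learns to emit
# only the label after it. This particular string is a common convention.
SEPARATOR = "\n\n###\n\n"

def to_classification_example(text, label):
    """Format one record: prompt ends with the separator, completion is the label."""
    return {"prompt": text + SEPARATOR, "completion": " " + label}

# Hypothetical stand-ins for the baseball/hockey dataset.
records = [
    to_classification_example("The pitcher threw a no-hitter last night.", "baseball"),
    to_classification_example("He scored on a power play in overtime.", "hockey"),
]

# Serialize to JSONL, ready to be uploaded as a fine-tuning training file.
jsonl = "\n".join(json.dumps(r) for r in records)
print(jsonl.splitlines()[0])
```

At inference time, the same separator is appended to the input text, and the fine-tuned model is expected to complete with one of the trained labels.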

Summary

This chapter led us to the potential of adapting an OpenAI model to our needs through fine-tuning. The process requires careful data analysis and preparation. We must also determine whether fine-tuning on OpenAI's platform violates our privacy, confidentiality, and security requirements.

We first built a fine-tuning process for a completion (generative) task by loading a pre-processed dataset of Immanuel Kant's Critique of Pure Reason. We submitted it to OpenAI's data preparation tool. The tool converted our data into JSONL. An ada model was fine-tuned and stored. We then ran the model.

Then the babbage-002 model was fine-tuned for a classification (discriminative) task. This process brought us back to square one: can a standard OpenAI model achieve the same results as a fine-tuned model? If so, why bother fine-tuning a model?

To satisfy our scientific curiosity, we ran davinci on the same task as the trained ada to classify a text to determine if it was about...

Questions

  1. It is useless to fine-tune an OpenAI model. (True/False)
  2. Any pretrained OpenAI model can do the task we need without fine-tuning. (True/False)
  3. We don't need to prepare a dataset to fine-tune an OpenAI model. (True/False)
  4. If no dataset is available on the web, we don't need one. (follow-up to question 3) (True/False)
  5. We don't need to keep track of the fine-tunes we created. (True/False)
  6. As of July 2023, anybody can access our fine-tunes. (True/False)
  7. Wandb is a state-of-the-art transformer model. (True/False)
  8. Wandb can be synced with OpenAI models. (True/False)
  9. Unfortunately, Wandb cannot display accuracy. (True/False)
  10. The lineage of the fine-tunes is one of Wandb's artifacts. (True/False)

Further Reading

Weights and Biases articles: https://wandb.ai/site/articles
Fine-tuning research: Fine-Tuning Language Models with Just Forward Passes, Malladi et al. (2023), https://arxiv.org/abs/2305.17333

Join our book's Discord space

Join the book's Discord workspace: https://www.packt.link/Transformers



