You're reading from Transformers for Natural Language Processing and Computer Vision - Third Edition

Product type Book

Published in Feb 2024

Publisher Packt

ISBN-13 9781805128724

Pages 728 pages

Edition 3rd Edition

Languages

Python

Concepts

Deep Learning

Author (1):

Denis Rothman

Table of Contents (24) Chapters

Preface

What Are Transformers?

Getting Started with the Architecture of the Transformer Model

Emergent vs Downstream Tasks: The Unseen Depths of Transformers

Advancements in Translations with Google Trax, Google Translate, and Gemini

Diving into Fine-Tuning through BERT

Pretraining a Transformer from Scratch through RoBERTa

The Generative AI Revolution with ChatGPT

Fine-Tuning OpenAI GPT Models

Shattering the Black Box with Interpretable Tools

Investigating the Role of Tokenizers in Shaping Transformer Models

Leveraging LLM Embeddings as an Alternative to Fine-Tuning

Toward Syntax-Free Semantic Role Labeling with ChatGPT and GPT-4

Summarization with T5 and ChatGPT

Exploring Cutting-Edge LLMs with Vertex AI and PaLM 2

Guarding the Giants: Mitigating Risks in Large Language Models

Beyond Text: Vision Transformers in the Dawn of Revolutionary AI

Transcending the Image-Text Boundary with Stable Diffusion

Hugging Face AutoTrain: Training Vision Models without Coding

On the Road to Functional AGI with HuggingGPT and its Peers

Beyond Human-Designed Prompts with Generative Ideation

Other Books You May Enjoy

Index

Appendix: Answers to the Questions

Fine-tuning

Fine-tuning becomes an option when the responses produce an acceptable result or when the prompt design does not meet expectations. Or not! In Chapter 11, Leveraging LLM Embeddings as an Alternative to Fine-Tuning, we saw that advanced prompt engineering leveraging OpenAI LLM Ada’s embeddings could produce good results.

So, what should we do? Prompt design by crafting good prompts with a ready-to-use model? Prompt engineering with an embedding model? Fine-tune a model to fit our needs?

Each of these choices comes with a cost. The best empirical method in computer science remains to:

Rely on a reliable and optimized (volume, quality) evaluation dataset.
Test different models and approaches. In this case, evaluate the outputs obtained through prompt design, engineering, and fine-tuning.
Evaluate the risks and costs.

Like Amazon Web Services (AWS), Microsoft Azure, IBM Cloud, and others, Google Cloud provides a solid, simplified...