You're reading from Transformers for Natural Language Processing and Computer Vision, Third Edition
Product type: Book
Published: February 2024
Publisher: Packt
ISBN-13: 9781805128724
Edition: 3rd
Author
Denis Rothman

Denis Rothman graduated from Sorbonne University and Paris-Diderot University, designing one of the very first patented word2matrix embeddings and patented AI conversational agents. He began his career authoring one of the first AI cognitive Natural Language Processing (NLP) chatbots, applied as an automated language teacher for Moët et Chandon and other companies. He authored an AI resource optimizer for IBM and apparel producers, and then an Advanced Planning and Scheduling (APS) solution used worldwide.

Exploring Cutting-Edge LLMs with Vertex AI and PaLM 2

Chapter 7, The Generative AI Revolution with ChatGPT, may have led us to think that OpenAI's ChatGPT marked the end of the tremendous progression of Large Language Models (LLMs) and Generative AI. Perhaps incredible tools such as GitHub Copilot, GPT-4, and OpenAI's embedding models stunned us into believing we had reached a limit.

Then, on May 23, 2023, Google released the PaLM 2 Technical Report, built on the earlier PaLM and Pathways papers. The content of these papers is mind-blowing! The architecture of PaLM improved both software and hardware performance with ingenious innovations.

We will begin the chapter by examining Pathways to understand PaLM. We will then look at the main features of PaLM (which stands for Pathways Language Model), a decoder-only, densely activated, autoregressive transformer model with 540 billion parameters trained on Google's Pathways system. PaLM was trained on 780 billion...
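The defining mechanism of a decoder-only, autoregressive model such as PaLM is causal (masked) self-attention: each token may attend only to itself and earlier tokens, so the model generates text one token at a time. Here is a toy NumPy sketch of one such layer; the dimensions are illustrative and far smaller than PaLM's, and the weights are random rather than trained.

```python
import numpy as np

def causal_self_attention(x, w_q, w_k, w_v):
    """One masked self-attention layer: each position may only
    attend to itself and earlier positions (autoregressive)."""
    q, k, v = x @ w_q, x @ w_k, x @ w_v
    scores = q @ k.T / np.sqrt(k.shape[-1])
    # Causal mask: block attention to future positions.
    mask = np.triu(np.ones_like(scores, dtype=bool), k=1)
    scores = np.where(mask, -1e9, scores)
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)
    return weights @ v, weights

rng = np.random.default_rng(0)
d = 8                                 # toy embedding size
x = rng.normal(size=(5, d))           # 5 tokens already generated
w_q, w_k, w_v = (rng.normal(size=(d, d)) for _ in range(3))
out, weights = causal_self_attention(x, w_q, w_k, w_v)
# The attention matrix is zero above the diagonal: no peeking ahead.
assert np.allclose(np.triu(weights, k=1), 0.0)
```

"Densely activated" means every parameter participates in every forward pass, unlike sparse mixture-of-experts models that route each token through a subset of the network.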

Architecture

PaLM and PaLM 2 were built on top of Pathways, a Google technology that improves the efficiency of training LLMs through data parallelism, model parallelism, and execution-level parallelism.

We will begin with Pathways, the cornerstone of Google AI’s impressive achievements.

Pathways

The title of the Pathways paper may seem esoteric. Pathways: Asynchronous Distributed Dataflow by Barham et al. (2022) may indeed sound like something you would rather avoid looking into. However, once you start reading the paper, you will be hooked!

A look at some of its key features shows why:

  • Heterogeneous execution: Pathways can run programs on many devices, including TPUs, CPUs, and GPUs. This is a significant advance when assembling all the computing power we can get.
  • Asynchronous execution: Pathways allows programs to be executed asynchronously. This might seem uninteresting, but PaLM will build on this technology...
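The two features above can be illustrated with Python's standard `concurrent.futures`: work is submitted to simulated "fast" and "slow" device pools up front (asynchronous dispatch), and results are collected as they complete rather than in submission order. The device names and timings are purely illustrative stand-ins for a heterogeneous TPU/CPU/GPU fleet.

```python
from concurrent.futures import ThreadPoolExecutor, as_completed
import time

def run_on_device(device, step):
    """Simulated compute: a 'fast' device finishes sooner than a 'slow' one."""
    time.sleep(0.01 if device == "fast" else 0.03)
    return device, step

with ThreadPoolExecutor(max_workers=4) as pool:
    # Asynchronous dispatch: submit everything immediately instead of
    # blocking on each step before issuing the next.
    futures = [pool.submit(run_on_device, dev, s)
               for s in range(3) for dev in ("fast", "slow")]
    # Results arrive as devices finish, not in submission order.
    results = [f.result() for f in as_completed(futures)]

assert len(results) == 6
assert {dev for dev, _ in results} == {"fast", "slow"}
```

Pathways applies the same principle at data-center scale: a single controller dispatches dataflow programs across thousands of accelerators without idling while any one of them computes.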

Assistants

The structure of this section is similar to that of Chapter 7, The Generative AI Revolution with ChatGPT, and Chapter 8, Fine-Tuning OpenAI GPT Models.

We can view the similarity between Microsoft and Google assistants from two perspectives:

  • The billion-dollar competition between these two giants is fierce, so they must produce products and services that match their technology rivalry.
  • Standardization has sprung from necessity: they offer transformer Generative AI services in every application they sell, leading to an intuitive way to use them (prompts and parameters).

As we saw in Chapter 1, What Are Transformers?, transformers are General Purpose Technologies (GPTs) like electricity, computers, smartphones, and the internet. They have pervaded every domain and will increasingly progress.

However, each competing platform produces different results and performances that we must evaluate before building an application.

Transformer...

Vertex AI PaLM 2 API

The Vertex AI platform evolves continually. You are at the cutting edge!

The models we use are updated from time to time. These updates can cause small differences in the results or in the models' behavior. The latest version might not be the most stable one. Keep watch and consult Google's versioning documentation regularly: https://cloud.google.com/vertex-ai/docs/generative-ai/learn/model-versioning.

That is the price to pay to be on top of the market! However, the power of the models makes it worthwhile to explore PaLM 2. When the models stabilize, you will be far ahead of your competition. So, please fasten your seat belts, and let’s get the API rolling!

There are hundreds of known NLP tasks, and mainstream users invent more every day. You cannot implement them all at once. Focus on understanding in depth the tasks you explore; you will then be able to adapt to new ones.
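Whatever the task, a PaLM 2 request reduces to a prompt plus a handful of sampling parameters. Below is a hypothetical helper that assembles and range-checks those parameters so the same settings can be reused across tasks; the helper name and defaults are our own. The commented-out lines sketch what the actual call looks like with the Vertex AI Python SDK at the time of writing, with the model version pinned (`@002`) per the versioning advice above; check Google's documentation, since both the SDK and the model names may change.

```python
def palm2_request(prompt, temperature=0.2, max_output_tokens=256,
                  top_p=0.8, top_k=40):
    """Hypothetical helper: bundle a prompt with the standard PaLM 2
    sampling parameters, rejecting out-of-range values early."""
    if not 0.0 <= temperature <= 1.0:
        raise ValueError("temperature must be in [0, 1]")
    if not 0.0 <= top_p <= 1.0:
        raise ValueError("top_p must be in [0, 1]")
    return {"prompt": prompt,
            "parameters": {"temperature": temperature,
                           "max_output_tokens": max_output_tokens,
                           "top_p": top_p,
                           "top_k": top_k}}

req = palm2_request("Summarize the following text in one sentence: ...")
# With the Vertex AI SDK, the call would resemble (not executed here):
# from vertexai.language_models import TextGenerationModel
# model = TextGenerationModel.from_pretrained("text-bison@002")
# response = model.predict(req["prompt"], **req["parameters"])
```

A low temperature (here 0.2) favors deterministic, repeatable answers for tasks such as classification or extraction; raise it for creative generation.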

Open Google_Vertex_AI...

Fine-tuning

Fine-tuning becomes an option when the responses do not produce acceptable results or when the prompt design does not meet expectations. Or not! In Chapter 11, Leveraging LLM Embeddings as an Alternative to Fine-Tuning, we saw that advanced prompt engineering leveraging OpenAI's Ada embedding model could produce good results.

So, what should we do? Prompt design by crafting good prompts with a ready-to-use model? Prompt engineering with an embedding model? Fine-tune a model to fit our needs?

Each of these choices comes with a cost. The best empirical method in computer science remains to:

  • Rely on a reliable, well-curated evaluation dataset (in both volume and quality).
  • Test different models and approaches. In this case, evaluate the outputs obtained through prompt design, engineering, and fine-tuning.
  • Evaluate the risks and costs.
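The three steps above can be sketched as a minimal evaluation harness: score each approach's outputs against the same evaluation dataset and compare before deciding. The dataset, the candidate outputs, and the exact-match metric are all placeholders; a real evaluation would use task-appropriate metrics and far more examples.

```python
def exact_match_score(outputs, references):
    """Fraction of outputs that match the reference answer exactly
    (case- and whitespace-insensitive) -- a deliberately simple metric."""
    hits = sum(o.strip().lower() == r.strip().lower()
               for o, r in zip(outputs, references))
    return hits / len(references)

eval_refs = ["paris", "4", "blue"]          # placeholder gold answers
candidates = {
    "prompt_design": ["Paris", "5", "blue"],  # placeholder model outputs
    "fine_tuned":    ["paris", "4", "blue"],
}
scores = {name: exact_match_score(outs, eval_refs)
          for name, outs in candidates.items()}
best = max(scores, key=scores.get)
# Weigh the winner's score against its risks and costs before committing.
```

The point is the method, not the numbers: only a shared evaluation dataset makes prompt design, prompt engineering, and fine-tuning comparable on equal terms.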

Like Amazon Web Services (AWS), Microsoft Azure, IBM Cloud, and others, Google Cloud provides a solid, simplified...

Summary

We saw that PaLM 2 could match and perhaps surpass other models in some fields. Google AI assistants, cloud tools, and APIs provide general-purpose functionalities. They resemble Microsoft Azure and OpenAI offerings. However, the technology is quite different, as we saw with the architecture that led to PaLM 2. Also, many factors will influence the decision, and the outputs might vary from one task to another.

Before working with Microsoft Azure, Google Cloud, IBM Cloud, or others, extensive evaluations must be made with the right questions in mind.

The chapter was divided into the four parts we will probably encounter when we explore new transformer models: architecture improvements, large language generative mainstream assistants, API implementation, and customization (fine-tuning or other methods).

We began the chapter by reviewing the many improvements in the architecture of LLM Generative AI models. Pathways opens the door to increased optimization...

Questions

  1. Pathways is not a significant game-changer. (True/False)
  2. PaLM 2 uses the same embedding matrices for input and output operations. (True/False)
  3. PaLM models have an encoder and a decoder stack. (True/False)
  4. Google Workspace assistants contain Generative AI. (True/False)
  5. All transformers are implemented for Generative AI. (True/False)
  6. A generative LLM cannot perform discriminative tasks. (True/False)
  7. Transformer models have reached their limits. (True/False)
  8. PaLM 2 is the last Generative AI model Google will produce. (True/False)
  9. Prompt engineering is an alternative to fine-tuning. (True/False)
  10. Fine-tuning a transformer model does not require much dataset preparation. (True/False)

References

Further reading

Join our community on Discord

Join our community’s Discord space for discussions with the authors and other readers:

https://www.packt.link/Transformers
