You're reading from The Deep Learning Architect's Handbook

Product type Book

Published in Dec 2023

Publisher Packt

ISBN-13 9781803243795

Pages 516 pages

Edition 1st Edition

Languages

Concepts

Deep Learning

Author (1):

Ee Kin Chin

Table of Contents (25) Chapters

Preface

Part 1 – Foundational Methods

Chapter 1: Deep Learning Life Cycle

Chapter 2: Designing Deep Learning Architectures

Chapter 3: Understanding Convolutional Neural Networks

Chapter 4: Understanding Recurrent Neural Networks

Chapter 5: Understanding Autoencoders

Chapter 6: Understanding Neural Network Transformers

Chapter 7: Deep Neural Architecture Search

Chapter 8: Exploring Supervised Deep Learning

Chapter 9: Exploring Unsupervised Deep Learning

Part 2 – Multimodal Model Insights

Chapter 10: Exploring Model Evaluation Methods

Chapter 11: Explaining Neural Network Predictions

Chapter 12: Interpreting Neural Networks

Chapter 13: Exploring Bias and Fairness

Chapter 14: Analyzing Adversarial Performance

Part 3 – DLOps

Chapter 15: Deploying Deep Learning Models to Production

Chapter 16: Governing Deep Learning Models

Chapter 17: Managing Drift Effectively in a Dynamic Environment

Chapter 18: Exploring the DataRobot AI Platform

Chapter 19: Architecting LLM Solutions

Index

Why subscribe?

Other Books You May Enjoy

Exploring neural network transformers

Figure 6.1 provides an overview of the impact transformers have had, thanks to the plethora of transformer model variants.

Figure 6.1 – Transformers’ different modality and model branches

The transformer does not have inherent inductive bias structurally designed into its architecture. Inductive bias refers to the pre-assumptions made by a learning algorithm on the data. This bias can be built into the model architecture or the learning process, and it helps to guide the model toward learning specific patterns or structures in the data. Traditional models, such as RNNs, incorporate inductive bias through their design, for instance, by assuming that data has a sequential structure and that the order of elements is important. Another example is CNN models, which are specifically designed for processing grid-like data, such as images, by incorporating inductive bias in the form of local connectivity and...