You're reading from The Deep Learning Architect's Handbook

Product type Book

Published in Dec 2023

Publisher Packt

ISBN-13 9781803243795

Pages 516 pages

Edition 1st Edition

Languages

Concepts

Deep Learning

Author (1):

Ee Kin Chin

Table of Contents (25) Chapters

Preface

1. Part 1 – Foundational Methods

2. Chapter 1: Deep Learning Life Cycle

3. Chapter 2: Designing Deep Learning Architectures

4. Chapter 3: Understanding Convolutional Neural Networks

5. Chapter 4: Understanding Recurrent Neural Networks

6. Chapter 5: Understanding Autoencoders

7. Chapter 6: Understanding Neural Network Transformers

8. Chapter 7: Deep Neural Architecture Search

9. Chapter 8: Exploring Supervised Deep Learning

10. Chapter 9: Exploring Unsupervised Deep Learning

11. Part 2 – Multimodal Model Insights

12. Chapter 10: Exploring Model Evaluation Methods

13. Chapter 11: Explaining Neural Network Predictions

14. Chapter 12: Interpreting Neural Networks

15. Chapter 13: Exploring Bias and Fairness

16. Chapter 14: Analyzing Adversarial Performance

17. Part 3 – DLOps

18. Chapter 15: Deploying Deep Learning Models to Production

19. Chapter 16: Governing Deep Learning Models

20. Chapter 17: Managing Drift Effectively in a Dynamic Environment

21. Chapter 18: Exploring the DataRobot AI Platform

22. Chapter 19: Architecting LLM Solutions

23. Index

Why subscribe?

24. Other Books You May Enjoy

Identifying challenges with LLM solutions

Despite their impressive capabilities, LLMs face challenges when solving complex real-world problems. In this section, we will explore some of the challenges faced by LLM solutions and discuss possible ways to tackle them. We will explore challenges by high-level groups, as follows:

Output and input limitations:
- LLMs just produce text: Text output can help provide value for a lot of businesses. However, many other use cases require predictions and recommendations in entirely different formats.
- The context size of an LLM is limited: The issue is that with a large input size, you need exponentially more compute resources to train and predict. So, context size usually stays in a token range of one to three thousand. This issue should be prevalent only for use cases that require long context, as a few thousand context sizes should be enough for most use cases.
- An LLM is a text-specific model: Other data modalities are not supported by default...