You're reading from The Deep Learning Architect's Handbook

Product type Book

Published in Dec 2023

Publisher Packt

ISBN-13 9781803243795

Pages 516 pages

Edition 1st Edition

Languages

Concepts

Deep Learning

Author (1):

Ee Kin Chin

Table of Contents (25) Chapters

Preface

1. Part 1 – Foundational Methods

2. Chapter 1: Deep Learning Life Cycle

3. Chapter 2: Designing Deep Learning Architectures

4. Chapter 3: Understanding Convolutional Neural Networks

5. Chapter 4: Understanding Recurrent Neural Networks

6. Chapter 5: Understanding Autoencoders

7. Chapter 6: Understanding Neural Network Transformers

8. Chapter 7: Deep Neural Architecture Search

9. Chapter 8: Exploring Supervised Deep Learning

10. Chapter 9: Exploring Unsupervised Deep Learning

11. Part 2 – Multimodal Model Insights

12. Chapter 10: Exploring Model Evaluation Methods

13. Chapter 11: Explaining Neural Network Predictions

14. Chapter 12: Interpreting Neural Networks

15. Chapter 13: Exploring Bias and Fairness

16. Chapter 14: Analyzing Adversarial Performance

17. Part 3 – DLOps

18. Chapter 15: Deploying Deep Learning Models to Production

19. Chapter 16: Governing Deep Learning Models

20. Chapter 17: Managing Drift Effectively in a Dynamic Environment

21. Chapter 18: Exploring the DataRobot AI Platform

22. Chapter 19: Architecting LLM Solutions

23. Index

Why subscribe?

24. Other Books You May Enjoy

Summary

In this chapter, we explored the various aspects of deploying DL models in production environments, focusing on key components, requirements, and strategies. We discussed architectural choices, hardware infrastructure, model packaging, safety, trust, reliability, security, authentication, communication protocols, user interfaces, monitoring, and logging components, along with continuous integration and deployment.

This chapter also provided a step-by-step guide for choosing the right deployment options based on specific needs, such as latency, availability, scalability, cost, model hardware, data privacy, and safety requirements. We also explored general recommendations for ensuring model safety, trust, and reliability, optimizing model latency, and utilizing tools that simplify the deployment process.

A practical tutorial on deploying a language model with ONNX, TensorRT, and NVIDIA Triton Server was presented, showcasing a minimal workflow needed for accelerated deployment...