Search icon
Subscription
0
Cart icon
Close icon
You have no products in your basket yet
Save more on your purchases!
Savings automatically calculated. No voucher code required
Arrow left icon
All Products
Best Sellers
New Releases
Books
Videos
Audiobooks
Learning Hub
Newsletters
Free Learning
Arrow right icon
Enterprise GENERATIVE AI Well-Architected Framework & Patterns
Enterprise GENERATIVE AI Well-Architected Framework & Patterns

Enterprise GENERATIVE AI Well-Architected Framework & Patterns: An Architect's Real-life Guide to Adopting Generative AI in Enterprises at Scale

By Suvoraj Biswas
$15.99 per month
Book Apr 2024 113 pages 1st Edition
eBook
$9.99 $6.98
Subscription
$15.99 Monthly
eBook
$9.99 $6.98
Subscription
$15.99 Monthly

What do you get with a Packt Subscription?

Free for first 7 days. $15.99 p/m after that. Cancel any time!
Product feature icon Unlimited ad-free access to the largest independent learning library in tech. Access this title and thousands more!
Product feature icon 50+ new titles added per month, including many first-to-market concepts and exclusive early access to books as they are being written.
Product feature icon Innovative learning tools, including AI book assistants, code context explainers, and text-to-speech.
Product feature icon Thousands of reference materials covering every tech concept you need to stay up to date.
Subscribe now
View plans & pricing

Product Details


Publication date : Apr 4, 2024
Length 113 pages
Edition : 1st Edition
Language : English
ISBN-13 : 9781836202912
Category :
Table of content icon View table of contents Preview book icon Preview Book

Enterprise GENERATIVE AI Well-Architected Framework & Patterns

Enterprise  GENERATIVE AI

Well Architected   Framework & Patterns

An Architect’s Real-life Guide to Adopting Generative AI in Enterprises at Scale

SUVORAJ BISWAS

COPYRIGHT NOTICE

Copyright © 2023 by Suvoraj Biswas

            All rights reserved. No part of this publication may be reproduced, distributed, or transmitted in any form or by any means, including photocopying, recording, or other electronic or mechanical methods, without the prior written permission of the author, except in the case of brief quotations embodied in critical reviews and certain other noncommercial uses permitted by copyright law.For permission requests, reach the Author at suvoraj.biswas@gmail.com

            This book, " Enterprise Generative AI Architecture " is intended for informational purposes only. The content within is provided "as is" and without warranties of any kind, express or implied, including but not limited to accuracy, completeness, or fitness for a particular purpose. The author and publisher shall not be liable for any errors, omissions, or any losses, injuries, or damages arising from the display or use of this information.

The book's content, including all illustrated pictures, reflects the author's personal perspectives and research. It is entirely independent and not affiliated with any of the author's current or past organizations.

PREFACE

            Welcome to the " Enterprise Generative AI Well Architected Framework & Patterns " a comprehensive step by step guide designed for Enterprise IT Professionals to explore the cutting-edge world of generative artificial intelligence (AI) systems within the context of enterprise applications. As most of the businesses continue to embrace AI-driven solutions to streamline processes, drive automation and generate valuable insights, the importance of robust and scalable generative AI architectures becomes increasingly evident. When I decided to explore Generative AI, as a Solutions Architect I faced a lot of challenges, roadblocks and at the same time learned a lot while exploring many tools and technologies during the initial steps. My everyday learning and challenges pushed me to come up with generic well architected frameworks for my fellow IT professionals to build world class software to delight the Enterprise stakeholders following the existing common LLM integration and adoption pattern.

            This book is intended to provide readers with a clear understanding of the fundamental principles, methodologies, and best practices for implementing generative AI in large-scale enterprise environments. Whether you are a seasoned AI practitioner(Architects, Engineers / Engineering Managers or Product Managers) seeking to deepen your knowledge or an enterprise leader (VPs, CXOs, Founders) exploring the potential of Generative AI for your organization, this book offers valuable insights into leveraging the power of generative models effectively and responsibly. We will also discuss the challenges and ethical considerations associated with deploying AI systems in enterprise settings and provide strategies to address these concerns.

           

Please note that I have shared information about both open-source tools and products from premium software providers. This information is based on my independent research and exploration, and I am not affiliated with any entity or sponsored by anyone. Therefore, I recommend conducting your own due diligence before deciding to use any of these tools or products.

            I hope that " Enterprise Generative AI Well Architected Framework & Pattern " will serve as a valuable resource on your journey to unlocking the potential of generative AI in your organization. Feel free to connect with me at Linkedin ( www.linkedin.com/in/suvoraj ) or comment at Amazon to share your valuable feedback.

Happy reading & exploring Generative AI.

Suvoraj Biswas

ACKNOWLEDGEMENT

I want to express my deep appreciation to all the readers and reviewers of this book. Your belief in me and my work, along with your continued support throughout this journey, means the world to me.

I extend my heartfelt thanks to all my professors & supervisors in my career, who have been instrumental in my professional growth, and to my colleagues, from whom I've gained valuable technical and professional insights and knowledge.

Last but certainly not least, I am sincerely grateful to my wife, Moumita, and my two wonderful children, Shantosmita and Soujash, for their unwavering support. They sacrificed precious family time to support my late nights and early mornings, allowing me the space to think and write this book.

TABLE OF CONTENTS

Chapter 1:

Introduction to  Generative AI - The Well Architected Framework

  • What is the background of GenAI - Well Architected Framework?
  • Why do we need a Well Architected Framework for GenAI based Enterprise Use Case?
  • What are the pillars of the Well Architected Framework?
  • What are the building blocks for each pillar?

Chapter 2:

Operational Excellence Pillar

  • What is a Large Language Model?
  • What is a Foundation Model?
  • When should we be using the Foundation Model?

Chapter 3:

Security & Privacy Pillar

  • Introduction to Content Moderation building block
  • What is AI Guardrails and How to integrate it?

Chapter 4:

Compliance Pillar

  • Introduction to the Compliance Pillar:
  • Benefits of having an Archival for Compliance building block
  • What are some solutions for Archiving AI & Human interactions for Regulatory Compliance?

Chapter 5:

Reliability Pillar

  • The FM or LLM observability Building Block:
  • What are some LLM Observability Solutions?

Chapter 6:

Cost Optimization

  • Introduction
  • What are some best practices to optimize cost during Architecting?

Chapter 7:

System Architecture Excellence

  • Prompt Engineering
  • Why Prompt Engineering plays a crucial role in the GenAI apps?
  • Some Best Practices for Effective Prompts:
  • Embedding & Vector Database
  • What are Embeddings and why are they so important?
  • What are OpenAI's Offerings on Embeddings?
  • What is a Vector Database & How do they really work?
  • Options of Vector Databases for Solutioning
  • The Orchestrator Building Block
  • Some Orchestrator Application framework for AI development

Chapter 8:

Common Architectural Pattern For FM/LLM Integration & Adoption

  • Introduction
  • What is Retrieval-augmented generation (RAG) Pattern for GenAI?
  • Demo Time - a GenAI based QnA python app using RAG pattern (LangChain, PineCone & Open AI api)
  • What is Fine tuning ?
  • Why do we need fine tuning? Can’t we use RAG for domain specific use cases?
  • Are Fine Tuning & Pre-Training referring to the similar process?
  • Foundation Model or Large Language Model fine-tuning techniques
  • Unsupervised / Supervised Fine Tuning (U-SFT)
  • RLHF Reinforcement Learning from Human Feedback
  • Parameter Efficient Fine Tuning (PEFT)
  • What is the difference between regular fine tuning (SFT or RLHF) and PEFT?
  • What are some Enterprise Use cases where SFT/RLHF or PEFT can be used?
  • What is LoRA (Low Rank Adaptation Model) technique?
  • What are the advantages of the LoRA method in fine tuning LLM?
  • What is QLoRA (Quantized Low Rank Adaptation Model)?

Chapter 9:

AWS Solutions for Generative AI

  • Vector Database & Semantic Search capabilities
  • How to run PostGreSQL with pgVector extension in EC2 ?
  • Overview of AWS Sagemaker and AWS Bedrock
  • Example Generative AI based Enterprise Product Search Architecture in AWS

Conclusions

References (Tools, Libraries, Articles)

WHAT INDUSTRY LEADERS & EXPERTS ARE SAYING ABOUT THIS BOOK?

Enterprise GENERATIVE AI Well Architected Framework & Patterns ” is a new addition in this very timely topic on Artificial Intelligence by Suvoraj Biswas. There are some other  publications in this area. What has impressed me about this edition is its readability and expression in a lucid manner. This book has explained to readers with a clear understanding of the fundamental principles, methodologies, and best practices for implementing generative AI in large-scale enterprise environments. Additional advantage provided in this book is the information about both open-source tools and products from premium software providers, gained by the author from many years of experience in the IT industry.

Dr Debajyoti Mukhopadhyay

Former Scientist at Bell Communications Research, New Jersey, USA

Former Dean of the School of Engineering & Applied Sciences, Bennett University, India

https://in.linkedin.com/in/debajyotimukhopadhyay

" Enterprise Generative AI Well Architected Framework & Patterns " is a comprehensive book that explores the intricacies of Building Gen AI Solutions with remarkable clarity and depth.The book covers the six building blocks that the architects should consider while building Gen AI solutions for enterprise.

In addition to the theoretical underpinnings, the book offers practical guidance for implementing Generative AI in real-world scenarios. The step-by-step examples and code snippets are exceptionally well-written, allowing readers to experiment and gain hands-on experience. It strikes the perfect balance between theory and practicality, making it an indispensable resource for anyone interested in the fascinating world of Generative AI.

Mitali Biswas

Chief Information Officer, CK Birla Hospitals, India

https://www.linkedin.com/in/connect-mitali-biswas

As LLM practitioners, most of our work focuses on how to leverage the existing models to output the results we want (the operational excellence) and overlook the other important aspects such as security, compliance, reliability, cost optimization, etc. With clear illustrations and real-world scenarios, this book not only broadens the horizon of LLM practitioners but also provides a cohesive way to tie different components into a sustainable and trustworthy LLM system. A must-read for anyone wanting to truly understand and harness the power of LLM in a comprehensive manner.

Leyuan Yu

Senior Machine Learning Scientist, Coursera, Canada

https://www.linkedin.com/in/leyuanyu/

The book is a comprehensive guide to explore how generative AI could be leveraged at the Enterprise level. It also simplifies the concept of embeddings, vector databases, and prompt engineering which are critical for successful adoption of generative AI. Certainly a good read for anyone looking to get started in this space.

Rudra Roy Choudhury

Product Manager @ Tubi, USA | Linkedin Top Voice

https://www.linkedin.com/in/rudravaswata-roychoudhury/

CHAPTER  

ONE

Enterprise Generative AI -  

The Well Architected Framework

What is Generative AI?

We have all witnessed how OpenAI has recently reshaped the digital landscape through the introduction of tools like ChatGPT, which gained a substantial user base surpassing all popular social media applications.  ChatGPT has been powered by what we call Generative AI which not only has remarkable influence in the consumer sphere but also many Enterprises are adopting to solve many business challenges which previously appeared impossible. Generative AI is a form of deep learning system which is able to generate new original contents (texts or digital media - audio or Video or images). It uses the machine learning algorithm and artificial neural networks to recognize the underlying pattern in the training data to predict new original contents without any human intervention or influences.  Before understanding Generative AI we need to understand the ecosystem first.

Artificial Intelligence, often abbreviated as AI, refers to a computer system capable of emulating human behavior and performing tasks without requiring explicit programming.

In the same ecosystem, we find Machine Learning (ML), which is a subset of AI. However, it has the ability to autonomously learn from historical data and make predictions based on the patterns it acquires. For example, through the utilization of Machine Learning, we can predict patterns such as whether people prefer takeout or dining in a restaurant during a particular season, based on historical data.

Similarly Deep Learning technique refers to another subset of Machine Learning which gets the cognitive ability in the computer systems by integrating the artificial deep neural networks just like the human brain. It entails a substantial collection of input data and hidden layers designed in a manner where the output from one layer is progressively refined in the subsequent layer, ultimately generating predictive contents.

Generative AI is the part of Deep Learning models which can generate completely new content in the form of texts or images or videos based on these supervised training data.

What is the background of GenAI - Well Architected Framework?

With the rise of ChatGPT and other Large Language Model based AI systems modern businesses are rapidly embracing the integration of Generative AI into their enterprise systems to solve many potential business challenges which previously were not possible.

As a solution architect I got the opportunity to work on several projects using Generative AI when my company decided to explore the Generative AI and Large Language Model for some potential Enterprise use cases. It has been a great learning experience for me, indeed. While exploring and designing the effective solutions we faced many kinds of challenges. Based on my learning and evaluating various aspects of Generative AI, I felt the lack of proper Architectural Framework and decentralized patterns for adopting generative AI. This brings the need to follow an uniform approach and patterns combined together so that Enterprises can focus on building robust cost effective AI products within their allocated budget and strict timeline.

Why do we need a Well Architected  Framework for GenAI based Enterprise Use Case?

A Well Architected Enterprise Generative AI Framework, embedded within the enterprise architecture, offers a structured and coherent methodology, which if adopted correctly can help mitigate the business and technical challenges while adopting the Generative AI capabilities. The framework being proposed here is constructed upon a standardized collection of building blocks and supporting pillars that facilitate the conception, implementation and management of the enterprise-grade Generative AI-based solutions. Here's why such a framework is crucial to adopt:

  • Consistency and Efficiency  - by adopting this framework your organization would be able to follow the uniform approach and pattern to build consistent and efficient products for all kinds of Gen AI needs.

  • Scalability - As the end user usage increases for the delivered products, the complexity of both the operation and maintainability increases. Adopting a predefined framework during the build stage addresses the long term scalability requirement.

  • Integration - For a large enterprise a specific solution might need to integrate with many different input data sources or systems. Enterprise Generative AI framework addresses this requirement at an early stage by allowing a loosely coupled approach and  smooth data flow.

  • Agility - The framework addresses the rapidly evolving customer requirement, usage pattern and allows the room to respond and adjust swift without compromising the stability of the delivered Gen AI  product.

Left arrow icon Right arrow icon

Key benefits

  • Learn to secure AI environments
  • Achieve excellence in AI architecture
  • Implement AI with AWS solutions

Description

The course begins with an insightful introduction to the burgeoning field of Generative AI, laying down a robust framework for understanding its applications within the AWS ecosystem. The course focuses on meticulously detailing the five pillars of the AWS Well-Architected Framework—Operational Excellence, Security, Compliance, Reliability, and Cost Optimization. Each module is crafted to provide you with a comprehensive understanding of these essential areas, integrating Generative AI technologies. You'll learn how to navigate the complexities of securing AI systems, ensuring they comply with legal and regulatory standards, and designing them for unparalleled reliability. Practical sessions on cost optimization strategies for AI projects will empower you to deliver value without compromising on performance or scalability. Furthermore, the course delves into System Architecture Excellence, emphasizing the importance of robust design principles in creating effective Generative AI solutions. The course wraps up by offering a forward-looking perspective on the Common Architectural Pattern for FM/LLM Integration & Adoption within the AWS framework. You'll gain hands-on experience with AWS solutions specifically tailored for Generative AI applications, including Lambda, API Gateway, and DynamoDB, among others.

What you will learn

Apply Operational Excellence in AI Secure Generative AI implementations Navigate compliance in AI solutions Ensure reliability in AI systems Optimize costs for AI projects Integrate FM/LLM with AWS solutions

What do you get with a Packt Subscription?

Free for first 7 days. $15.99 p/m after that. Cancel any time!
Product feature icon Unlimited ad-free access to the largest independent learning library in tech. Access this title and thousands more!
Product feature icon 50+ new titles added per month, including many first-to-market concepts and exclusive early access to books as they are being written.
Product feature icon Innovative learning tools, including AI book assistants, code context explainers, and text-to-speech.
Product feature icon Thousands of reference materials covering every tech concept you need to stay up to date.
Subscribe now
View plans & pricing

Product Details


Publication date : Apr 4, 2024
Length 113 pages
Edition : 1st Edition
Language : English
ISBN-13 : 9781836202912
Category :

Table of Contents

43 Chapters
1. What is Generative AI? Chevron down icon Chevron up icon
2. What is the background of GenAI - Well Architected Framework? Chevron down icon Chevron up icon
3. Why do we need a Well Architected Framework for GenAI based Enterprise Use Case? Chevron down icon Chevron up icon
4. What are the pillars of the Well Architected Framework? Chevron down icon Chevron up icon
5. What are the building blocks for each pillar? Chevron down icon Chevron up icon
6. What is a Large Language Model? Chevron down icon Chevron up icon
7. What is a Foundation Model? Chevron down icon Chevron up icon
8. When should we be using the Foundation Model? Chevron down icon Chevron up icon
9. Introduction to Content Moderation building block: Chevron down icon Chevron up icon
10. What is AI Guardrails and How to integrate it? Chevron down icon Chevron up icon
11. Introduction to the Compliance Pillar: Chevron down icon Chevron up icon
12. Benefits of having an Archival for Compliance building block: Chevron down icon Chevron up icon
13. What are some solutions for Archiving AI & Human interactions for Regulatory Compliance? Chevron down icon Chevron up icon
14. The FM or LLM observability Building Block: Chevron down icon Chevron up icon
15. What are some LLM Observability Solutions? Chevron down icon Chevron up icon
16. Introduction Chevron down icon Chevron up icon
17. What are some best practices to optimize Cost during Architecting? Chevron down icon Chevron up icon
18. Prompt Engineering Chevron down icon Chevron up icon
19. Why Prompt Engineering plays a crucial role in the GenAI apps? Chevron down icon Chevron up icon
20. Some Best Practices for Effective Prompts: Chevron down icon Chevron up icon
21. Embedding & Vector Database Chevron down icon Chevron up icon
22. What are Embeddings and why are they so important? Chevron down icon Chevron up icon
23. What are OpenAI's Offerings on Embeddings? Chevron down icon Chevron up icon
24. What is a Vector Database & How do they really work? Chevron down icon Chevron up icon
25. Options of Vector Databases for Solutioning Chevron down icon Chevron up icon
26. The Orchestrator Building Block Chevron down icon Chevron up icon
27. Some Orchestrator Application framework for AI development: Chevron down icon Chevron up icon
28. Introduction Chevron down icon Chevron up icon
29. What is Retrieval-augmented generation (RAG) Pattern for GenAI? Chevron down icon Chevron up icon
30. Demo Time - a GenAI based QnA python app using RAG pattern (LangChain, PineCone & Open AI api) Chevron down icon Chevron up icon
31. What is Fine tuning? Chevron down icon Chevron up icon
32. Why do we need fine tuning? Can’t we use RAG for domain specific use cases? Chevron down icon Chevron up icon
33. Are Fine Tuning & Pre-Training referring to the similar process? Chevron down icon Chevron up icon
34. Foundation Model or Large Language Model fine-tuning techniques: Chevron down icon Chevron up icon
35. Parameter Efficient Fine Tuning: Chevron down icon Chevron up icon
36. What are some Enterprise Use cases where SFT/RLHF or PEFT can be used? Chevron down icon Chevron up icon
37. What is LoRA (Low Rank Adaptation Model) technique? Chevron down icon Chevron up icon
38. What are the advantages of the LoRA method in fine tuning LLM? Chevron down icon Chevron up icon
39. What is QLoRA (Quantized Low Rank Adaptation Model) ? Chevron down icon Chevron up icon
40. Vector Database & Semantic Search capabilities Chevron down icon Chevron up icon
41. How to run PostGreSQL with pgVector extension in EC2 ? Chevron down icon Chevron up icon
42. Overview of AWS Sagemaker and AWS Bedrock Chevron down icon Chevron up icon
43. Example Generative AI based Enterprise Product Search Architecture in AWS Chevron down icon Chevron up icon

Customer reviews

Filter icon Filter
Top Reviews
Rating distribution
Empty star icon Empty star icon Empty star icon Empty star icon Empty star icon 0
(0 Ratings)
5 star 0%
4 star 0%
3 star 0%
2 star 0%
1 star 0%

Filter reviews by


No reviews found
Get free access to Packt library with over 7500+ books and video courses for 7 days!
Start Free Trial

FAQs

What is included in a Packt subscription? Chevron down icon Chevron up icon

A subscription provides you with full access to view all Packt and licnesed content online, this includes exclusive access to Early Access titles. Depending on the tier chosen you can also earn credits and discounts to use for owning content

How can I cancel my subscription? Chevron down icon Chevron up icon

To cancel your subscription with us simply go to the account page - found in the top right of the page or at https://subscription.packtpub.com/my-account/subscription - From here you will see the ‘cancel subscription’ button in the grey box with your subscription information in.

What are credits? Chevron down icon Chevron up icon

Credits can be earned from reading 40 section of any title within the payment cycle - a month starting from the day of subscription payment. You also earn a Credit every month if you subscribe to our annual or 18 month plans. Credits can be used to buy books DRM free, the same way that you would pay for a book. Your credits can be found in the subscription homepage - subscription.packtpub.com - clicking on ‘the my’ library dropdown and selecting ‘credits’.

What happens if an Early Access Course is cancelled? Chevron down icon Chevron up icon

Projects are rarely cancelled, but sometimes it's unavoidable. If an Early Access course is cancelled or excessively delayed, you can exchange your purchase for another course. For further details, please contact us here.

Where can I send feedback about an Early Access title? Chevron down icon Chevron up icon

If you have any feedback about the product you're reading, or Early Access in general, then please fill out a contact form here and we'll make sure the feedback gets to the right team. 

Can I download the code files for Early Access titles? Chevron down icon Chevron up icon

We try to ensure that all books in Early Access have code available to use, download, and fork on GitHub. This helps us be more agile in the development of the book, and helps keep the often changing code base of new versions and new technologies as up to date as possible. Unfortunately, however, there will be rare cases when it is not possible for us to have downloadable code samples available until publication.

When we publish the book, the code files will also be available to download from the Packt website.

How accurate is the publication date? Chevron down icon Chevron up icon

The publication date is as accurate as we can be at any point in the project. Unfortunately, delays can happen. Often those delays are out of our control, such as changes to the technology code base or delays in the tech release. We do our best to give you an accurate estimate of the publication date at any given time, and as more chapters are delivered, the more accurate the delivery date will become.

How will I know when new chapters are ready? Chevron down icon Chevron up icon

We'll let you know every time there has been an update to a course that you've bought in Early Access. You'll get an email to let you know there has been a new chapter, or a change to a previous chapter. The new chapters are automatically added to your account, so you can also check back there any time you're ready and download or read them online.

I am a Packt subscriber, do I get Early Access? Chevron down icon Chevron up icon

Yes, all Early Access content is fully available through your subscription. You will need to have a paid for or active trial subscription in order to access all titles.

How is Early Access delivered? Chevron down icon Chevron up icon

Early Access is currently only available as a PDF or through our online reader. As we make changes or add new chapters, the files in your Packt account will be updated so you can download them again or view them online immediately.

How do I buy Early Access content? Chevron down icon Chevron up icon

Early Access is a way of us getting our content to you quicker, but the method of buying the Early Access course is still the same. Just find the course you want to buy, go through the check-out steps, and you’ll get a confirmation email from us with information and a link to the relevant Early Access courses.

What is Early Access? Chevron down icon Chevron up icon

Keeping up to date with the latest technology is difficult; new versions, new frameworks, new techniques. This feature gives you a head-start to our content, as it's being created. With Early Access you'll receive each chapter as it's written, and get regular updates throughout the product's development, as well as the final course as soon as it's ready.We created Early Access as a means of giving you the information you need, as soon as it's available. As we go through the process of developing a course, 99% of it can be ready but we can't publish until that last 1% falls in to place. Early Access helps to unlock the potential of our content early, to help you start your learning when you need it most. You not only get access to every chapter as it's delivered, edited, and updated, but you'll also get the finalized, DRM-free product to download in any format you want when it's published. As a member of Packt, you'll also be eligible for our exclusive offers, including a free course every day, and discounts on new and popular titles.