Reader small image

You're reading from  The Machine Learning Solutions Architect Handbook - Second Edition

Product typeBook
Published inApr 2024
PublisherPackt
ISBN-139781805122500
Edition2nd Edition
Right arrow
Author (1)
David Ping
David Ping
author image
David Ping

David Ping is an accomplished author and industry expert with over 28 years of experience in the field of data science and technology. He currently serves as the leader of a team of highly skilled data scientists and AI/ML solutions architects at AWS. In this role, he assists organizations worldwide in designing and implementing impactful AI/ML solutions to drive business success. David's extensive expertise spans a range of technical domains, including data science, ML solution and platform design, data management, AI risk, and AI governance. Prior to joining AWS, David held positions in renowned organizations such as JPMorgan, Credit Suisse, and Intel Corporation, where he contributed to the advancements of science and technology through engineering and leadership roles. With his wealth of experience and diverse skill set, David brings a unique perspective and invaluable insights to the field of AI/ML.
Read more about David Ping

Right arrow

Preface

As artificial intelligence (AI) continues to gain traction across diverse industries, the need for proficient machine learning (ML) solutions architects is on the rise. These professionals play a pivotal role in bridging business requirements with ML solutions, crafting ML technology platforms that address both business and technical challenges. This book is designed to equip individuals with a comprehensive understanding of business use cases, ML algorithms, system architecture patterns, ML tools, AI risk management, enterprise AI adoption strategies, and the emerging field of generative AI.

Upon completing this book, you will possess a comprehensive understanding of AI/ML and generative AI topics, encompassing business use cases, scientific principles, technological underpinnings, architectural considerations, risk management, operational aspects, and the journey towards enterprise adoption. Moreover, you will acquire hands-on technical proficiency with a diverse array of open-source and AWS technologies, empowering you to build and deploy cutting-edge AI/ML and generative AI solutions effectively. This holistic knowledge and practical skillset will enable you to articulate and address the multifaceted challenges and opportunities presented by these disruptive technologies.

Who this book is for

This book is designed for two primary audiences: developers and cloud architects who are looking for guidance and hands-on learning materials to become ML solutions architects, and experienced ML architecture practitioners and data scientists who are looking to develop a broader understanding of industry ML use cases, enterprise data and ML architecture patterns, data management and ML tools, ML governance, and advanced ML engineering techniques. This book can also benefit data engineers and cloud system administrators looking to understand how data management and cloud system architecture fit into the overall ML platform architecture. Risk professionals, AI product managers, and technology decision makers will also benefit from topics on AI risk management, business AI use cases, and ML maturity journey and best practices.

This book assumes you have some Python programming knowledge and are familiar with AWS services. Some of the chapters are designed for ML beginners to learn the core ML fundamentals, and they might overlap with the knowledge already possessed by experienced ML practitioners.

What this book covers

Chapter 1, Navigating the ML Lifecycle with ML Solutions Architecture, introduces ML solutions architecture functions, covering its fundamentals and scope.

Chapter 2, Exploring ML Business Use Cases, talks about real-world applications of AI/ML across various industries such as financial services, healthcare, media entertainment, automotive, manufacturing, and retail.

Chapter 3, Exploring ML Algorithms, introduces common ML and deep learning algorithms for classification, regression, clustering, time series, recommendations, computer vision, natural language processing, and generative AI tasks. You will get hands-on experience of setting up a Jupyter server and building ML models on your local machine.

Chapter 4, Data Management for ML, addresses the crucial topic of data management for ML, detailing how to leverage an array of AWS services to construct robust data management architectures. You will develop hands-on skills with AWS services for building data management pipelines for ML.

Chapter 5, Exploring Open-Source ML Libraries, covers the core features of scikit-learn, Spark ML, PyTorch and TensorFlow, and how to use these ML libraries for data preparation, model training, and model serving. You will practice building deep learning models using TensorFlow and PyTorch.

Chapter 6, Kubernetes Container Orchestration Infrastructure Management, introduces containers, Kubernetes concepts, Kubernetes networking, and Kubernetes security. Kubernetes is a core open-source infrastructure for building open-source ML solutions. You will also practice setting up the Kubernetes platform on AWS EKS and deploying an ML workload in Kubernetes.

Chapter 7, Open-Source ML Platforms, talks about the core concepts and the technical details of various open-source ML platform technologies, such as Kubeflow, MLflow, AirFlow, and Seldon Core. The chapter also covers how to use these technologies to build a data science environment and ML automation pipeline.

Chapter 8, Building a Data Science Environment Using AWS ML Services, introduces various AWS managed services for building data science environments, including Amazon SageMaker, Amazon ECR, and Amazon CodeCommit. You will also get hands-on experience with these services to configure a data science environment for experimentation and model training.

Chapter 9, Designing an Enterprise ML Architecture with AWS ML Services, talks about the core requirements for an enterprise ML platform, discusses the architecture patterns and best practices for building an enterprise ML platform on AWS, and dives deep into the various core ML capabilities of SageMaker and other AWS services.

Chapter 10, Advanced ML Engineering, provides insights into advanced ML engineering aspects such as distributed model training and low-latency model serving, crucial for meeting the demands of large-scale model training and high-performance serving requirements. You will also get hands on with distributed data parallel model training using a SageMaker training cluster.

Chapter 11, Building ML Solutions with AWS AI Services, will introduce AWS AI services and the types of problems these services can help solve without building an ML model from scratch. You will learn about the core capabilities of some key AI services and where they can be leveraged for building ML-powered business applications.

Chapter 12, AI Risk Management, explores AI risk management principles, frameworks, and risk and mitigation, providing comprehensive coverage of AI risk scenarios, guiding principles, frameworks, and risk mitigation considerations across the entire ML lifecycle. It elucidates how ML platforms can facilitate governance through documentation, model inventory maintenance, and monitoring processes.

Chapter 13, Bias, Explainability, Privacy, and Adversarial Attacks, delves into the technical aspects of various risks, providing in-depth explanations of bias detection techniques, model explainability methods, privacy preservation approaches, as well as adversarial attack scenarios and corresponding mitigation strategies.

Chapter 14, Charting the Course of Your ML Journey, outlines the stages of adoption and presents a corresponding maturity model designed to facilitate progress along the ML journey. Additionally, it addresses key considerations essential for overcoming the hurdles encountered throughout this process.

Chapter 15, Navigating the Generative AI Project Lifecycle, discusses the advancement and economic impact of generative AI, the various industry trends in generative AI adoption, and guides readers through the various stages of a generative AI project, from ideation to deployment, exploring various generative AI technologies, and limitations and challenges along the way.

Chapter 16, Designing Generative AI Platforms and Solutions, explores generative AI platforms’ architecture, the retrieval-augmented generation (RAG) application architecture and best practices, considerations for generative AI production deployment and practical generative AI-powered business applications across diverse industry use cases.

The chapter finishes with a discussion on artificial general intelligence (AGI) and various theoretical approaches the research community has taken in their pursuit of AGI.

To get the most out of this book

If you are using the digital version of this book, we advise you to type the code yourself or access the code from the book’s GitHub repository (a link is available in the next section). Doing so will help you avoid any potential errors related to the copying and pasting of code.

For the hardware/software requirements for the book, all you will need is a Windows or Mac machine, and an AWS account.

Download the example code files

You can download the example code files for this book from GitHub at https://github.com/PacktPublishing/The-Machine-Learning-Solutions-Architect-and-Risk-Management-Handbook-Second-Edition/. If there’s an update to the code, it will be updated in the GitHub repository.

We also have other code bundles from our rich catalog of books and videos available at https://github.com/PacktPublishing/. Check them out!

Download the color images

We also provide a PDF file that has color images of the screenshots and diagrams used in this book. You can download it here: https://packt.link/gbp/9781805122500.

Conventions used

There are a number of text conventions used throughout this book.

Code in text: Indicates code words in text, database table names, folder names, filenames, file extensions, pathnames, dummy URLs, user input, and Twitter handles. Here is an example: “Mount the downloaded WebStorm-10*.dmg disk image file as another disk in your system.”

A block of code is set as follows:

import pandas as pd
churn_data = pd.read_csv("churn.csv")
churn_data.head()

When we wish to draw your attention to a particular part of a code block, the relevant lines or items are set in bold:

# The following command calculates the various statistics
for the features.
churn_data.describe()
# The following command displays the histograms for the
different features.
# You can replace the column names to plot the histograms
for other features
churn_data.hist(['CreditScore', 'Age', 'Balance'])
# The following command calculate the correlations among
features
churn_data.corr()

Any command-line input or output is written as follows:

! pip3 install --upgrade tensorflow

Bold: Indicates a new term, an important word, or words that you see on screen. For instance, words in menus or dialog boxes appear in bold. Here is an example: “An example of a deep learning-based solution is the Amazon Echo virtual assistant.”

Warnings or important notes appear like this.

Tips and tricks appear like this.

Get in touch

Feedback from our readers is always welcome.

General feedback: If you have questions about any aspect of this book, email us at customercare@packtpub.com and mention the book title in the subject of your message.

Errata: Although we have taken every care to ensure the accuracy of our content, mistakes do happen. If you have found a mistake in this book, we would be grateful if you would report this to us. Please visit www.packtpub.com/support/errata and fill in the form.

Piracy: If you come across any illegal copies of our works in any form on the internet, we would be grateful if you would provide us with the location address or website name. Please contact us at copyright@packt.com with a link to the material.

If you are interested in becoming an author: If there is a topic that you have expertise in and you are interested in either writing or contributing to a book, please visit authors.packtpub.com.

Share your thoughts

Once you’ve read The Machine Learning Solutions Architect Handbook, Second Edition, we’d love to hear your thoughts! Please click here to go straight to the Amazon review page for this book and share your feedback.

Your review is important to us and the tech community and will help us make sure we’re delivering excellent quality content.

Download a free PDF copy of this book

Thanks for purchasing this book!

Do you like to read on the go but are unable to carry your print books everywhere?

Is your eBook purchase not compatible with the device of your choice?

Don’t worry, now with every Packt book you get a DRM-free PDF version of that book at no cost.

Read anywhere, any place, on any device. Search, copy, and paste code from your favorite technical books directly into your application.

The perks don’t stop there, you can get exclusive access to discounts, newsletters, and great free content in your inbox daily.

Follow these simple steps to get the benefits:

  1. Scan the QR code or visit the link below:

https://packt.link/free-ebook/9781805122500

  1. Submit your proof of purchase.
  2. That’s it! We’ll send your free PDF and other benefits to your email directly.
lock icon
The rest of the chapter is locked
You have been reading a chapter from
The Machine Learning Solutions Architect Handbook - Second Edition
Published in: Apr 2024Publisher: PacktISBN-13: 9781805122500
Register for a free Packt account to unlock a world of extra content!
A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.
undefined
Unlock this book and the full library FREE for 7 days
Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of
Renews at $15.99/month. Cancel anytime

Author (1)

author image
David Ping

David Ping is an accomplished author and industry expert with over 28 years of experience in the field of data science and technology. He currently serves as the leader of a team of highly skilled data scientists and AI/ML solutions architects at AWS. In this role, he assists organizations worldwide in designing and implementing impactful AI/ML solutions to drive business success. David's extensive expertise spans a range of technical domains, including data science, ML solution and platform design, data management, AI risk, and AI governance. Prior to joining AWS, David held positions in renowned organizations such as JPMorgan, Credit Suisse, and Intel Corporation, where he contributed to the advancements of science and technology through engineering and leadership roles. With his wealth of experience and diverse skill set, David brings a unique perspective and invaluable insights to the field of AI/ML.
Read more about David Ping