You're reading from Enhancing Deep Learning with Bayesian Inference

Product type: Book
Published in: Jun 2023
Publisher: Packt
ISBN-13: 9781803246888
Edition: 1st Edition

Authors (3):

Matt Benatan

Matt Benatan is a Principal Research Scientist at Sonos and a Simon Industrial Fellow at the University of Manchester. His work involves research in robust multimodal machine learning, uncertainty estimation, Bayesian optimization, and scalable Bayesian inference.

Jochem Gietema

Jochem Gietema is an Applied Scientist at Onfido in London, where he has developed and deployed several patented solutions related to anomaly detection, computer vision, and interactive data visualisation.

Marian Schneider

Marian Schneider is an applied scientist in machine learning. His work involves developing and deploying applications in computer vision, ranging from brain image segmentation and uncertainty estimation to smarter image capture on mobile devices.

Chapter 4
Introducing Bayesian Deep Learning

In Chapter 2, Fundamentals of Bayesian Inference, we saw how traditional methods for Bayesian inference can be used to produce model uncertainty estimates, and we introduced the properties of well-calibrated and well-principled methods for uncertainty estimation. While these traditional methods are powerful in many applications, Chapter 2 also highlighted some of their limitations with respect to scaling. In Chapter 3, Fundamentals of Deep Learning, we saw the impressive things DNNs are capable of given large amounts of data, but we also learned that they aren’t perfect. In particular, they often lack robustness for out-of-distribution data – a major concern when we consider the deployment of these methods in real-world applications.

Figure 4.1: BDL combines the strengths of both deep learning and traditional Bayesian inference

BDL...

4.1 Technical requirements

To complete the practical tasks in this chapter, you will need a Python 3.8 environment with the SciPy stack and the following additional Python packages installed:

  • TensorFlow 2.0

  • TensorFlow Probability

  • Seaborn plotting library

All of the code for this book can be found in the GitHub repository for the book: https://github.com/PacktPublishing/Enhancing-Deep-Learning-with-Bayesian-Inference.
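
If you'd like to confirm that your environment is set up correctly, a quick check along the following lines can help (a minimal sketch – the version attributes printed here are the standard ones exposed by these packages):

import scipy
import seaborn
import tensorflow as tf
import tensorflow_probability as tfp

# Print the installed versions; we expect a 2.x TensorFlow release.
print("TensorFlow:", tf.__version__)
print("TensorFlow Probability:", tfp.__version__)
print("SciPy:", scipy.__version__)
print("Seaborn:", seaborn.__version__)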

4.2 The ideal BNN

As we saw in the previous chapter, a standard neural network is made up of multiple layers, and each of these layers contains a number of perceptrons. Each perceptron combines a multiplicative component (weight) with an additive component (bias). Each weight and bias is a single value – a point estimate – and, in combination, these parameters transform the input to the perceptron. As we’ve seen, multiple layers of perceptrons are capable of achieving impressive feats when trained via backpropagation. However, these point estimates contain very limited information – let’s take a look.

Generally speaking, the goal of deep learning is to find (potentially very, very many) parameter values that best map a set of inputs onto a set of outputs. That is, given some data, for each parameter in our network, we’ll choose the parameter that best describes the data. This often boils down to taking the mean –...
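
To make the limitation of point estimates concrete, consider the following minimal sketch (the weight values here are invented purely for illustration):

import numpy as np

# Hypothetical weight values that each explain the training data about equally well.
plausible_weights = np.array([0.7, 0.9, 1.1, 1.3])

# A standard network commits to a single point estimate – here, the mean...
w_point = plausible_weights.mean()  # 1.0

# ...which discards the spread that tells us how confident we should be in that value.
w_spread = plausible_weights.std()
print(w_point, w_spread)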

4.3 BDL fundamentals

Throughout the rest of the book, we will introduce a range of methods necessary to make BDL possible. A number of common themes run through these methods. We’ll cover them here, so that we have a good understanding of these concepts when we encounter them later on.

These concepts include the following:

  • Gaussian assumptions: With many BDL methods, we use Gaussian assumptions to make things computationally tractable

  • Uncertainty sources: We’ll take a look at the different sources of uncertainty, and how we can determine the contributions of these sources for some BDL methods

  • Likelihoods: We were introduced to likelihoods in Chapter 2, Fundamentals of Bayesian Inference, and here we’ll learn more about the importance of the likelihood as a metric for evaluating the calibration of probabilistic models (see the short sketch after this list)
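
Since the likelihood will come up repeatedly, here is a minimal sketch of how it can serve as an evaluation metric (the predictions and targets are invented for illustration, and we assume Gaussian predictive distributions):

import tensorflow as tf
import tensorflow_probability as tfp

tfd = tfp.distributions

# Hypothetical per-input predictions from a probabilistic model: a mean and a standard deviation.
pred_mean = tf.constant([2.1, 0.4, 1.7])
pred_std = tf.constant([0.3, 0.5, 0.2])
y_true = tf.constant([2.0, 0.9, 1.6])

# The average negative log-probability of the observations under the predicted Gaussians is the
# negative log-likelihood (NLL); lower values indicate that the predicted distributions fit better.
pred_dist = tfd.Normal(loc=pred_mean, scale=pred_std)
nll = -tf.reduce_mean(pred_dist.log_prob(y_true))
print(float(nll))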

Let’s look at each of these in the following subsections.

4.3.1 Gaussian assumptions

In the ideal case described previously...
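
To make the Gaussian assumption concrete, here is a minimal numerical sketch (the values are illustrative): rather than storing a single number per weight, we store a mean and a standard deviation, which lets us draw many plausible weight values.

import numpy as np

rng = np.random.default_rng(0)

# Gaussian assumption: the weight is summarised by a mean and a standard deviation.
w_mean, w_std = 0.8, 0.1
w_samples = rng.normal(w_mean, w_std, size=1000)

# Passing an input through the sampled weights yields a distribution over outputs,
# rather than the single output a point estimate would give.
x = 2.0
outputs = w_samples * x
print(outputs.mean(), outputs.std())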

4.4 Tools for BDL

In this chapter, as well as in Chapter 2, Fundamentals of Bayesian Inference, we’ve seen a lot of equations involving probability. While it’s possible to create BDL models without a probability library, having a library that supports some of the fundamental functions makes things much easier. As we’re using TensorFlow for the examples in this book, we’ll be using the TensorFlow Probability (TFP) library to help us with some of these probabilistic components. In this section, we’ll introduce TFP and show how it can be used to easily implement many of the concepts we’ve seen in Chapter 2, Fundamentals of Bayesian Inference, and in this chapter.

Much of the content up to this point has been about introducing the concept of working with distributions. As such, the first TFP module we’ll learn about is the distributions module. Let’s take a look:

 
import tensorflow_probability as tfp
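
To give a flavour of the module before the chapter continues, here is a minimal sketch that builds on the import above (the distribution and values are illustrative):

tfd = tfp.distributions

# A univariate Gaussian; loc is the mean and scale is the standard deviation.
normal = tfd.Normal(loc=0.0, scale=1.0)

samples = normal.sample(5)            # draw five samples from the distribution
log_probs = normal.log_prob(samples)  # log-density evaluated at those samples
print(samples)
print(log_probs)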

4.5 Summary

In this chapter, we were introduced to the fundamental concepts that we’ll need to progress through the book and learn how to implement and use BNNs. Most crucially, we learned about the ideal BNN, which introduced us to the core ideas underlying BDL, and the computational difficulties of achieving this in practice. We also covered the fundamental practical methods used in BDL, giving us a grounding in the concepts that allow us to implement computationally tractable BNNs.

The chapter also introduced the concept of uncertainty sources, describing the difference between data and model uncertainty, how these contribute to total uncertainty, and how we can estimate the contributions of different types of uncertainty with various models. We also introduced one of the most fundamental components in probabilistic inference – the likelihood function – and learned how it can help us to train better-principled and better-calibrated models. Lastly, we were...

4.6 Further reading

This chapter has introduced the material necessary to start working with BDL; however, there are many resources that go into more depth on topics such as uncertainty sources. The following are a few recommendations for readers interested in exploring the theory and code in more depth:

  • Machine Learning: A Probabilistic Perspective, Murphy: Kevin Murphy’s extremely popular book on machine learning has become a staple for students and researchers in the field. This book provides a detailed treatment of machine learning from a probabilistic standpoint, unifying concepts from statistics, machine learning, and Bayesian probability.

  • TensorFlow Probability Tutorials: In this book, we’ll see how TensorFlow Probability can be used to develop BNNs, but the TFP website includes a wide array of tutorials addressing probabilistic programming more generally: https://www.tensorflow.org/probability/overview

  • Pyro Tutorials: Pyro is a PyTorch-based library for probabilistic...
