Search icon CANCEL
Subscription
0
Cart icon
Your Cart (0 item)
Close icon
You have no products in your basket yet
Save more on your purchases! discount-offer-chevron-icon
Savings automatically calculated. No voucher code required.
Arrow left icon
Explore Products
Best Sellers
New Releases
Books
Videos
Audiobooks
Learning Hub
Newsletter Hub
Free Learning
Arrow right icon
timer SALE ENDS IN
0 Days
:
00 Hours
:
00 Minutes
:
00 Seconds
Arrow up icon
GO TO TOP
Mathematics of Machine Learning

You're reading from   Mathematics of Machine Learning Master linear algebra, calculus, and probability for machine learning

Arrow left icon
Product type Paperback
Published in May 2025
Publisher Packt
ISBN-13 9781837027873
Length 730 pages
Edition 1st Edition
Arrow right icon
Author (1):
Arrow left icon
Tivadar Danka Tivadar Danka
Author Profile Icon Tivadar Danka
Tivadar Danka
Arrow right icon
View More author details
Toc

Table of Contents (36) Chapters Close

Introduction Part 1: Linear Algebra FREE CHAPTER
1 Vectors and Vector Spaces 2 The Geometric Structure of Vector Spaces 3 Linear Algebra in Practice 4 Linear Transformations 5 Matrices and Equations 6 Eigenvalues and Eigenvectors 7 Matrix Factorizations 8 Matrices and Graphs References
Part 2: Calculus
9 Functions 10 Numbers, Sequences, and Series 11 Topology, Limits, and Continuity 12 Differentiation 13 Optimization 14 Integration References
Part 3: Multivariable Calculus
15 Multivariable Functions 16 Derivatives and Gradients 17 Optimization in Multiple Variables References
Part 4: Probability Theory
18 What is Probability? 19 Random Variables and Distributions 20 The Expected Value References
Part 5: Appendix
Other Books You May Enjoy
Index
Appendix A It’s Just Logic 1. Appendix B The Structure of Mathematics 2. Appendix C Basics of Set Theory 3. Appendix D Complex Numbers

16
Derivatives and Gradients

Now that we understand why multivariate functions and high-dimensional spaces are more complex than the single-variable case we studied earlier, it’s time to see how to do things in the general case.

To recap quickly, our goal in machine learning is to optimize functions with millions of variables. For instance, think about a neural network N(x,w) trained for binary classification, where

  • x n is the input data,
  • w m is the vector compressing all of the weight parameters,
  • and N(x,w) [0,1] is the prediction, representing the probability of belonging to the positive class.

In the case of, say, binary cross-entropy loss, we have the loss function

 d L(w ) = − ∑ y log N (x ,w ), i i k=1

where xi is the i-th data point with ground truth yi ∈{0,1}. See, I told you that we have to write much more in multivariable calculus. (We’ll talk about binary cross-entropy loss in Chapter 20.)

Training the neural network is the same as finding a...

lock icon The rest of the chapter is locked
Visually different images
CONTINUE READING
83
Tech Concepts
36
Programming languages
73
Tech Tools
Icon Unlimited access to the largest independent learning library in tech of over 8,000 expert-authored tech books and videos.
Icon Innovative learning tools, including AI book assistants, code context explainers, and text-to-speech.
Icon 50+ new titles added per month and exclusive early access to books as they are being written.
Mathematics of Machine Learning
Register for a free Packt account to unlock a world of extra content!
A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.
Unlock this book and the full library FREE for 7 days
Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of
Renews at $19.99/month. Cancel anytime
Modal Close icon
Modal Close icon