Reader small image

You're reading from  Practical Guide to Applied Conformal Prediction in Python

Product typeBook
Published inDec 2023
PublisherPackt
ISBN-139781805122760
Edition1st Edition
Right arrow
Author (1)
Valery Manokhin
Valery Manokhin
author image
Valery Manokhin

Valeriy Manokhin is the leading expert in the field of machine learning and Conformal Prediction. He holds a Ph.D.in Machine Learning from Royal Holloway, University of London. His doctoral work was supervised by the creator of Conformal Prediction, Vladimir Vovk, and focused on developing new methods for quantifying uncertainty in machine learning models. Valeriy has published extensively in leading machine learning journals, and his Ph.D. dissertation 'Machine Learning for Probabilistic Prediction' is read by thousands of people across the world. He is also the creator of "Awesome Conformal Prediction," the most popular resource and GitHub repository for all things Conformal Prediction.
Read more about Valery Manokhin

Right arrow

Introducing imbalanced data

In machine learning, we often come across datasets that need to be more balanced. But what does it mean for a dataset to be imbalanced?

An imbalanced dataset is one where the distribution of samples across the different classes is not uniform. In other words, one type has significantly more samples than the other(s). This is a common scenario in many real-world applications. For instance, in a dataset for fraud detection, the number of non-fraudulent transactions (majority class) is typically much higher than the number of fraudulent ones (minority class).

Imagine a medical dataset recording instances of a rare disease. Most patients will be disease-free, resulting in a large class of healthy records, while only a tiny fraction will be affected by the disease. This disproportion in the distribution of categories is what we call imbalanced data.

Imbalanced data can lead to a significant challenge in predictive modeling. By their very nature, machine...

lock icon
The rest of the page is locked
Previous PageNext Page
You have been reading a chapter from
Practical Guide to Applied Conformal Prediction in Python
Published in: Dec 2023Publisher: PacktISBN-13: 9781805122760

Author (1)

author image
Valery Manokhin

Valeriy Manokhin is the leading expert in the field of machine learning and Conformal Prediction. He holds a Ph.D.in Machine Learning from Royal Holloway, University of London. His doctoral work was supervised by the creator of Conformal Prediction, Vladimir Vovk, and focused on developing new methods for quantifying uncertainty in machine learning models. Valeriy has published extensively in leading machine learning journals, and his Ph.D. dissertation 'Machine Learning for Probabilistic Prediction' is read by thousands of people across the world. He is also the creator of "Awesome Conformal Prediction," the most popular resource and GitHub repository for all things Conformal Prediction.
Read more about Valery Manokhin