Reader small image

You're reading from  Building Data Science Solutions with Anaconda

Product typeBook
Published inMay 2022
PublisherPackt
ISBN-139781800568785
Edition1st Edition
Concepts
Right arrow
Author (1)
Dan Meador
Dan Meador
author image
Dan Meador

Dan Meador is an Engineering Manager at Anaconda and is the creator of Conda as well as a champion of open source at Anaconda. With a history of engineering and client facing roles, he has the ability to jump into any position. He has a track record of delivering as a leader and a follower in companies from the Fortune 10 to startups.
Read more about Dan Meador

Right arrow

Overcoming sample bias

Sample bias is when the choice of data doesn't reflect what is present in the real world. This is also referred to as selection bias. As with many types of bias, this can be completely harmless or very impactful, depending on the application.

In the following diagram, you can see a visual representation of what this looks like. There is hypothetical real-world data on the left that would be helpful (represented as Input z), but for one reason or another, it did not make it into the data that is included in the training dataset:

Figure 6.2 – Sample bias

When we leave this valuable data out, it is detrimental to everyone involved. The previous diagram is more abstract, so let's look at some more concrete examples of what sample bias could look like.

Examples of sample bias

The following items are examples of where sample bias could exist. Of course, this isn't close to an exhaustive list but helps to give...

lock icon
The rest of the page is locked
Previous PageNext Page
You have been reading a chapter from
Building Data Science Solutions with Anaconda
Published in: May 2022Publisher: PacktISBN-13: 9781800568785

Author (1)

author image
Dan Meador

Dan Meador is an Engineering Manager at Anaconda and is the creator of Conda as well as a champion of open source at Anaconda. With a history of engineering and client facing roles, he has the ability to jump into any position. He has a track record of delivering as a leader and a follower in companies from the Fortune 10 to startups.
Read more about Dan Meador