Reader small image

You're reading from  Building Data Science Solutions with Anaconda

Product typeBook
Published inMay 2022
PublisherPackt
ISBN-139781800568785
Edition1st Edition
Concepts
Right arrow
Author (1)
Dan Meador
Dan Meador
author image
Dan Meador

Dan Meador is an Engineering Manager at Anaconda and is the creator of Conda as well as a champion of open source at Anaconda. With a history of engineering and client facing roles, he has the ability to jump into any position. He has a track record of delivering as a leader and a follower in companies from the Fortune 10 to startups.
Read more about Dan Meador

Right arrow

Overcoming proxy bias

There are times that you can introduce bias even if you don't have any features or data points that directly link to a protected class. Remember that a protected class is something such as age, sex, and religion. This is introduced by proxy. And this boils down to data being present that strongly correlates with someone being in that group due to data in some ways bleeding into that proxy dataset.

In the next diagram, you can see a representation of how proxy bias can leak into data. On the left, you have perfectly valid X and Y data, but there is also data B, which is in the form of protected class data. Even though the data from B isn't directly used in the training dataset, it is brought in via proxy through the X dataset:

Figure 6.1 – Proxy bias

Let's look at some examples of what proxy bias could look like to make this a bit more concrete.

Examples of proxy bias

The following list contains some examples...

lock icon
The rest of the page is locked
Previous PageNext Page
You have been reading a chapter from
Building Data Science Solutions with Anaconda
Published in: May 2022Publisher: PacktISBN-13: 9781800568785

Author (1)

author image
Dan Meador

Dan Meador is an Engineering Manager at Anaconda and is the creator of Conda as well as a champion of open source at Anaconda. With a history of engineering and client facing roles, he has the ability to jump into any position. He has a track record of delivering as a leader and a follower in companies from the Fortune 10 to startups.
Read more about Dan Meador