Building Data Science Solutions with Anaconda


Product type: Book
Published: May 2022
Publisher: Packt
ISBN-13: 9781800568785
Pages: 330
Edition: 1st
Author: Dan Meador

Table of Contents

Preface
Part 1: The Data Science Landscape – Open Source to the Rescue
Chapter 1: Understanding the AI/ML Landscape
Chapter 2: Analyzing Open Source Software
Chapter 3: Using the Anaconda Distribution to Manage Packages
Chapter 4: Working with Jupyter Notebooks and NumPy
Part 2: Data Is the New Oil, Models Are the New Refineries
Chapter 5: Cleaning and Visualizing Data
Chapter 6: Overcoming Bias in AI/ML
Chapter 7: Choosing the Best AI Algorithm
Chapter 8: Dealing with Common Data Problems
Part 3: Practical Examples and Applications
Chapter 9: Building a Regression Model with scikit-learn
Chapter 10: Explainable AI - Using LIME and SHAP
Chapter 11: Tuning Hyperparameters and Versioning Your Model
Other Books You May Enjoy

Dealing with too much data

More data is usually better, but not always. Extra data can actively hurt an outcome. Chapter 1, Understanding the AI/ML Landscape, covered such a case: a father gave his child an extra example of a tiger, but the example was actually a panther. A mislabeled example like that becomes a harmful addition to the training set and leads to a worse learning outcome for your model.
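To make the tiger and panther case concrete, here is a minimal sketch of how one mislabeled example can lower accuracy. It is an illustration only, not the book's code: the two features (stripe contrast and coat darkness), all data points, and the helper names `predict_1nn` and `accuracy` are hypothetical, and a simple 1-nearest-neighbor rule stands in for whatever model you might actually train.

```python
from math import dist  # Euclidean distance, Python 3.8+


def predict_1nn(train, point):
    """Return the label of the training example closest to `point`."""
    _, label = min(train, key=lambda example: dist(example[0], point))
    return label


def accuracy(train, test):
    """Fraction of test points whose predicted label matches the true label."""
    correct = sum(predict_1nn(train, x) == y for x, y in test)
    return correct / len(test)


# Hypothetical features: (stripe contrast, coat darkness), both scaled to 0..1.
clean_train = [
    ((0.9, 0.2), "tiger"),
    ((0.8, 0.3), "tiger"),
    ((0.1, 0.9), "panther"),
    ((0.2, 0.8), "panther"),
]

# The "extra" example: it looks like a panther but is labeled as a tiger.
noisy_train = clean_train + [((0.15, 0.85), "tiger")]

test = [
    ((0.85, 0.25), "tiger"),
    ((0.95, 0.15), "tiger"),
    ((0.15, 0.88), "panther"),
    ((0.12, 0.84), "panther"),
]

clean_acc = accuracy(clean_train, test)
noisy_acc = accuracy(noisy_train, test)
print(f"clean: {clean_acc:.2f}, noisy: {noisy_acc:.2f}")  # clean: 1.00, noisy: 0.50
```

With this toy data, the single mislabeled point sits closest to both panther test points, so adding it drops accuracy from 1.00 to 0.50. Real datasets are rarely this sensitive to one example, but the direction of the effect is the same.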

How are you supposed to know this? Understand the data. This is a common theme in this chapter, in the book, and in the real world. If you don't start there, everything else becomes more challenging. It's similar to understanding bias, as discussed in Chapter 6, Overcoming Bias in AI/ML.

Sometimes, though, you won't or can't have a full grasp of the data, but you can use tools to help you out. The first clue that you can...
