Reader small image

You're reading from  Machine Learning with Python

Product typeBook
Published inMar 2024
PublisherPackt
ISBN-139781835461969
Edition1st Edition
Right arrow
Author (1)
Oliver Theobald
Oliver Theobald
author image
Oliver Theobald

Oliver Theobald, a technical writer and best-selling author, excels in AI, fintech, and cloud computing. With global experience, he now resides between China and Japan, deepening his expertise in technology. As an instructor, Oliver emphasizes clarity and engagement, stripping away jargon to make complex topics accessible. His courses aim to empower both beginners and professionals with practical skills for success in the tech industry, making learning both effective and enjoyable.
Read more about Oliver Theobald

Right arrow

DATA SCRUBBING

 

Similar to Swiss or Japanese watch design, a good machine learning model should run smoothly and contain no extra parts. This means avoiding syntax or other errors that prevent the code from executing and removing redundant variables that might clog up the model’s decision path.

This inclination towards simplicity extends to beginners coding their first model. When working with a new algorithm, it helps to create a minimal viable model and add complexity to the code later. If you find yourself at an impasse, look at the troublesome element and ask, “Do I need it?” If the model can’t handle missing values or multiple variable types, the quickest cure is to remove those variables. This should help the afflicted model spring to life and breathe normally. Once the model is working, you can go back and add complexity to your code.

Let’s now take a look at specific data scrubbing techniques to prepare, streamline,...

lock icon
The rest of the chapter is locked
You have been reading a chapter from
Machine Learning with Python
Published in: Mar 2024Publisher: PacktISBN-13: 9781835461969
Register for a free Packt account to unlock a world of extra content!
A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.
undefined
Unlock this book and the full library FREE for 7 days
Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of
Renews at $15.99/month. Cancel anytime

Author (1)

author image
Oliver Theobald

Oliver Theobald, a technical writer and best-selling author, excels in AI, fintech, and cloud computing. With global experience, he now resides between China and Japan, deepening his expertise in technology. As an instructor, Oliver emphasizes clarity and engagement, stripping away jargon to make complex topics accessible. His courses aim to empower both beginners and professionals with practical skills for success in the tech industry, making learning both effective and enjoyable.
Read more about Oliver Theobald