Search icon
Subscription
0
Cart icon
Close icon
You have no products in your basket yet
Save more on your purchases!
Savings automatically calculated. No voucher code required
Arrow left icon
All Products
Best Sellers
New Releases
Books
Videos
Audiobooks
Learning Hub
Newsletters
Free Learning
Arrow right icon
Python Data Mining Quick Start Guide

You're reading from  Python Data Mining Quick Start Guide

Product type Book
Published in Apr 2019
Publisher Packt
ISBN-13 9781789800265
Pages 188 pages
Edition 1st Edition
Languages
Concepts
Author (1):
Nathan Greeneltch Nathan Greeneltch
Profile icon Nathan Greeneltch

Table of Contents (9) Chapters

Preface 1. Data Mining and Getting Started with Python Tools 2. Basic Terminology and Our End-to-End Example 3. Collecting, Exploring, and Visualizing Data 4. Cleaning and Readying Data for Analysis 5. Grouping and Clustering Data 6. Prediction with Regression and Classification 7. Advanced Topics - Building a Data Processing Pipeline and Deploying It 8. Other Books You May Enjoy

Cleaning input data

Real data is dirty and its integrity must be ensured before useful insights can be harvested. Missing or corrupt values can contribute to spurious conclusions or completely uncovered insights. In addition to data integrity, feature scaling, and variable types (that is, continuous or discrete) contribute heavily to the effectiveness of downstream methods. I will explain the reasons for these contributions in the dedicated sections for each topic.

Missing values

Missing values can ruin a data mining job. Sometimes, an entire record or row is empty, and at other times a single cell or value inside a record is missing. The latter situation is much harder to spot and, indeed, these missing cells can be quiet...

lock icon The rest of the chapter is locked
Register for a free Packt account to unlock a world of extra content!
A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.
Unlock this book and the full library FREE for 7 days
Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of
Renews at $15.99/month. Cancel anytime}