Search icon
Subscription
0
Cart icon
Close icon
You have no products in your basket yet
Arrow left icon
All Products
Best Sellers
New Releases
Books
Videos
Audiobooks
Learning Hub
Newsletters
Free Learning
Arrow right icon
Mastering Data Mining with Python - Find patterns hidden in your data

You're reading from  Mastering Data Mining with Python - Find patterns hidden in your data

Product type Book
Published in Aug 2016
Publisher
ISBN-13 9781785889950
Pages 268 pages
Edition 1st Edition
Languages
Concepts
Author (1):
Megan Squire Megan Squire
Profile icon Megan Squire

Summary


In this last chapter, we looked at a variety of different types of data anomalies, including missing data, data errors, and outliers in data. We found many real-world examples of each of these errors, and determined that locating anomalies is important, no matter how we choose to do that. Some of the data anomalies must be located and fixed by hand using queries and domain knowledge, while others invite more sophisticated data mining approaches such as statistical methods and machine learning techniques.

The interesting thing about detecting outliers with machine learning is that we have decided to use data mining techniques in order to do better data mining. The author Douglas Adams once said that a computer nerd is someone who uses a computer in order to use a computer. I draw the line at calling us nerds when we use data mining in order to improve our data mining, but perhaps – as befits the title of the book – we can say with pride that we are getting better at Mastering Data...

lock icon The rest of the chapter is locked
Register for a free Packt account to unlock a world of extra content!
A free Packt account unlocks extra newsletters, articles, discounted offers, and much more. Start advancing your knowledge today.
Unlock this book and the full library FREE for 7 days
Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of
Renews at $15.99/month. Cancel anytime}