An anomaly refers to something that is unexpected or a deviation from the norm. The classic example of an anomaly in data is an outlier, which is a data point that is distant in some way from the other data points in the collection. In addition to outliers, other types of anomalies could include data that is unexpectedly missing, or data that exhibits errors. In the grand scheme of the data mining process that we outlined in Chapter 1, Expanding Your Data Mining Toolbox, detecting data anomalies could be considered part of the data cleaning step, although in this chapter we will find that sometimes using data analysis techniques actually helps us with this cleaning task. In the next few pages, we will take a tour through these different types of anomalies, show what they might look like with real data examples, discuss why they happen, and outline a few simple ways to detect them.
- Tech Categories
- Best Sellers
- New Releases
- Books
- Videos
- Audiobooks
Tech Categories Popular Audiobooks
- Articles
- Newsletters
- Free Learning
You're reading from Mastering Data Mining with Python - Find patterns hidden in your data
Product typeBook
Published inAug 2016
Reading LevelIntermediate
Publisher
ISBN-139781785889950
Edition1st Edition
Languages
Tools
Concepts
Author (1)
Megan Squire
Megan Squire
Megan Squire is a professor of computing sciences at Elon University. Her primary research interest is in collecting, cleaning, and analyzing data about how free and open source software is made. She is one of the leaders of the FLOSSmole.org, FLOSSdata.org, and FLOSSpapers.org projects.
Read more about Megan Squire
The rest of the page is locked
Unlock this book and the full library FREE for 7 days
Get unlimited access to 7000+ expert-authored eBooks and videos courses covering every tech area you can think of
Renews at $15.99/month. Cancel anytime
You have been reading a chapter from
Mastering Data Mining with Python - Find patterns hidden in your dataPublished in: Aug 2016Publisher: ISBN-13: 9781785889950
© 2016 Packt Publishing Limited All Rights Reserved
Author (1)
Megan Squire
Megan Squire is a professor of computing sciences at Elon University. Her primary research interest is in collecting, cleaning, and analyzing data about how free and open source software is made. She is one of the leaders of the FLOSSmole.org, FLOSSdata.org, and FLOSSpapers.org projects.
Read more about Megan Squire