Reader small image

You're reading from  Building Data Science Solutions with Anaconda

Product typeBook
Published inMay 2022
PublisherPackt
ISBN-139781800568785
Edition1st Edition
Concepts
Right arrow
Author (1)
Dan Meador
Dan Meador
author image
Dan Meador

Dan Meador is an Engineering Manager at Anaconda and is the creator of Conda as well as a champion of open source at Anaconda. With a history of engineering and client facing roles, he has the ability to jump into any position. He has a track record of delivering as a leader and a follower in companies from the Fortune 10 to startups.
Read more about Dan Meador

Right arrow

Clustering problems

In addition to anomaly detection, there is another class of problem that takes an unsupervised approach to trying to group entities together in order to understand more about the dataset. Clustering is the process of finding elements of a dataset that contain enough similar attributes that you can determine clear distinctions from among the individual points.

There are many applications of this technique, and we'll go over the following few examples now:

  • Grouping segments of a customer base
  • Knowing which emails are promotions and which are more important

To achieve this, we can use a few different algorithms such as the following:

  • DBScan
  • K-Means clustering

While there are many more, you can be sure that these have shown promising results across various datasets and are a great place to start.

Let's look at DBscan first.

DBScan

Density-Based Spatial Clustering of Applications with Noise (or DBScan for...

lock icon
The rest of the page is locked
Previous PageNext Page
You have been reading a chapter from
Building Data Science Solutions with Anaconda
Published in: May 2022Publisher: PacktISBN-13: 9781800568785

Author (1)

author image
Dan Meador

Dan Meador is an Engineering Manager at Anaconda and is the creator of Conda as well as a champion of open source at Anaconda. With a history of engineering and client facing roles, he has the ability to jump into any position. He has a track record of delivering as a leader and a follower in companies from the Fortune 10 to startups.
Read more about Dan Meador