
You're reading from Data Science with Python [Instructor Edition]: Combine Python with machine learning principles to discover hidden patterns in raw data

Product type Hardcover

Published in Jul 2019

Publisher

ISBN-13 9781838552862

Length 426 pages

Edition 1st Edition

Languages

Python

Tools

Combine

Concepts

Data Science

Authors (4):

Mohamed Noordeen Alaudeen

Rohan Chopra

Aaron England

Lakshay Sharma

Table of Contents (10) Chapters

About the Book

1. Introduction to Data Science and Data Pre-Processing FREE CHAPTER

2. Data Visualization

3. Introduction to Machine Learning via Scikit-Learn

4. Dimensionality Reduction and Unsupervised Learning

5. Mastering Structured Data

6. Decoding Images

7. Processing Human Language

8. Tips and Tricks of the Trade

1. Appendix

K-means Clustering

Like HCA, k-means uses distance to group observations into clusters that are not labeled in the data. However, rather than linking observations to each other as HCA does, k-means assigns each observation to one of k clusters, where k is chosen by the user.

To determine the cluster to which each observation belongs, k cluster centers are generated at random, and each observation is assigned to the cluster whose center is closest to it in Euclidean distance. This random initialization is similar to the random starting weights in artificial neural networks. After the cluster centers have been generated, the algorithm has two phases:

1. Assignment phase
2. Updating phase

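As a rough illustration of the two phases, the sketch below runs k-means on a handful of two-dimensional points using only the standard library. It assumes that the assignment phase gives each observation to its nearest center and that the updating phase moves each center to the mean of its assigned observations, which is the standard formulation of k-means rather than something spelled out in the text above. The function names (`assign`, `update`, `kmeans`), the toy `points`, and the choice to draw the starting centers from the observations themselves are all assumptions made for this example.

```python
import math
import random


def assign(points, centers):
    """Assignment phase: index of the nearest center (Euclidean) for each point."""
    return [
        min(range(len(centers)), key=lambda j: math.dist(p, centers[j]))
        for p in points
    ]


def update(points, labels, centers):
    """Updating phase: move each center to the mean of its assigned points."""
    new_centers = []
    for j, old in enumerate(centers):
        members = [p for p, lab in zip(points, labels) if lab == j]
        if not members:
            new_centers.append(old)  # an empty cluster keeps its old center
            continue
        new_centers.append(tuple(sum(c) / len(members) for c in zip(*members)))
    return new_centers


def kmeans(points, k, seed=0, max_iter=100):
    """Alternate the two phases until the assignments stop changing."""
    rng = random.Random(seed)
    # Assumption: random starting centers are k distinct observations.
    centers = rng.sample(points, k)
    labels = assign(points, centers)
    for _ in range(max_iter):
        centers = update(points, labels, centers)
        new_labels = assign(points, centers)
        if new_labels == labels:
            break
        labels = new_labels
    return labels, centers


# Hypothetical toy data: two well-separated groups of three points.
points = [(1.0, 1.0), (1.2, 0.8), (0.9, 1.1), (8.0, 8.0), (8.2, 7.9), (7.8, 8.1)]
labels, centers = kmeans(points, k=2)
print(labels)
print(centers)
```

On data this cleanly separated, the first three points should share one label and the last three the other. Which group receives label 0 depends on the random start.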
Note
The random generation of cluster centers is important to remember, and we will revisit it later in this chapter. Some consider it a weakness of the algorithm, because results can vary between fits of the same model on the same data, and it is not guaranteed to assign observations to the...
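To make the sensitivity to starting centers concrete, here is a minimal sketch that runs the same k-means procedure on the same four points from two different sets of starting centers. To keep the example reproducible, the starting centers are fixed by hand instead of being drawn at random, so they stand in for two possible random draws. The helper names (`nearest`, `run_kmeans`, `sse`) and the data are hypothetical, and the update rule (move each center to the mean of its assigned points) is the standard formulation, assumed here.

```python
import math


def nearest(point, centers):
    """Index of the center closest to the point in Euclidean distance."""
    return min(range(len(centers)), key=lambda j: math.dist(point, centers[j]))


def run_kmeans(points, start_centers, max_iter=100):
    """Repeat assignment and update until the assignments stop changing."""
    centers = list(start_centers)
    labels = None
    for _ in range(max_iter):
        new_labels = [nearest(p, centers) for p in points]
        if new_labels == labels:
            break
        labels = new_labels
        for j in range(len(centers)):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:  # leave the center of an empty cluster where it is
                centers[j] = tuple(sum(c) / len(members) for c in zip(*members))
    return labels, centers


def sse(points, labels, centers):
    """Sum of squared distances from each point to its assigned center."""
    return sum(math.dist(p, centers[lab]) ** 2 for p, lab in zip(points, labels))


# Four corners of a wide rectangle: the natural split is left vs. right.
points = [(0.0, 0.0), (0.0, 1.0), (4.0, 0.0), (4.0, 1.0)]

# Two hand-picked starts standing in for two different random draws.
labels_a, centers_a = run_kmeans(points, [(0.0, 0.0), (4.0, 0.0)])
labels_b, centers_b = run_kmeans(points, [(0.0, 0.0), (0.0, 1.0)])

sse_a = sse(points, labels_a, centers_a)
sse_b = sse(points, labels_b, centers_b)
print(labels_a, sse_a)
print(labels_b, sse_b)
```

The first start settles on the left/right split with a sum of squared distances of 1.0. The second settles on a top/bottom split with a value of 16.0 and never leaves it, because no point is closer to the other cluster's center. Both runs have converged, but only one found the tighter clustering.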
