K-means clustering

Do you remember clustering? In this article, we discuss one of the popular clustering techniques, K-means clustering.

Dots are grouped by their distance. Stars move to the centre of the dots.

Do you remember clustering? Clustering groups a set of objects so that objects in the same cluster are more similar to each other than to objects in other clusters. To form clusters, we need a measure of similarity, and we often use the Euclidean distance, which we discussed earlier. K-means clustering is one of the most popular clustering techniques. Each cluster has a center, and that center is called a centroid. A centroid can coincide with an actual data point, but it does not have to: it is the mean of the data points in its cluster. The name K-means comes from the fact that the data are described by k clusters, each represented by the mean (centroid) of its points. Do we know what k is? Usually not in advance. We start with a guess, mostly based on our prior domain knowledge.
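To make the two ideas above concrete, here is a minimal Python sketch of the Euclidean distance and of a centroid computed as the mean of a cluster's points. The function names and the three example points are made up for illustration and are not part of the course material.

```python
import math

def euclidean_distance(p, q):
    """Straight-line distance between two points with the same number of coordinates."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(p, q)))

def centroid(points):
    """Coordinate-wise mean of a non-empty list of points."""
    n = len(points)
    return tuple(sum(coords) / n for coords in zip(*points))

# Made-up example cluster (assumption, for illustration only).
cluster = [(1.0, 1.0), (2.0, 1.0), (3.0, 4.0)]

center = centroid(cluster)
print(center)                                      # (2.0, 2.0)
print(euclidean_distance((0.0, 0.0), (3.0, 4.0)))  # 5.0
```

Note that the centroid (2.0, 2.0) is not one of the three data points, which is the sense in which a centroid need not be an actual observation.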

K-means clustering works through the following steps. Let’s assume we have decided on k = 3.

1. Select three initial centroids (cluster centers). In our picture, these are the three dotted stars.
2. Calculate the distance between each centroid and every data point.
3. Assign each data point to the cluster of its nearest centroid. Since you have chosen three centroids, you end up with three clusters.
4. Move each centroid to the center (mean) of the data points assigned to it, then repeat steps 2-3. With each repetition the centroids shift, until they stop changing. The final centroids will have moved away from the initial ones, as you can see in the picture. Here "best" centroids means short distances within each group and long distances between the groups. Note that the result can depend on the initial centroids chosen in step 1.
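The four steps above can be sketched in plain Python as follows. This is a minimal illustration under stated assumptions, not the course's own implementation: the function name `k_means`, the parameters `max_iters` and `seed`, the choice of random data points as initial centroids, and the six example points are all hypothetical.

```python
import math
import random

def k_means(points, k, max_iters=100, seed=0):
    """Return (centroids, assignments) for a list of coordinate tuples."""
    rng = random.Random(seed)
    # Step 1: pick k distinct data points as the initial centroids (one common choice).
    centroids = rng.sample(points, k)
    assignments = None
    for _ in range(max_iters):
        # Steps 2-3: measure the distance to every centroid and
        # assign each point to the nearest one.
        new_assignments = [
            min(range(k), key=lambda j: math.dist(p, centroids[j]))
            for p in points
        ]
        if new_assignments == assignments:
            break  # no point changed cluster, so the centroids are stable
        assignments = new_assignments
        # Step 4: move each centroid to the mean of its assigned points.
        for j in range(k):
            members = [p for p, a in zip(points, assignments) if a == j]
            if members:  # leave a centroid in place if its cluster is empty
                centroids[j] = tuple(sum(c) / len(members) for c in zip(*members))
    return centroids, assignments

# Made-up data: three loose groups of two points each (assumption).
points = [(0.0, 0.0), (0.0, 1.0), (10.0, 10.0), (10.0, 11.0), (20.0, 0.0), (20.0, 1.0)]
centroids, assignments = k_means(points, k=3)
print(centroids)
print(assignments)
```

`math.dist` computes the Euclidean distance and requires Python 3.8 or later. Because the starting centroids are picked at random, a different `seed` may lead to a different final grouping, so in practice the algorithm is often run several times and the grouping with the smallest within-cluster distances is kept.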

This content is taken from the Sungkyunkwan University (SKKU) online course Artificial Intelligence and Machine Learning for Business.
