Suppose we perform K-means clustering as follows:

1. Randomly assign observations to one of $K$ clusters.
2. Calculate the cluster means.
3. Reassign each observation to the cluster with the closest mean.
4. Repeat steps 2 and 3 until the assignments stop changing.

Note that we measure distance as the squared Euclidean distance between observations (see the sketch below).
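For concreteness, here is a minimal sketch of the procedure above in Python (my own illustration, not from the linked proof; it assumes numpy, and keeping the previous mean for an empty cluster is just one common convention):

```python
import numpy as np

def kmeans(X, K, seed=0, max_iter=100):
    """Steps 1-4 above: random initial assignment, then alternate
    computing cluster means and reassigning until labels stop changing."""
    rng = np.random.default_rng(seed)
    n = X.shape[0]
    labels = rng.integers(K, size=n)              # step 1: random assignment
    means = X[rng.choice(n, size=K, replace=False)].astype(float)
    for _ in range(max_iter):
        # Step 2: cluster means; an empty cluster keeps its previous mean
        # (empty clusters are the edge case noted in the answer below).
        for k in range(K):
            members = X[labels == k]
            if len(members):
                means[k] = members.mean(axis=0)
        # Step 3: reassign each observation to the nearest mean,
        # measured by squared Euclidean distance.
        d = ((X[:, None, :] - means[None, :, :]) ** 2).sum(axis=2)
        new_labels = d.argmin(axis=1)
        if np.array_equal(new_labels, labels):
            break                                 # step 4: nothing changed, stop
        labels = new_labels
    return labels, means

# Example: two well-separated Gaussian blobs in the plane.
rng = np.random.default_rng(1)
X = np.vstack([rng.normal(0, 1, (50, 2)), rng.normal(6, 1, (50, 2))])
labels, means = kmeans(X, K=2)
```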
I was looking at a proof that this algorithm terminates, but I am struggling with this one line:
Since the algorithm iterates a function whose domain is a finite set, the iteration must eventually enter a cycle.
Could anybody explain this rigorously? I do not understand why it holds.
The proof I was reading is here: https://stats.stackexchange.com/questions/188087/proof-of-convergence-of-k-means
Each observation can be in one of the clusters $1, 2, \ldots, K$, so with $n$ observations there are at most $K^n$ possible labelings (possibly minus those that leave some cluster empty). Among any $K^n + 1$ successive labelings produced by K-means, the pigeonhole principle therefore guarantees that at least one labeling repeats. Since the update from one labeling to the next is a deterministic function, the iteration must cycle from the first repeat onward; and because steps 2 and 3 never increase the within-cluster sum of squares (given a consistent tie-breaking rule), that cycle must in fact be a fixed point, which is why the algorithm terminates.
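To make the pigeonhole argument concrete, here is a toy demonstration (my own construction, not from the linked post): with $n = 4$ observations and $K = 2$ there are at most $2^4 = 16$ labelings, so tracking the labeling through deterministic K-means updates must hit a repeat within $17$ steps.

```python
import numpy as np

# Toy data: n = 4 one-dimensional observations, K = 2 clusters,
# so the state space has at most 2**4 = 16 possible labelings.
X = np.array([[0.0], [1.0], [9.0], [10.0]])
K = 2

def lloyd_step(labels):
    """One deterministic K-means update: compute the cluster means (step 2)
    and reassign by squared Euclidean distance (step 3), returning the new
    labeling as a hashable tuple."""
    labels = np.asarray(labels)
    # An empty cluster gets an infinite "mean" so nothing is assigned to it.
    means = np.array([X[labels == k].mean() if np.any(labels == k) else np.inf
                      for k in range(K)])
    d = (X[:, 0][:, None] - means[None, :]) ** 2
    return tuple(int(k) for k in d.argmin(axis=1))

state = (0, 1, 0, 1)   # arbitrary initial assignment (step 1)
seen = {}
for t in range(K ** len(X) + 1):  # pigeonhole: a repeat within K**n + 1 states
    if state in seen:
        print(f"labeling {state} repeated at step {t}, first seen at step {seen[state]}")
        break
    seen[state] = t
    state = lloyd_step(state)
```

On this data the repeat shows up after a couple of updates, as a fixed point; the $K^n + 1$ bound is only the worst case that the pigeonhole principle guarantees.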