Selecting a cluster based on minimum average distance

451 Views Asked by Bumbble Comm At 10 May 2026 - 3:07

I have a symmetric matrix of non-Euclidean distances of size $N$ (say, 500) and I would like to select one cluster of a fixed size $K$ (say, 25), so that it has the smallest average distance within this cluster. What is a good algorithm for doing that given combinatorial complexity of the problem?

Currently I have implemented the following algorithm, which is not perfect in finding the optimum:

1) Take $K$ points at random and form the initial cluster.
2) Find the $K$ points with the smallest average distance to the points in the current cluster. Call these $K$ points the new cluster.
3) Repeat 2) with the new cluster in place of the old one, until the selected $K$ points stop changing or the new cluster has a larger average distance than the old cluster.
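For reference, the heuristic described above can be sketched roughly as follows. This is only one reading of the steps, not a definitive implementation: it assumes the distances are stored in a symmetric NumPy array `D` with a zero diagonal, that $K \ge 2$, and that "average distance within the cluster" means the mean over all pairs of distinct members. The names `avg_within`, `select_cluster`, and `max_iter` are made up for illustration.

```python
import numpy as np

def avg_within(D, idx):
    """Mean pairwise distance among the points in idx (assumes zero diagonal)."""
    sub = D[np.ix_(idx, idx)]
    k = len(idx)
    return sub.sum() / (k * (k - 1))

def select_cluster(D, K, rng=None, max_iter=100):
    """Greedy re-selection heuristic; may stop in a local optimum."""
    rng = np.random.default_rng(rng)
    N = D.shape[0]
    # Step 1: random initial cluster of K distinct points.
    cluster = np.sort(rng.choice(N, size=K, replace=False))
    best = avg_within(D, cluster)
    for _ in range(max_iter):
        # Step 2: average distance from every point to the current members,
        # then keep the K points with the smallest averages.
        score = D[:, cluster].mean(axis=1)
        candidate = np.sort(np.argsort(score, kind="stable")[:K])
        cand_cost = avg_within(D, candidate)
        # Step 3: stop if nothing changed or the new cluster is worse.
        if np.array_equal(candidate, cluster) or cand_cost > best:
            break
        cluster, best = candidate, cand_cost
    return cluster, best
```

Because the result depends on the random start, calling `select_cluster` with several different seeds and keeping the lowest-cost result is a cheap way to reduce the chance of ending in a poor local optimum.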


There is 1 best solution below

Bumbble Comm On 09 Apr 2019 - 2:32 BEST ANSWER

It seems like you are reinventing the k-means algorithm, which has been around for a while (Lloyd's algorithm dates from 1957). Although k-means normally minimizes Euclidean distances, that shouldn't be a problem as long as your distance function is a metric.

k-means++, thanks to its careful seeding, gives more stable results in fewer iterations (original paper).
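To make the seeding idea concrete, here is a minimal sketch of k-means++-style seeding adapted to a precomputed distance matrix. Standard k-means++ works on Euclidean coordinates; restricting the seeds to existing points and reading distances from a matrix `D` is an assumption made here to match the question. The function name `kmeanspp_seeds` is hypothetical, and the sketch assumes a zero diagonal and `n_seeds <= N`.

```python
import numpy as np

def kmeanspp_seeds(D, n_seeds, rng=None):
    """Pick seed indices with probability proportional to the squared
    distance to the nearest seed chosen so far (k-means++ style)."""
    rng = np.random.default_rng(rng)
    N = D.shape[0]
    # First seed: uniform at random.
    seeds = [int(rng.integers(N))]
    # Distance from every point to its nearest seed so far.
    closest = D[seeds[0]].astype(float).copy()
    for _ in range(1, n_seeds):
        weights = closest ** 2
        total = weights.sum()
        if total > 0:
            probs = weights / total
        else:
            # Degenerate case: every point is at distance 0 from a seed,
            # so fall back to a uniform draw over unused points.
            probs = np.ones(N)
            probs[seeds] = 0.0
            probs /= probs.sum()
        nxt = int(rng.choice(N, p=probs))
        seeds.append(nxt)
        closest = np.minimum(closest, D[nxt])
    return seeds
```

Points far from all current seeds are more likely to be picked next, which spreads the starting points out and is what tends to stabilize the later iterations.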
