Abstract

A random sample is divided into the $k$ clusters that minimise the within cluster sum of squares. Conditions are found that ensure the almost sure convergence, as the sample size increases, of the set of means of the $k$ clusters. The result is proved for a more general clustering criterion.

Keywords

MathematicsCluster analysisConsistency (knowledge bases)StatisticsStrong consistencySample (material)Convergence (economics)Cluster (spacecraft)CombinatoricsEconometricsDiscrete mathematicsComputer science

Related Publications

A dendrite method for cluster analysis

Abstract A method for identifying clusters of points in a multidimensional Euclidean space is described and its application to taxonomy considered. It reconciles, in a sense, tw...

1974 Communication in Statistics- Theory a... 6351 citations

Publication Info

Year
1981
Type
article
Volume
9
Issue
1
Citations
452
Access
Closed

External Links

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

452
OpenAlex

Cite This

David Pollard (1981). Strong Consistency of $K$-Means Clustering. The Annals of Statistics , 9 (1) . https://doi.org/10.1214/aos/1176345339

Identifiers

DOI
10.1214/aos/1176345339