Abstract

We propose a new measure for assessing the quality of a clustering. A simple heuristic is shown to give worst-case guarantees under the new measure. Then we present two results regarding the quality of the clustering found by a popular spectral algorithm. One proffers worst case guarantees whilst the other shows that if there exists a "good" clustering then the spectral algorithm will find one close to it.

Keywords

Cluster analysisMeasure (data warehouse)HeuristicComputer scienceSpectral clusteringSimple (philosophy)Quality (philosophy)AlgorithmArtificial intelligenceMathematicsData mining

Affiliated Institutions

Related Publications

On clusterings

We motivate and develop a natural bicriteria measure for assessing the quality of a clustering that avoids the drawbacks of existing measures. A simple recursive heuristic is sh...

2004 Journal of the ACM 842 citations

How fast is the k-means method?

We present polynomial upper and lower bounds on the number of iterations performed by the k-means method (a.k.a. Lloyd's method) for k-means clustering. Our upper bounds are pol...

2005 Symposium on Discrete Algorithms 46 citations

Publication Info

Year
2002
Type
article
Pages
367-377
Citations
374
Access
Closed

External Links

Social Impact

Altmetric

Social media, news, blog, policy document mentions

Citation Metrics

374
OpenAlex

Cite This

Ramachandran Kannan, S. Vempala, A. Veta (2002). On clusterings-good, bad and spectral. , 367-377. https://doi.org/10.1109/sfcs.2000.892125

Identifiers

DOI
10.1109/sfcs.2000.892125