Abstract

We present a stochastic clustering algorithm which uses pairwise similarity of elements and show how it can be used to address various problems in computer vision, including the low-level image segmentation, mid-level perceptual grouping, and high-level image database organization. The clustering problem is viewed as a graph partitioning problem, where nodes represent data elements and the weights of the edges represent pairwise similarities. We generate samples of cuts in this graph, by using Karger's contraction algorithm (1996), and compute an "average" cut which provides the basis for our solution to the clustering problem. The stochastic nature of our method makes it robust against noise, including accidental edges and small spurious clusters. The complexity of our algorithm is very low: O(|E| log/sup 2/ N) for N objects, |E| similarity relations, and a fixed accuracy level. In addition, and without additional computational cost, our algorithm provides a hierarchy of nested partitions. We demonstrate the superiority of our method for image segmentation on a few synthetic and real images, both B&W and color. Our other examples include the concatenation of edges in a cluttered scene (perceptual grouping) and the organization of an image database for the purpose of multiview 3D object recognition.

Keywords

Cluster analysisArtificial intelligencePattern recognition (psychology)Image segmentationComputer sciencePairwise comparisonSegmentation-based object categorizationSegmentationComputational complexity theoryComputer visionScale-space segmentationAlgorithm

Affiliated Institutions

Related Publications

Normalized cuts and image segmentation

We propose a novel approach for solving the perceptual grouping problem in vision. Rather than focusing on local features and their consistencies in the image data, our approach...

2000 IEEE Transactions on Pattern Analysis... 15440 citations

Publication Info

Year
2001
Type
article
Volume
23
Issue
10
Pages
1053-1074
Citations
176
Access
Closed

External Links

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

176
OpenAlex

Cite This

Yoram Gdalyahu, Daphna Weinshall, Michael Werman (2001). Self-organization in vision: stochastic clustering for image segmentation, perceptual grouping, and image database organization. IEEE Transactions on Pattern Analysis and Machine Intelligence , 23 (10) , 1053-1074. https://doi.org/10.1109/34.954598

Identifiers

DOI
10.1109/34.954598