Optimal multi-step k-nearest neighbor search

Abstract

Efficient support of similarity search is becoming an important task for a growing number of modern database applications. As objects such as images, molecules, and mechanical parts grow more complex, so do the similarity models used to compare them. Algorithms that operate directly on indexes work well for simple, medium-dimensional similarity distance functions, but they do not meet the efficiency requirements of complex, high-dimensional, and adaptable distance functions. A multi-step query processing strategy is recommended in these cases, and our investigations substantiate that the number of candidates produced in the filter step and exactly evaluated in the refinement step is a fundamental efficiency parameter. After revealing the strong performance shortcomings of the state-of-the-art algorithm for k-nearest neighbor search [Korn et al. 1996], we present a novel multi-step algorithm that is guaranteed to produce the minimum number of candidates. Experimental evaluations demonstrate a significant performance gain over the previous solution: we observed average improvement factors of up to 120 for the number of candidates and up to 48 for the total runtime.
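
The abstract does not spell out the algorithm, so the following is only a minimal sketch of a filter-and-refine k-nearest neighbor query, not the authors' implementation. It assumes that the cheap filter distance never exceeds the exact distance (a lower bound), and it uses a full sort as a stand-in for an incremental ranking delivered by an index. All names (`multistep_knn`, `first_dim_distance`, `euclidean`) are hypothetical and chosen for illustration.

```python
import heapq
import math
from typing import Callable, List, Sequence, Tuple

Vector = Sequence[float]
Distance = Callable[[Vector, Vector], float]


def euclidean(a: Vector, b: Vector) -> float:
    """Exact (expensive) distance in this sketch."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def first_dim_distance(a: Vector, b: Vector) -> float:
    """Cheap filter distance: uses only the first coordinate.

    Dropping the remaining non-negative terms can only shrink the
    Euclidean distance, so this value is a lower bound on `euclidean`.
    """
    return abs(a[0] - b[0])


def multistep_knn(
    query: Vector,
    objects: List[Vector],
    k: int,
    filter_dist: Distance,
    exact_dist: Distance,
) -> Tuple[List[Tuple[float, int]], int]:
    """Return (k nearest as (exact distance, index), number of refinements).

    Assumption: filter_dist(q, o) <= exact_dist(q, o) for all objects.
    """
    # Filter step: rank all objects by the cheap distance. A real system
    # would fetch this ranking incrementally from an index; sorting is a
    # simplification made for this sketch.
    ranking = sorted((filter_dist(query, o), i) for i, o in enumerate(objects))

    best: List[Tuple[float, int]] = []  # max-heap via negated exact distances
    refined = 0
    for lower_bound, i in ranking:
        # Stop once k results are known and the next candidate's lower
        # bound already exceeds the current k-th exact distance.
        if len(best) == k and lower_bound > -best[0][0]:
            break
        d = exact_dist(query, objects[i])  # refinement step
        refined += 1
        if len(best) < k:
            heapq.heappush(best, (-d, i))
        elif d < -best[0][0]:
            heapq.heapreplace(best, (-d, i))

    return sorted((-neg_d, i) for neg_d, i in best), refined
```

Calling `multistep_knn(query, objects, k, first_dim_distance, euclidean)` returns the same neighbors as a brute-force scan under the lower-bound assumption, while the second return value counts how many exact distance evaluations were needed. That count is the candidate number the abstract identifies as the key efficiency parameter.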

Keywords

Nearest neighbor search, Similarity (geometry), k-nearest neighbors algorithm, Computer science, Task (project management), Filter (signal processing), Simple (philosophy), Algorithm, Best bin first, Data mining, Artificial intelligence, Image (mathematics)

Affiliated Institutions

Ludwig-Maximilians-Universität München DE

Related Publications

Similarity Search in High Dimensions via Hashing

Aristides Gionis, Piotr Indyk, Rajeev Motwani

The nearest- or near-neighbor query problems arise in a large variety of database applications, usually in the context of similarity searching. Of late, there has been increasin...

1999 3096 citations

Example-based super-resolution

William T. Freeman, Thouis R. Jones, Egon Pasztor

We call methods for achieving high-resolution enlargements of pixel-based images super-resolution algorithms. Many applications in graphics or image processing could benefit fro...

2002 IEEE Computer Graphics and Applications 2502 citations

Image and video upscaling from local self-examples

Gilad Freedman, Raanan Fattal

We propose a new high-quality and efficient single-image upscaling technique that extends existing example-based super-resolution frameworks. In our approach we do not rely on a...

2011 ACM Transactions on Graphics 693 citations

The processing of hexagonally sampled two-dimensional signals

R.M. Mersereau

Two-dimensional signals are normally processed as rectangularly sampled arrays; i.e., they are periodically sampled in each of two orthogonal independent variables. Another form...

1979 Proceedings of the IEEE 378 citations

Three‐dimensional Parameterization of the Stellar Locus with Application to QSO Color Selection

Heidi Jo Newberg, B. Yanny

A straightforward method for parameterizing and visualizing a locus of points in n-space is presented. The algorithm applies directly to the problem of distinguishing QSOs from ...

1997 The Astrophysical Journal Supplement ... 36 citations

Publication Info

Year: 1998
Type: article
Pages: 154-165
Citations: 438
Access: Closed

Cite This

APA Style

                            
                                    Thomas Seidl, 
                                
                                    Hans‐Peter Kriegel
                                
                            (1998). 
                            Optimal multi-step k-nearest neighbor search. 
                            
                            , 154-165.
                            https://doi.org/10.1145/276304.276319

Identifiers

DOI: 10.1145/276304.276319