A Sparse Object Category Model for Efficient Learning and Exhaustive Recognition

Abstract

We present a "parts and structure" model for object category recognition that can be learnt efficiently and in a semi-supervised manner: the model is learnt from example images containing category instances, without requiring segmentation from background clutter. The model is a sparse representation of the object, and consists of a star topology configuration of parts modeling the output of a variety of feature detectors. The optimal choice of feature types (whose repertoire includes interest points, curves and regions) is made automatically. In recognition, the model may be applied efficiently in an exhaustive manner, bypassing the need for feature detectors, to give the globally optimal match within a query image. The approach is demonstrated on a wide variety of categories, and delivers both successful classification and localization of the object within the image.

Keywords

Computer scienceArtificial intelligenceCognitive neuroscience of visual object recognitionObject (grammar)3D single-object recognitionPattern recognition (psychology)Machine learning

Affiliated Institutions

Related Publications

Object class recognition by unsupervised scale-invariant learning

Rob Fergus , Pietro Perona , Andrew Zisserman

We present a method to learn and recognize object class models from unlabeled and unsegmented cluttered scenes in a scale invariant manner. Objects are modeled as flexible const...

2003 2035 citations

Caltech-256 Object Category Dataset

G. S. Griffin , Alex Holub , Pietro Perona

We introduce a challenging set of 256 object categories containing a total of 30607 images. The original Caltech-101 [1] was collected by choosing a set of object categories, do...

2007 The Caltech Institute Archives (Calif... 2388 citations

A Discriminative Framework for Modelling Object Classes

Alex Holub , Pietro Perona

Here we explore a discriminative learning method on underlying generative models for the purpose of discriminating between object categories. Visual recognition algorithms learn...

2005 56 citations

Rectified Linear Units Improve Restricted Boltzmann Machines

Vinod Nair , Geoffrey E. Hinton

Restricted Boltzmann machines were developed using binary stochastic hidden units. These can be generalized by replacing each binary unit by an infinite number of copies that al...

2010 International Conference on Machine L... 13197 citations

Video Google: a text retrieval approach to object matching in videos

Sivic , Zisserman

We describe an approach to object and scene retrieval which searches for and localizes all the occurrences of a user outlined object in a video. The object is represented by a s...

2003 6388 citations

Publication Info

Year: 2005
Type: article
Volume: 1
Pages: 380-387
Citations: 266
Access: Closed

External Links

View on DOI.org

Social Impact

Altmetric

A Sparse Object Category Model for Efficient Learning and Exhaustive Recognition

PlumX Metrics

Social media, news, blog, policy document mentions

Citation Metrics

266

OpenAlex

Cite This

APA Style

                            
                                    Rob Fergus, 
                                
                                    Pietro Perona, 
                                
                                    Andrew Zisserman
                                
                            (2005). 
                            A Sparse Object Category Model for Efficient Learning and Exhaustive Recognition. 
                            
                            , 1
                            
                            , 380-387.
                            https://doi.org/10.1109/cvpr.2005.47

Identifiers

DOI: 10.1109/cvpr.2005.47