Shape Quantization and Recognition with Randomized Trees

Abstract

We explore a new approach to shape recognition based on a virtually infinite family of binary features (queries) of the image data, designed to accommodate prior information about shape invariance and regularity. Each query corresponds to a spatial arrangement of several local topographic codes (or tags), which are in themselves too primitive and common to be informative about shape. All the discriminating power derives from relative angles and distances among the tags. The important attributes of the queries are a natural partial ordering corresponding to increasing structure and complexity; semi-invariance, meaning that most shapes of a given class will answer the same way to two queries that are successive in the ordering; and stability, since the queries are not based on distinguished points and substructures. No classifier based on the full feature set can be evaluated, and it is impossible to determine a priori which arrangements are informative. Our approach is to select informative features and build tree classifiers at the same time by inductive learning. In effect, each tree provides an approximation to the full posterior where the features chosen depend on the branch that is traversed. Due to the number and nature of the queries, standard decision tree construction based on a fixed-length feature vector is not feasible. Instead we entertain only a small random sample of queries at each node, constrain their complexity to increase with tree depth, and grow multiple trees. The terminal nodes are labeled by estimates of the corresponding posterior distribution over shape classes. An image is classified by sending it down every tree and aggregating the resulting distributions. The method is applied to classifying handwritten digits and synthetic linear and nonlinear deformations of three hundred LaTeX symbols. State-of-the-art error rates are achieved on the National Institute of Standards and Technology database of digits. The principal goal of the experiments on LaTeX symbols is to analyze invariance, generalization error and related issues, and a comparison with artificial neural network methods is presented in this context.
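
The tree-growing and aggregation steps described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the queries here are columns of a small fixed binary matrix (a toy stand-in for the tag arrangements), the split criterion is average entropy, aggregation is a plain average of the leaf distributions, and the constraint that query complexity grows with depth is left out. All function and parameter names (`grow_tree`, `grow_forest`, `classify`, `n_queries`, and so on) are hypothetical.

```python
import math
import random
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())


def posterior(labels, classes):
    """Empirical class distribution at a node: dict class -> probability."""
    counts = Counter(labels)
    return {c: counts[c] / len(labels) for c in classes}


def grow_tree(X, y, classes, rng, n_queries=5, max_depth=6, min_size=2, depth=0):
    """Grow one tree; each node examines only a small random sample of queries."""
    if depth >= max_depth or len(y) < min_size or len(set(y)) == 1:
        return {"leaf": posterior(y, classes)}
    # Assumption: a "query" is simply the index of a binary feature column.
    candidates = rng.sample(range(len(X[0])), min(n_queries, len(X[0])))
    best_q, best_score = None, None
    for q in candidates:
        yes = [lab for row, lab in zip(X, y) if row[q]]
        no = [lab for row, lab in zip(X, y) if not row[q]]
        if not yes or not no:
            continue  # this query does not split the data at this node
        score = (len(yes) * entropy(yes) + len(no) * entropy(no)) / len(y)
        if best_score is None or score < best_score:
            best_q, best_score = q, score
    if best_q is None:
        return {"leaf": posterior(y, classes)}

    def child(answer):
        # Recurse on the samples that give this answer to the chosen query.
        idx = [i for i, row in enumerate(X) if bool(row[best_q]) == answer]
        return grow_tree([X[i] for i in idx], [y[i] for i in idx], classes,
                         rng, n_queries, max_depth, min_size, depth + 1)

    return {"query": best_q, "yes": child(True), "no": child(False)}


def grow_forest(X, y, n_trees=10, seed=0, **tree_args):
    """Grow several randomized trees on the same training data."""
    rng = random.Random(seed)
    classes = sorted(set(y))
    return [grow_tree(X, y, classes, rng, **tree_args) for _ in range(n_trees)], classes


def tree_posterior(tree, x):
    """Send one sample down a tree and return the distribution at its leaf."""
    while "leaf" not in tree:
        tree = tree["yes"] if x[tree["query"]] else tree["no"]
    return tree["leaf"]


def classify(forest, x, classes):
    """Average the leaf distributions over all trees and take the mode."""
    avg = {c: sum(tree_posterior(t, x)[c] for t in forest) / len(forest)
           for c in classes}
    return max(avg, key=avg.get), avg


# Toy usage: column 0 determines the class; the other columns never split.
X_toy = [[1, 0, 0], [1, 0, 0], [0, 0, 0], [0, 0, 0]]
y_toy = ["a", "a", "b", "b"]
forest_toy, classes_toy = grow_forest(X_toy, y_toy, n_trees=3, n_queries=3)
print(classify(forest_toy, [1, 0, 0], classes_toy))
```

Because each node sees only `n_queries` randomly drawn candidates, different trees end up asking different questions of the same data, which is what makes averaging their leaf distributions useful. With a realistic query family the candidates would be generated on demand rather than indexed from a stored matrix.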

Keywords

Mathematics, Pattern recognition (psychology), A priori and a posteriori, Classifier (UML), Tree (set theory), Feature vector, Artificial intelligence, Binary number, Set (abstract data type), Computer science, Algorithm, Combinatorics

Publication Info

Year: 1997
Type: article
Volume: 9
Issue: 7
Pages: 1545-1588
Citations: 1252
Access: Closed

Cite This

APA Style

Amit, Y., & Geman, D. (1997). Shape Quantization and Recognition with Randomized Trees. Neural Computation, 9(7), 1545-1588. https://doi.org/10.1162/neco.1997.9.7.1545

Identifiers

DOI: 10.1162/neco.1997.9.7.1545