Learning SURF Cascade for Fast and Accurate Object Detection

Abstract

This paper presents a novel learning framework for training boosting cascade based object detector from large scale dataset. The framework is derived from the well-known Viola-Jones (VJ) framework but distinguished by three key differences. First, the proposed framework adopts multi-dimensional SURF features instead of single dimensional Haar features to describe local patches. In this way, the number of used local patches can be reduced from hundreds of thousands to several hundreds. Second, it adopts logistic regression as weak classifier for each local patch instead of decision trees in the VJ framework. Third, we adopt AUC as a single criterion for the convergence test during cascade training rather than the two trade-off criteria (false-positive-rate and hit-rate) in the VJ framework. The benefit is that the false-positive-rate can be adaptive among different cascade stages, and thus yields much faster convergence speed of SURF cascade. Combining these points together, the proposed approach has three good properties. First, the boosting cascade can be trained very efficiently. Experiments show that the proposed approach can train object detectors from billions of negative samples within one hour even on personal computers. Second, the built detector is comparable to the state-of-the-art algorithm not only on the accuracy but also on the processing speed. Third, the built detector is small in model-size due to short cascade stages.

Keywords

Boosting (machine learning)CascadeComputer scienceDetectorHaar-like featuresArtificial intelligenceObject detectionSpeedupClassifier (UML)Machine learningRate of convergenceFalse positive rateViola–Jones object detection frameworkPattern recognition (psychology)Computer visionKey (lock)Face detectionEngineering

Affiliated Institutions

Intel (United States) US

Related Publications

Cascade R-CNN: High Quality Object Detection and Instance Segmentation

Zhaowei Cai , Nuno Vasconcelos

In object detection, the intersection over union (IoU) threshold is frequently used to define positives/negatives. The threshold used to train a detector defines its quality. Wh...

2019 IEEE Transactions on Pattern Analysis... 1601 citations

Generalized Boosted Models: A guide to the gbm package

Greg Ridgeway

Boosting takes on various forms with different programs using different loss functions, different base models, and different optimization schemes. The gbm package takes the appr...

2006 769 citations

Hybrid Task Cascade for Instance Segmentation

Kai Chen , Wanli Ouyang , Chen Change Loy +9 more

Cascade is a classic yet powerful architecture that has boosted performance on various tasks. However, how to introduce cascade to instance segmentation remains an open question...

2019 1411 citations

Cascade R-CNN: Delving Into High Quality Object Detection

Zhaowei Cai , Nuno Vasconcelos

In object detection, an intersection over union (IoU) threshold is required to define positives and negatives. An object detector, trained with low IoU threshold, e.g. 0.5, usua...

2018 6294 citations

Rapid object detection using a boosted cascade of simple features

Paul Viola , Michael Jones

This paper describes a machine learning approach for visual object detection which is capable of processing images extremely rapidly and achieving high detection rates. This wor...

2005 Proceedings of the 2001 IEEE Computer... 17986 citations

Publication Info

Year: 2013
Type: article
Pages: 3468-3475
Citations: 216
Access: Closed

External Links

View on DOI.org

Social Impact

Altmetric

Learning SURF Cascade for Fast and Accurate Object Detection

PlumX Metrics

Social media, news, blog, policy document mentions

Citation Metrics

216

OpenAlex

Cite This

APA Style

                            
                                    Jianguo Li, 
                                
                                    Yimin Zhang
                                
                            (2013). 
                            Learning SURF Cascade for Fast and Accurate Object Detection. 
                            
                            , 3468-3475.
                            https://doi.org/10.1109/cvpr.2013.445

Identifiers

DOI: 10.1109/cvpr.2013.445