DeepID-Net: Deformable deep convolutional neural networks for object detection

Wanli Ouyang; Xiaogang Wang; Xingyu Zeng; Shi Qiu; Ping Luo; Yonglong Tian; Hongsheng Li; Shuo Yang; Zhe Wang; Chen Change Loy; Xiaoou Tang

doi:10.1109/cvpr.2015.7298854

Abstract

In this paper, we propose deformable deep convolutional neural networks for generic object detection.This new deep learning object detection framework has innovations in multiple aspects.In the proposed new deep architecture, a new deformation constrained pooling (def-pooling) layer models the deformation of object parts with geometric constraint and penalty.A new pre-training strategy is proposed to learn feature representations more suitable for the object detection task and with good generalization capability.By changing the net structures, training strategies, adding and removing some key components in the detection pipeline, a set of models with large diversity are obtained, which significantly improves the effectiveness of model averaging.The proposed approach improves the mean averaged precision obtained by RCNN [14], which was the state-ofthe-art, from 31% to 50.3% on the ILSVRC2014 detection test set.It also outperforms the winner of ILSVRC2014, GoogLeNet, by 6.1%.Detailed component-wise analysis is also provided through extensive experimental evaluation, which provide a global view for people to understand the deep learning object detection pipeline.

Keywords

Artificial intelligenceComputer scienceObject detectionConvolutional neural networkPoolingDeep learningPipeline (software)GeneralizationObject (grammar)Pattern recognition (psychology)Set (abstract data type)Test setFeature (linguistics)Constraint (computer-aided design)Machine learningComputer visionEngineeringMathematics

Affiliated Institutions

Chinese University of Hong Kong HK

Related Publications

Deformable ConvNets V2: More Deformable, Better Results

Xizhou Zhu , Han Hu , Stephen Lin +1 more

The superior performance of Deformable Convolutional Networks arises from its ability to adapt to the geometric variations of objects. Through an examination of its adaptive beh...

2019 2019 IEEE/CVF Conference on Computer ... 2431 citations

Deformable Convolutional Networks

Jifeng Dai , Haozhi Qi , Yuwen Xiong +4 more

Convolutional neural networks (CNNs) are inherently limited to model geometric transformations due to the fixed geometric structures in their building modules. In this work, we ...

2017 6444 citations

Spatial Pyramid Pooling in Deep Convolutional Networks for Visual Recognition

Kaiming He , Xiangyu Zhang , Shaoqing Ren +1 more

Existing deep convolutional neural networks (CNNs) require a fixed-size (e.g., 224 × 224) input image. This requirement is "artificial" and may reduce the recognition accuracy f...

2015 IEEE Transactions on Pattern Analysis... 10916 citations

Inside-Outside Net: Detecting Objects in Context with Skip Pooling and Recurrent Neural Networks

Sean Bell , C. Lawrence Zitnick , Kavita Bala +1 more

It is well known that contextual and multi-scale representations are important for accurate visual recognition. In this paper we present the Inside-Outside Net (ION), an object ...

2016 2016 IEEE Conference on Computer Visi... 1297 citations

Fully Convolutional Networks for Multisource Building Extraction From an Open Aerial and Satellite Imagery Data Set

Shunping Ji , Shiqing Wei , Meng Lü

The application of the convolutional neural network has shown to greatly improve the accuracy of building extraction from remote sensing imagery. In this paper, we created and m...

2018 IEEE Transactions on Geoscience and R... 1575 citations

Publication Info

Year: 2015
Type: preprint
Citations: 429
Access: Closed

External Links

View on DOI.org

Social Impact

Altmetric

DeepID-Net: Deformable deep convolutional neural networks for object detection

PlumX Metrics

Social media, news, blog, policy document mentions

Citation Metrics

429

OpenAlex

Cite This

APA Style

                            
                                    Wanli Ouyang, 
                                
                                    Xiaogang Wang, 
                                
                                    Xingyu Zeng
                                
                                et al.
                            
                            (2015). 
                            DeepID-Net: Deformable deep convolutional neural networks for object detection. 
                            
                            .
                            https://doi.org/10.1109/cvpr.2015.7298854

Identifiers

DOI: 10.1109/cvpr.2015.7298854