Deformable ConvNets V2: More Deformable, Better Results

Abstract

The superior performance of Deformable Convolutional Networks arises from its ability to adapt to the geometric variations of objects. Through an examination of its adaptive behavior, we observe that while the spatial support for its neural features conforms more closely than regular ConvNets to object structure, this support may nevertheless extend well beyond the region of interest, causing features to be influenced by irrelevant image content. To address this problem, we present a reformulation of Deformable ConvNets that improves its ability to focus on pertinent image regions, through increased modeling power and stronger training. The modeling power is enhanced through a more comprehensive integration of deformable convolution within the network, and by introducing a modulation mechanism that expands the scope of deformation modeling. To effectively harness this enriched modeling capability, we guide network training via a proposed feature mimicking scheme that helps the network to learn features that reflect the object focus and classification power of R-CNN features. With the proposed contributions, this new version of Deformable ConvNets yields significant performance gains over the original model and produces leading results on the COCO benchmark for object detection and instance segmentation.

Keywords

Computer scienceFocus (optics)Artificial intelligenceBenchmark (surveying)Convolution (computer science)Feature (linguistics)SegmentationObject detectionConvolutional neural networkFeature extractionPattern recognition (psychology)Artificial neural networkObject (grammar)Computer visionImage segmentationImage (mathematics)

Affiliated Institutions

Related Publications

Object Detection With Deep Learning: A Review

Zhong‐Qiu Zhao , Peng Zheng , Shou-Tao Xu +1 more

Due to object detection's close relationship with video analysis and image understanding, it has attracted much research attention in recent years. Traditional object detection ...

2019 IEEE Transactions on Neural Networks ... 5019 citations

Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation

Ross Girshick , Jeff Donahue , Trevor Darrell +1 more

Object detection performance, as measured on the canonical PASCAL VOC dataset, has plateaued in the last few years. The best-performing methods are complex ensemble systems that...

2014 30615 citations

Emerging Properties in Self-Supervised Vision Transformers

Mathilde Caron , Hugo Touvron , Ishan Misra +4 more

In this paper, we question if self-supervised learning provides new properties to Vision Transformer (ViT) that stand out compared to convolutional networks (convnets). Beyond t...

2021 2021 IEEE/CVF International Conferenc... 4220 citations

FCOS: Fully Convolutional One-Stage Object Detection

Zhi Tian , Chunhua Shen , Hao Chen +1 more

We propose a fully convolutional one-stage object detector (FCOS) to solve object detection in a per-pixel prediction fashion, analogue to semantic segmentation. Almost all stat...

2019 5672 citations

A ConvNet for the 2020s

Zhuang Liu , Hanzi Mao , Chao-Yuan Wu +3 more

The "Roaring 20s" of visual recognition began with the introduction of Vision Transformers (ViTs), which quickly superseded ConvNets as the state-of-the-art image classification...

2022 2022 IEEE/CVF Conference on Computer ... 5683 citations

Publication Info

Year: 2019
Type: preprint
Pages: 9300-9308
Citations: 2431
Access: Closed

External Links

View on DOI.org

Social Impact

Altmetric

Deformable ConvNets V2: More Deformable, Better Results

PlumX Metrics

Social media, news, blog, policy document mentions

Citation Metrics

2431

OpenAlex

Cite This

APA Style

                            
                                    Xizhou Zhu, 
                                
                                    Han Hu, 
                                
                                    Stephen Lin
                                
                                et al.
                            
                            (2019). 
                            Deformable ConvNets V2: More Deformable, Better Results. 
                            
                            , 9300-9308.
                            https://doi.org/10.1109/cvpr.2019.00953

Identifiers

DOI: 10.1109/cvpr.2019.00953