Abstract

Object detection has been dominated by anchor-based detectors for several years. Recently, anchor-free detectors have become popular due to the proposal of FPN and Focal Loss. In this paper, we first point out that the essential difference between anchor-based and anchor-free detection is actually how to define positive and negative training samples, which leads to the performance gap between them. If they adopt the same definition of positive and negative samples during training, there is no obvious difference in the final performance, no matter regressing from a box or a point. This shows that how to select positive and negative training samples is important for current object detectors. Then, we propose an Adaptive Training Sample Selection (ATSS) to automatically select positive and negative samples according to statistical characteristics of object. It significantly improves the performance of anchor-based and anchor-free detectors and bridges the gap between them. Finally, we discuss the necessity of tiling multiple anchors per location on the image to detect objects. Extensive experiments conducted on MS COCO support our aforementioned analysis and conclusions. With the newly introduced ATSS, we improve state-of-the-art detectors by a large margin to 50.7% AP without introducing any overhead. The code is available at https://github.com/sfzhang15/ATSS.

Keywords

DetectorComputer scienceBridging (networking)Code (set theory)Object detectionSelection (genetic algorithm)Sample (material)Margin (machine learning)Artificial intelligenceOverhead (engineering)Pattern recognition (psychology)Computer visionMachine learningOperating systemSet (abstract data type)Physics

Affiliated Institutions

Related Publications

Publication Info

Year
2020
Type
article
Citations
2027
Access
Closed

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

2027
OpenAlex
279
Influential
1763
CrossRef

Cite This

Shifeng Zhang, Cheng Chi, Yongqiang Yao et al. (2020). Bridging the Gap Between Anchor-Based and Anchor-Free Detection via Adaptive Training Sample Selection. 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) . https://doi.org/10.1109/cvpr42600.2020.00978

Identifiers

DOI
10.1109/cvpr42600.2020.00978
arXiv
1912.02424

Data Quality

Data completeness: 84%