COCO-Stuff: Thing and Stuff Classes in Context

Abstract

Semantic classes can be either things (objects with a well-defined shape, e.g. car, person) or stuff (amorphous background regions, e.g. grass, sky). While lots of classification and detection works focus on thing classes, less attention has been given to stuff classes. Nonetheless, stuff classes are important as they allow to explain important aspects of an image, including (1) scene type; (2) which thing classes are likely to be present and their location (through contextual reasoning); (3) physical attributes, material types and geometric properties of the scene. To understand stuff and things in context we introduce COCO-Stuff1, which augments all 164K images of the COCO 2017 dataset with pixel-wise annotations for 91 stuff classes. We introduce an efficient stuff annotation protocol based on superpixels, which leverages the original thing annotations. We quantify the speed versus quality trade-off of our protocol and explore the relation between annotation time and boundary complexity. Furthermore, we use COCO-Stuff to analyze: (a) the importance of stuff and thing classes in terms of their surface cover and how frequently they are mentioned in image captions; (b) the spatial relations between stuff and things, highlighting the rich contextual relations that make our dataset unique; (c) the performance of a modern semantic segmentation method on stuff and thing classes, and whether stuff is easier to segment than things.

Keywords

Computer scienceContext (archaeology)AnnotationRelation (database)Artificial intelligenceFocus (optics)SegmentationInformation retrievalData miningGeography

Affiliated Institutions

Related Publications

Dual Attention Network for Scene Segmentation

Jun Fu , Jing Liu , Haijie Tian +4 more

In this paper, we address the scene segmentation task by capturing rich contextual dependencies based on the self-attention mechanism. Unlike previous works that capture context...

2019 6497 citations

The Cityscapes Dataset for Semantic Urban Scene Understanding

Marius Cordts , Mohamed Omran , Sebastian Ramos +6 more

Visual understanding of complex urban street scenes is an enabling factor for a wide range of applications. Object detection has benefited enormously from large-scale datasets, ...

2016 11212 citations

Scene Segmentation with DAG-Recurrent Neural Networks

Bing Shuai , Zhen Zuo , Bing Wang +1 more

In this paper, we address the challenging task of scene segmentation. In order to capture the rich contextual dependencies over image regions, we propose Directed Acyclic Graph-...

2017 IEEE Transactions on Pattern Analysis... 143 citations

The Role of Context for Object Detection and Semantic Segmentation in the Wild

Roozbeh Mottaghi , Xianjie Chen , Xiaobai Liu +5 more

In this paper we study the role of context in existing state-of-the-art detection and segmentation approaches. Towards this goal, we label every pixel of PASCAL VOC 2010 detecti...

2014 1424 citations

BiSeNet: Bilateral Segmentation Network for Real-Time Semantic Segmentation

Changqian Yu , Jingbo Wang , Chao Peng +3 more

Semantic segmentation requires both rich spatial information and sizeable receptive field. However, modern approaches usually compromise spatial resolution to achieve real-time ...

2018 Lecture notes in computer science 2572 citations

Publication Info

Year: 2018
Type: article
Pages: 1209-1218
Citations: 1163
Access: Closed

External Links

View on DOI.org

Social Impact

Altmetric

COCO-Stuff: Thing and Stuff Classes in Context

PlumX Metrics

Social media, news, blog, policy document mentions

Citation Metrics

1163

OpenAlex

Cite This

APA Style

                            
                                    Holger Caesar, 
                                
                                    Jasper Uijlings, 
                                
                                    Vittorio Ferrari
                                
                            (2018). 
                            COCO-Stuff: Thing and Stuff Classes in Context. 
                            
                            , 1209-1218.
                            https://doi.org/10.1109/cvpr.2018.00132

Identifiers

DOI: 10.1109/cvpr.2018.00132