Context Encoding for Semantic Segmentation

Abstract

Recent work has made significant progress in improving spatial resolution for pixelwise labeling with the Fully Convolutional Network (FCN) framework by employing dilated/atrous convolution, utilizing multi-scale features, and refining boundaries. In this paper, we explore the impact of global contextual information in semantic segmentation by introducing the Context Encoding Module, which captures the semantic context of scenes and selectively highlights class-dependent feature maps. The proposed Context Encoding Module significantly improves semantic segmentation results with only marginal extra computation cost over FCN. Our approach achieves new state-of-the-art results of 51.7% mIoU on PASCAL-Context and 85.9% mIoU on PASCAL VOC 2012. Our single model achieves a final score of 0.5567 on the ADE20K test set, which surpasses the winning entry of the COCO-Place Challenge 2017. In addition, we explore how the Context Encoding Module can improve the feature representation of relatively shallow networks for image classification on the CIFAR-10 dataset. Our 14-layer network achieves an error rate of 3.45%, which is comparable with state-of-the-art approaches that have over 10× more layers. The source code for the complete system is publicly available.
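
The segmentation results above are reported as mIoU (mean intersection-over-union). As a rough illustration of what that number measures, here is a minimal sketch in plain Python. The function name `mean_iou` and the toy label maps are assumptions made for this example and are not taken from the paper's code. Benchmark evaluations typically accumulate intersections and unions over the whole dataset and handle ignored labels, which this sketch does not attempt.

```python
def mean_iou(pred, target, num_classes):
    """Average per-class IoU over classes that occur in pred or target.

    pred and target are flat sequences of integer class labels,
    one label per pixel, of equal length.
    """
    ious = []
    for c in range(num_classes):
        # Pixels where both prediction and ground truth are class c.
        inter = sum(1 for p, t in zip(pred, target) if p == c and t == c)
        # Pixels where either prediction or ground truth is class c.
        union = sum(1 for p, t in zip(pred, target) if p == c or t == c)
        if union > 0:  # skip classes absent from both maps
            ious.append(inter / union)
    return sum(ious) / len(ious) if ious else 0.0


# Toy 2x3 label maps, flattened row by row (hypothetical data).
target = [0, 0, 1, 1, 2, 2]
pred = [0, 1, 1, 1, 2, 0]

# Per-class IoU: class 0 = 1/3, class 1 = 2/3, class 2 = 1/2.
score = mean_iou(pred, target, num_classes=3)
print(f"mIoU = {score:.3f}")  # prints "mIoU = 0.500"
```

In this toy case the three per-class scores average to 0.5, so a reported value such as 51.7% mIoU means the per-class overlap between prediction and ground truth averages a little over one half.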

Keywords

Pascal (unit), Computer science, Segmentation, Encoding (memory), Artificial intelligence, Pattern recognition (psychology), Convolutional neural network, Context (archaeology), Convolution (computer science), Image segmentation, Context model, Test set, Artificial neural network, Object (grammar), Programming language

Publication Info

Year: 2018
Type: article
Pages: 7151-7160
Citations: 1436
Access: Closed

Citation Metrics

OpenAlex: 1436
Influential: 140
CrossRef: 1120

Cite This

APA Style

Hang Zhang, Kristin Dana, Jianping Shi et al. (2018). Context Encoding for Semantic Segmentation. 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 7151-7160. https://doi.org/10.1109/cvpr.2018.00747

Identifiers

DOI: 10.1109/cvpr.2018.00747
arXiv: 1803.08904
