Abstract

Recent work has made significant progress in improving spatial resolution for pixelwise labeling with Fully Convolutional Network (FCN) framework by employing Dilated/Atrous convolution, utilizing multi-scale features and refining boundaries. In this paper, we explore the impact of global contextual information in semantic segmentation by introducing the Context Encoding Module, which captures the semantic context of scenes and selectively highlights class-dependent featuremaps. The proposed Context Encoding Module significantly improves semantic segmentation results with only marginal extra computation cost over FCN. Our approach has achieved new state-of-the-art results 51.7% mIoU on PASCAL-Context, 85.9% mIoU on PASCAL VOC 2012. Our single model achieves a final score of 0.5567 on ADE20K test set, which surpasses the winning entry of COCO-Place Challenge 2017. In addition, we also explore how the Context Encoding Module can improve the feature representation of relatively shallow networks for the image classification on CIFAR-10 dataset. Our 14 layer network has achieved an error rate of 3.45%, which is comparable with state-of-the-art approaches with over 10Ã- more layers. The source code for the complete system are publicly available1.

Keywords

Pascal (unit)Computer scienceSegmentationEncoding (memory)Artificial intelligencePattern recognition (psychology)Convolutional neural networkContext (archaeology)Convolution (computer science)Image segmentationContext modelTest setArtificial neural networkObject (grammar)Programming language

Affiliated Institutions

Related Publications

Publication Info

Year
2018
Type
article
Pages
7151-7160
Citations
1436
Access
Closed

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

1436
OpenAlex
140
Influential
1120
CrossRef

Cite This

Hang Zhang, Kristin Dana, Jianping Shi et al. (2018). Context Encoding for Semantic Segmentation. 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition , 7151-7160. https://doi.org/10.1109/cvpr.2018.00747

Identifiers

DOI
10.1109/cvpr.2018.00747
arXiv
1803.08904

Data Quality

Data completeness: 88%