UNet++: Redesigning Skip Connections to Exploit Multiscale Features in Image Segmentation

Abstract

The state-of-the-art models for medical image segmentation are variants of U-Net and fully convolutional networks (FCN). Despite their success, these models have two limitations: (1) their optimal depth is apriori unknown, requiring extensive architecture search or inefficient ensemble of models of varying depths; and (2) their skip connections impose an unnecessarily restrictive fusion scheme, forcing aggregation only at the same-scale feature maps of the encoder and decoder sub-networks. To overcome these two limitations, we propose UNet++, a new neural architecture for semantic and instance segmentation, by (1) alleviating the unknown network depth with an efficient ensemble of U-Nets of varying depths, which partially share an encoder and co-learn simultaneously using deep supervision; (2) redesigning skip connections to aggregate features of varying semantic scales at the decoder sub-networks, leading to a highly flexible feature fusion scheme; and (3) devising a pruning scheme to accelerate the inference speed of UNet++. We have evaluated UNet++ using six different medical image segmentation datasets, covering multiple imaging modalities such as computed tomography (CT), magnetic resonance imaging (MRI), and electron microscopy (EM), and demonstrating that (1) UNet++ consistently outperforms the baseline models for the task of semantic segmentation across different datasets and backbone architectures; (2) UNet++ enhances segmentation quality of varying-size objects-an improvement over the fixed-depth U-Net; (3) Mask RCNN++ (Mask R-CNN with UNet++ design) outperforms the original Mask R-CNN for the task of instance segmentation; and (4) pruned UNet++ models achieve significant speedup while showing only modest performance degradation. Our implementation and pre-trained models are available at https://github.com/MrGiovanni/UNetPlusPlus.

Keywords

Computer scienceSegmentationArtificial intelligencePattern recognition (psychology)Feature (linguistics)EncoderConvolutional neural networkImage segmentationSpeedupInferenceComputer vision

Affiliated Institutions

Arizona State University US

Related Publications

Swin-Unet: Unet-Like Pure Transformer for Medical Image Segmentation

Hu Cao , Yueyue Wang , Joy Chen +4 more

In the past few years, convolutional neural networks (CNNs) have achieved milestones in medical image analysis. Especially, the deep neural networks based on U-shaped architectu...

2023 Lecture notes in computer science 2757 citations

UNet 3+: A Full-Scale Connected UNet for Medical Image Segmentation

Huimin Huang , Lanfen Lin , Ruofeng Tong +6 more

Recently, a growing interest has been seen in deep learning-based semantic segmentation. UNet, which is one of deep learning networks with an encoder-decoder architecture, is wi...

2020 ICASSP 2020 - 2020 IEEE International... 2398 citations

UNETR: Transformers for 3D Medical Image Segmentation

Ali Hatamizadeh , Yucheng Tang , Vishwesh Nath +5 more

Fully Convolutional Neural Networks (FCNNs) with contracting and expanding paths have shown prominence for the majority of medical image segmentation applications since the past...

2022 2022 IEEE/CVF Winter Conference on Ap... 2272 citations

Panoptic Feature Pyramid Networks

Alexander Kirillov , Ross Girshick , Kaiming He +1 more

The recently introduced panoptic segmentation task has renewed our community's interest in unifying the tasks of instance segmentation (for thing classes) and semantic segmentat...

2019 1298 citations

Road Extraction by Deep Residual U-Net

Zhengxin Zhang , Qingjie Liu , Yunhong Wang

Road extraction from aerial images has been a hot research topic in the field\nof remote sensing image analysis. In this letter, a semantic segmentation\nneural network which co...

2018 IEEE Geoscience and Remote Sensing Le... 2795 citations

Publication Info

Year: 2019
Type: article
Volume: 39
Issue: 6
Pages: 1856-1867
Citations: 3567
Access: Closed

External Links

View on DOI.org

Social Impact

Altmetric

UNet++: Redesigning Skip Connections to Exploit Multiscale Features in Image Segmentation

PlumX Metrics

Social media, news, blog, policy document mentions

Citation Metrics

3567

OpenAlex

Cite This

APA Style

                            
                                
                                    Zongwei Zhou, 
                                
                                    Md Mahfuzur Rahman Siddiquee, 
                                
                                    Nima Tajbakhsh
                                
                                et al.
                            
                            (2019). 
                            UNet++: Redesigning Skip Connections to Exploit Multiscale Features in Image Segmentation. 
                            IEEE Transactions on Medical Imaging
                            , 39
                            (6)
                            , 1856-1867.
                            https://doi.org/10.1109/tmi.2019.2959609
                        

Identifiers

DOI: 10.1109/tmi.2019.2959609