Abstract

Numerous deep learning applications benefit from multitask learning with multiple regression and classification objectives. In this paper we make the observation that the performance of such systems is strongly dependent on the relative weighting between each task's loss. Tuning these weights by hand is a difficult and expensive process, making multi-task learning prohibitive in practice. We propose a principled approach to multi-task deep learning which weighs multiple loss functions by considering the homoscedastic uncertainty of each task. This allows us to simultaneously learn various quantities with different units or scales in both classification and regression settings. We demonstrate our model learning per-pixel depth regression, semantic and instance segmentation from a monocular input image. Perhaps surprisingly, we show our model can learn multi-task weightings and outperform separate models trained individually on each task.
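The weighting scheme the abstract describes can be made concrete. Below is a minimal PyTorch sketch (not the authors' released code) of a combined loss of the form sum_i exp(-s_i) * L_i + s_i, where s_i = log(sigma_i^2) is a learned log-variance per task; the class name UncertaintyWeightedLoss is illustrative, and the 1/2 factor the paper uses for regression tasks is omitted for simplicity.

import torch
import torch.nn as nn

class UncertaintyWeightedLoss(nn.Module):
    """Combine per-task losses via learned homoscedastic uncertainty.

    Each task i has a learnable log-variance s_i = log(sigma_i^2).
    The combined loss sum_i exp(-s_i) * L_i + s_i down-weights tasks
    with high uncertainty, while the +s_i regulariser prevents the
    trivial solution of driving every uncertainty to infinity.
    """

    def __init__(self, num_tasks: int):
        super().__init__()
        # One learnable log-variance per task, initialised to 0 (sigma = 1).
        self.log_vars = nn.Parameter(torch.zeros(num_tasks))

    def forward(self, task_losses):
        total = torch.zeros((), device=self.log_vars.device)
        for loss, log_var in zip(task_losses, self.log_vars):
            total = total + torch.exp(-log_var) * loss + log_var
        return total

Because the log-variances are ordinary parameters, they are optimised jointly with the network weights, e.g. torch.optim.Adam(list(model.parameters()) + list(weighter.parameters())).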

Keywords

Homoscedasticity, Computer science, Artificial intelligence, Task (project management), Weighting, Multi-task learning, Regression, Machine learning, Semantics (computer science), Deep learning, Process (computing), Monocular, Pattern recognition (psychology), Mathematics, Statistics, Heteroscedasticity

Publication Info

Year: 2018
Type: article
Pages: 7482-7491
Citations: 2556
Access: Closed

Citation Metrics

OpenAlex: 2556
Influential: 344
CrossRef: 1136

Cite This

Alex Kendall, Yarin Gal, Roberto Cipolla (2018). Multi-task Learning Using Uncertainty to Weigh Losses for Scene Geometry and Semantics. 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, 7482-7491. https://doi.org/10.1109/cvpr.2018.00781

Identifiers

DOI: 10.1109/cvpr.2018.00781
arXiv: 1705.07115
