Pedestrian detection aided by deep learning semantic tasks

Abstract

Deep learning methods have achieved great successes in pedestrian detection, owing to its ability to learn discriminative features from raw pixels. However, they treat pedestrian detection as a single binary classification task, which may confuse positive with hard negative samples (Fig.1 (a)). To address this ambiguity, this work jointly optimize pedestrian detection with semantic tasks, including pedestrian attributes (e.g. `carrying backpack') and scene attributes (e.g. `vehicle', `tree', and `horizontal'). Rather than expensively annotating scene attributes, we transfer attributes information from existing scene segmentation datasets to the pedestrian dataset, by proposing a novel deep model to learn high-level features from multiple tasks and multiple data sources. Since distinct tasks have distinct convergence rates and data from different datasets have different distributions, a multi-task deep model is carefully designed to coordinate tasks and reduce discrepancies among datasets. Extensive evaluations show that the proposed approach outperforms the state-of-the-art on the challenging Caltech [9] and ETH [10] datasets where it reduces the miss rates of previous deep models by 17 and 5.5 percent, respectively.

Keywords

Computer sciencePedestrian detectionArtificial intelligencePedestrianDeep learningDiscriminative modelTask (project management)SegmentationConvolutional neural networkAmbiguityTree (set theory)Machine learningObject detectionTransfer of learningPattern recognition (psychology)Computer vision

Affiliated Institutions

Related Publications

Object Detection With Deep Learning: A Review

Zhong‐Qiu Zhao , Peng Zheng , Shou-Tao Xu +1 more

Due to object detection's close relationship with video analysis and image understanding, it has attracted much research attention in recent years. Traditional object detection ...

2019 IEEE Transactions on Neural Networks ... 5019 citations

Joint Deep Learning for Pedestrian Detection

Wanli Ouyang , Xiaogang Wang

Feature extraction, deformation handling, occlusion handling, and classification are four important components in pedestrian detection. Existing methods learn or design these co...

2013 681 citations

Modeling scenes with local descriptors and latent aspects

Pedro Quelhas , Florent Monay , Jean‐Marc Odobez +3 more

We present a new approach to model visual scenes in image collections, based on local invariant features and probabilistic latent space models. Our formulation provides answers ...

2005 345 citations

Meta-Transfer Learning for Few-Shot Learning

Qianru Sun , Yaoyao Liu , Tat‐Seng Chua +1 more

Meta-learning has been proposed as a framework to address the challenging few-shot learning setting. The key idea is to leverage a large number of similar few-shot tasks in orde...

2019 1224 citations

Deep Multi-Modal Object Detection and Semantic Segmentation for Autonomous Driving: Datasets, Methods, and Challenges

Di Feng , Christian Schütz , Lars Rosenbaum +5 more

Recent advancements in perception for autonomous driving are driven by deep learning. In order to achieve robust and accurate scene understanding, autonomous vehicles are usuall...

2020 IEEE Transactions on Intelligent Tran... 1211 citations

Publication Info

Year: 2015
Type: preprint
Pages: 5079-5087
Citations: 418
Access: Closed

External Links

View on DOI.org

Social Impact

Altmetric

Pedestrian detection aided by deep learning semantic tasks

PlumX Metrics

Social media, news, blog, policy document mentions

Citation Metrics

418

OpenAlex

Cite This

APA Style

                            
                                    Yonglong Tian, 
                                
                                    Ping Luo, 
                                
                                    Xiaogang Wang
                                
                                et al.
                            
                            (2015). 
                            Pedestrian detection aided by deep learning semantic tasks. 
                            
                            , 5079-5087.
                            https://doi.org/10.1109/cvpr.2015.7299143

Identifiers

DOI: 10.1109/cvpr.2015.7299143