Deep Learning Face Attributes in the Wild

Abstract

Predicting face attributes in the wild is challenging due to complex face variations. We propose a novel deep learning framework for attribute prediction in the wild. It cascades two CNNs, LNet and ANet, which are fine-tuned jointly with attribute tags, but pre-trained differently. LNet is pre-trained on massive general object categories for face localization, while ANet is pre-trained on massive face identities for attribute prediction. This framework not only outperforms the state of the art by a large margin, but also reveals valuable facts on learning face representation. (1) It shows how the performance of face localization (LNet) and attribute prediction (ANet) can be improved by different pre-training strategies. (2) It reveals that although the filters of LNet are fine-tuned only with image-level attribute tags, their response maps over entire images strongly indicate face locations. This fact enables training LNet for face localization with only image-level annotations, without the face bounding boxes or landmarks that are required by all attribute recognition works. (3) It also demonstrates that the high-level hidden neurons of ANet automatically discover semantic concepts after pre-training with massive face identities, and such concepts are significantly enriched after fine-tuning with attribute tags. Each attribute can be well explained with a sparse linear combination of these concepts.
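Point (2) of the abstract says that LNet's response maps indicate face locations, but it does not say how a location is read off a map. The sketch below is one simple possibility, not the authors' procedure: it averages a few response maps and takes the bounding box of all cells above a fraction of the peak. The function names, the `rel_threshold` parameter, and the toy maps are hypothetical and chosen only for illustration.

```python
from typing import List, Optional, Tuple

# (top, left, bottom, right), all inclusive cell indices
Box = Tuple[int, int, int, int]
Map2D = List[List[float]]


def average_maps(maps: List[Map2D]) -> Map2D:
    """Average several same-sized 2D response maps element-wise."""
    n = len(maps)
    rows, cols = len(maps[0]), len(maps[0][0])
    return [
        [sum(m[r][c] for m in maps) / n for c in range(cols)]
        for r in range(rows)
    ]


def box_from_response(avg: Map2D, rel_threshold: float = 0.5) -> Optional[Box]:
    """Bounding box of cells whose response is >= rel_threshold * peak.

    Assumption: a plain relative threshold stands in for whatever
    localization rule the paper actually uses.
    """
    peak = max(max(row) for row in avg)
    if peak <= 0:
        return None  # no positive response anywhere, so no box
    cutoff = rel_threshold * peak
    hits = [
        (r, c)
        for r, row in enumerate(avg)
        for c, value in enumerate(row)
        if value >= cutoff
    ]
    rows_hit = [r for r, _ in hits]
    cols_hit = [c for _, c in hits]
    return (min(rows_hit), min(cols_hit), max(rows_hit), max(cols_hit))


# Toy response maps (made-up values): strong activity in the 2x2 block
# at rows 1-2, columns 1-2, plus weak noise elsewhere.
map_a = [
    [0.0, 0.0, 0.0, 0.0, 0.0],
    [0.0, 0.8, 1.0, 0.0, 0.0],
    [0.0, 0.9, 0.7, 0.0, 0.0],
    [0.0, 0.0, 0.0, 0.0, 0.1],
]
map_b = [
    [0.0, 0.0, 0.0, 0.0, 0.0],
    [0.0, 0.6, 0.8, 0.2, 0.0],
    [0.0, 0.7, 0.9, 0.0, 0.0],
    [0.1, 0.0, 0.0, 0.0, 0.0],
]

face_box = box_from_response(average_maps([map_a, map_b]))
print(face_box)  # (1, 1, 2, 2)
```

The relative threshold keeps the sketch independent of the absolute scale of the responses. A real pipeline would also need to map cell indices back to image coordinates, which is omitted here.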

Keywords

Computer science, Artificial intelligence, Face (sociological concept), Pattern recognition (psychology), Margin (machine learning), Facial recognition system, Representation (politics), Bounding overwatch, Image (mathematics), Object (grammar), Minimum bounding box, Deep learning, Computer vision, Machine learning

Related Publications

Convolutional Neural Networks for Medical Image Analysis: Full Training or Fine Tuning?

Nima Tajbakhsh , J. Shin , Suryakanth Gurudu +4 more

Training a deep convolutional neural network (CNN) from scratch is difficult because it requires a large amount of labeled training data and a great deal of expertise to ensure ...

2016 IEEE Transactions on Medical Imaging 2961 citations

A Deep Sum-Product Architecture for Robust Facial Attributes Analysis

Ping Luo , Xiaogang Wang , Xiaoou Tang

Recent works have shown that facial attributes are useful in a number of applications such as face recognition and retrieval. However, estimating attributes in images with large...

2013 92 citations

PANDA: Pose Aligned Networks for Deep Attribute Modeling

Ning Zhang , Manohar Paluri , Marc’Aurelio Ranzato +2 more

We propose a method for inferring human attributes (such as gender, hair style, clothes style, expression, action) from images of people under large variation of viewpoint, pose...

2014 490 citations

Learning Deep Features for Discriminative Localization

Bolei Zhou , Aditya Khosla , Àgata Lapedriza +2 more

In this work, we revisit the global average pooling layer proposed in [13], and shed light on how it explicitly enables the convolutional neural network (CNN) to have remarkable...

2016 10334 citations

CosFace: Large Margin Cosine Loss for Deep Face Recognition

Hao Wang , Yitong Wang , Zheng Zhou +5 more

Face recognition has made extraordinary progress owing to the advancement of deep convolutional neural networks (CNNs). The central task of face recognition, including face veri...

2018 2018 IEEE/CVF Conference on Computer ... 2715 citations

Publication Info

Year: 2015
Type: article
Pages: 3730-3738
Citations: 7360
Access: Closed

Citation Metrics

7360 (OpenAlex)

Cite This

APA Style

Ziwei Liu, Ping Luo, Xiaogang Wang et al. (2015). Deep Learning Face Attributes in the Wild, 3730-3738. https://doi.org/10.1109/iccv.2015.425

Identifiers

DOI: 10.1109/iccv.2015.425