Learning to Align from Scratch | RDL Research Database

Abstract

Unsupervised joint alignment of images has been demonstrated to improve performance on recognition tasks such as face verification. Such alignment reduces undesired variability due to factors such as pose, while only requiring weak supervision in the form of poorly aligned examples. However, prior work on unsupervised alignment of complex, real-world images has required the careful selection of feature representation based on hand-crafted image descriptors, in order to achieve an appropriate, smooth optimization landscape. In this paper, we instead propose a novel combination of unsupervised joint alignment with unsupervised feature learning. Specifically, we incorporate deep learning into the congealing alignment framework. Through deep learning, we obtain features that can represent the image at differing resolutions based on network depth, and that are tuned to the statistics of the specific data being aligned. In addition, we modify the learning algorithm for the restricted Boltzmann machine by incorporating a group sparsity penalty, leading to a topographic organization of the learned filters and improving subsequent alignment results. We apply our method to the Labeled Faces in the Wild database (LFW). Using the aligned images produced by our proposed unsupervised algorithm, we achieve higher accuracy in face verification compared to prior work in both unsupervised and supervised alignment. We also match the accuracy for the best available commercial method. 1

Keywords

Artificial intelligenceComputer scienceUnsupervised learningPattern recognition (psychology)Face (sociological concept)Representation (politics)Restricted Boltzmann machineFeature (linguistics)Image (mathematics)Feature learningMachine learningFeature extractionSelection (genetic algorithm)Deep learning

Affiliated Institutions

Related Publications

Learning hierarchical representations for face verification with convolutional deep belief networks

Guoyang Huang , Honglak Lee , Erik Learned-Miller

Most modern face recognition systems rely on a feature representation given by a hand-crafted image descriptor, such as Local Binary Patterns (LBP), and achieve improved perform...

2012 412 citations

Probabilistic Elastic Matching for Pose Variant Face Verification

Haoxiang Li , Gang Hua , Zhe Lin +2 more

Pose variation remains to be a major challenge for real-world face recognition. We approach this problem through a probabilistic elastic matching method. We take a part based re...

2013 199 citations

Rectified Linear Units Improve Restricted Boltzmann Machines

Vinod Nair , Geoffrey E. Hinton

Restricted Boltzmann machines were developed using binary stochastic hidden units. These can be generalized by replacing each binary unit by an infinite number of copies that al...

2010 International Conference on Machine L... 13197 citations

Sparse Feature Learning for Deep Belief Networks

Marc’Aurelio Ranzato , Y-Lan Boureau , Y. Le Cun

Unsupervised learning algorithms aim to discover the structure hidden in the data, and to learn representations that are more suitable as input to a supervised machine than the ...

2007 Neural Information Processing Systems 713 citations

Fusing Robust Face Region Descriptors via Multiple Metric Learning for Face Recognition in the Wild

Zhen Cui , Li Wen , Dong Xu +2 more

In many real-world face recognition scenarios, face images can hardly be aligned accurately due to complex appearance variations or low-quality images. To address this issue, we...

2013 194 citations

Publication Info

Year: 2012
Type: article
Volume: 25
Pages: 764-772
Citations: 248
Access: Closed

External Links

Citation Metrics

248

OpenAlex

Cite This

APA Style

                            
                                    Gary B. Huang, 
                                
                                    Marwan Mattar, 
                                
                                    Honglak Lee
                                
                                et al.
                            
                            (2012). 
                            Learning to Align from Scratch. 
                            CORE Scholar (Wright State University)
                            , 25
                            
                            , 764-772.