Abstract

Pose Machines provide a sequential prediction framework for learning rich implicit spatial models. In this work we show a systematic design for how convolutional networks can be incorporated into the pose machine framework for learning image features and image-dependent spatial models for the task of pose estimation. The contribution of this paper is to implicitly model long-range dependencies between variables in structured prediction tasks such as articulated pose estimation. We achieve this by designing a sequential architecture composed of convolutional networks that directly operate on belief maps from previous stages, producing increasingly refined estimates for part locations, without the need for explicit graphical model-style inference. Our approach addresses the characteristic difficulty of vanishing gradients during training by providing a natural learning objective function that enforces intermediate supervision, thereby replenishing back-propagated gradients and conditioning the learning procedure. We demonstrate state-of-the-art performance and outperform competing methods on standard benchmarks including the MPII, LSP, and FLIC datasets.
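To make the idea in the abstract concrete, here is a minimal sketch of a sequential architecture with stages that operate on belief maps from previous stages and a loss applied at every stage (intermediate supervision). This is an illustrative PyTorch sketch under assumed settings, not the paper's exact architecture: the class names (`Stage`, `SequentialPoseNet`), layer sizes, channel counts, and the MSE target format are placeholders chosen for clarity.

```python
# Illustrative sketch (not the paper's exact architecture): sequential
# convolutional stages that refine per-part belief maps, trained with a
# loss at every stage so gradients reach early stages directly.
import torch
import torch.nn as nn
import torch.nn.functional as F


class Stage(nn.Module):
    """One refinement stage: image features + previous beliefs -> new beliefs."""
    def __init__(self, feat_channels, num_parts):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(feat_channels + num_parts, 128, kernel_size=7, padding=3),
            nn.ReLU(inplace=True),
            nn.Conv2d(128, 128, kernel_size=7, padding=3),
            nn.ReLU(inplace=True),
            nn.Conv2d(128, num_parts, kernel_size=1),  # one belief map per part
        )

    def forward(self, feats, prev_beliefs):
        # Condition on the previous stage's beliefs via channel-wise concatenation.
        return self.net(torch.cat([feats, prev_beliefs], dim=1))


class SequentialPoseNet(nn.Module):
    def __init__(self, num_stages=3, feat_channels=32, num_parts=14):
        super().__init__()
        # Shared image feature extractor (downsamples to belief-map resolution).
        self.features = nn.Sequential(
            nn.Conv2d(3, feat_channels, kernel_size=9, stride=8, padding=4),
            nn.ReLU(inplace=True),
        )
        # Stage 1 predicts beliefs from image evidence alone.
        self.stage1 = nn.Conv2d(feat_channels, num_parts, kernel_size=1)
        # Later stages refine beliefs using image features + previous beliefs.
        self.refine = nn.ModuleList(
            Stage(feat_channels, num_parts) for _ in range(num_stages - 1)
        )

    def forward(self, images):
        feats = self.features(images)
        beliefs = [self.stage1(feats)]
        for stage in self.refine:
            beliefs.append(stage(feats, beliefs[-1]))
        return beliefs  # list of per-stage belief maps


def intermediate_supervision_loss(stage_beliefs, target_beliefs):
    # Supervise every stage, not just the last one, so back-propagated
    # gradients are replenished at intermediate depths.
    return sum(F.mse_loss(b, target_beliefs) for b in stage_beliefs)


if __name__ == "__main__":
    model = SequentialPoseNet()
    images = torch.randn(2, 3, 368, 368)    # batch of RGB crops
    beliefs = model(images)                  # one belief-map tensor per stage
    targets = torch.rand_like(beliefs[-1])   # placeholder part-location targets
    loss = intermediate_supervision_loss(beliefs, targets)
    loss.backward()
    print([b.shape for b in beliefs], loss.item())
```

The key design point the sketch illustrates is that no explicit graphical-model inference step is needed: each stage's convolutions see the previous stage's belief maps over a wide receptive field, which is how long-range dependencies between parts are modeled implicitly, and the per-stage loss is what keeps gradients from vanishing in the deep composed network.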

Keywords

Pose, Computer science, Inference, Artificial intelligence, Task (project management), Machine learning, Range (aeronautics), Image (mathematics), Convolutional neural network, Convolution (computer science), Function (biology), Graphical model, Pattern recognition (psychology), Artificial neural network

Publication Info

Year
2016
Type
Conference paper (CVPR 2016)
Citations
2728
Access
Closed

Citation Metrics

2728 (OpenAlex)

Cite This

Shih-En Wei, Varun Ramakrishna, Takeo Kanade et al. (2016). Convolutional Pose Machines. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR). https://doi.org/10.1109/cvpr.2016.511

Identifiers

DOI
10.1109/cvpr.2016.511