Understanding image representations by measuring their equivariance and equivalence

Karel Lenc; Andrea Vedaldi

doi:10.1109/cvpr.2015.7298701

Abstract

Despite the importance of image representations such as histograms of oriented gradients and deep Convolutional Neural Networks (CNN), our theoretical understanding of them remains limited. Aiming at filling this gap, we investigate three key mathematical properties of representations: equivariance, invariance, and equivalence. Equivariance studies how transformations of the input image are encoded by the representation, invariance being a special case where a transformation has no effect. Equivalence studies whether two representations, for example two different parametrisations of a CNN, capture the same visual information or not. A number of methods to establish these properties empirically are proposed, including introducing transformation and stitching layers in CNNs. These methods are then applied to popular representations to reveal insightful aspects of their structure, including clarifying at which layers in a CNN certain geometric invariances are achieved. While the focus of the paper is theoretical, direct applications to structured-output regression are demonstrated too.

Keywords

Equivalence (formal languages)Convolutional neural networkTransformation (genetics)Computer scienceTransformation geometryArtificial intelligenceRepresentation (politics)Focus (optics)Image (mathematics)Geometric transformationAlgebra over a fieldImage stitchingTheoretical computer sciencePattern recognition (psychology)MathematicsAlgorithmPure mathematics

Affiliated Institutions

University of Oxford GB

Related Publications

Understanding deep image representations by inverting them

Aravindh Mahendran , Andrea Vedaldi

Image representations, from SIFT and Bag of Visual Words to Convolutional Neural Networks (CNNs), are a crucial component of almost any image understanding system. Nevertheless,...

2015 1831 citations

RGB-D Object Recognition via Incorporating Latent Data Structure and Prior Knowledge

Jinhui Tang , Lu Jin , Zechao Li +1 more

For the task of RGB-D object recognition, it is important to identify suitable representations of images, which can boost the performance of object recognition. In this work, we...

2015 IEEE Transactions on Multimedia 60 citations

Second-Order Attention Network for Single Image Super-Resolution

Tao Dai , Jianrui Cai , Yongbing Zhang +2 more

Recently, deep convolutional neural networks (CNNs) have been widely explored in single image super-resolution (SISR) and obtained remarkable performance. However, most of the e...

2019 2019 IEEE/CVF Conference on Computer ... 1811 citations

Quantized Convolutional Neural Networks for Mobile Devices

Jiaxiang Wu , Cong Leng , Yuhang Wang +2 more

Recently, convolutional neural networks (CNN) have demonstrated impressive performance in various computer vision tasks. However, high performance hardware is typically indispen...

2016 1228 citations

Global Second-Order Pooling Convolutional Networks

Zilin Gao , Jiangtao Xie , Qilong Wang +1 more

Deep Convolutional Networks (ConvNets) are fundamental to, besides large-scale visual recognition, a lot of vision tasks. As the primary goal of the ConvNets is to characterize ...

2019 461 citations

Publication Info

Year: 2015
Type: article
Pages: 991-999
Citations: 367
Access: Closed

External Links

View on DOI.org

Social Impact

Altmetric

Understanding image representations by measuring their equivariance and equivalence

PlumX Metrics

Social media, news, blog, policy document mentions

Citation Metrics

367

OpenAlex

Cite This

APA Style

                            
                                    Karel Lenc, 
                                
                                    Andrea Vedaldi
                                
                            (2015). 
                            Understanding image representations by measuring their equivariance and equivalence. 
                            
                            , 991-999.
                            https://doi.org/10.1109/cvpr.2015.7298701

Identifiers

DOI: 10.1109/cvpr.2015.7298701