Speeding up Convolutional Neural Networks with Low Rank Expansions

Abstract

The focus of this paper is speeding up the application of convolutional neural networks. While delivering impressive results across a range of computer vision and machine learning tasks, these networks are computationally demanding, limiting their deployability. Convolutional layers generally consume the bulk of the processing time, and so in this work we present two simple schemes for drastically speeding up these layers. This is achieved by exploiting cross-channel or filter redundancy to construct a low rank basis of filters that are rank-1 in the spatial domain. Our methods are architecture agnostic, and can be easily applied to existing CPU and GPU convolutional frameworks for tuneable speedup performance. We demonstrate this with a real world network designed for scene text character recognition [15], showing a possible 2.5× speedup with no loss in accuracy, and 4.5× speedup with less than 1% drop in accuracy, still achieving state-of-the-art on standard benchmarks.

Keywords

Convolutional neural networkComputer scienceRank (graph theory)Artificial intelligenceMathematicsCombinatorics

Affiliated Institutions

University of Oxford GB

Related Publications

GhostNet: More Features From Cheap Operations

Kai Han , Yunhe Wang , Qi Tian +3 more

Deploying convolutional neural networks (CNNs) on embedded devices is difficult due to the limited memory and computation resources. The redundancy in feature maps is an importa...

2020 3894 citations

SwinIR: Image Restoration Using Swin Transformer

Jingyun Liang , Jiezhang Cao , Guolei Sun +3 more

Image restoration is a long-standing low-level vision problem that aims to restore high-quality images from low-quality images (e.g., downscaled, noisy and compressed images). W...

2021 3538 citations

Striving for Simplicity: The All Convolutional Net

Jost Tobias Springenberg , Alexey Dosovitskiy , Thomas Brox +1 more

Most modern convolutional neural networks (CNNs) used for object recognition are built using the same principles: Alternating convolution and max-pooling layers followed by a sm...

2014 arXiv (Cornell University) 2592 citations

EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks

Mingxing Tan , Quoc V. Le

Convolutional Neural Networks (ConvNets) are commonly developed at a fixed resource budget, and then scaled up for better accuracy if more resources are available. In this paper...

2019 arXiv (Cornell University) 5008 citations

ShuffleNet: An Extremely Efficient Convolutional Neural Network for Mobile Devices

Xiangyu Zhang , Xinyu Zhou , Mengxiao Lin +1 more

We introduce an extremely computation-efficient CNN architecture named ShuffleNet, which is designed specially for mobile devices with very limited computing power (e.g., 10-150...

2018 2018 IEEE/CVF Conference on Computer ... 8394 citations

Publication Info

Year: 2014
Type: article
Pages: 88.1-88.13
Citations: 1130
Access: Closed

External Links

View on DOI.org

Social Impact

Altmetric

Speeding up Convolutional Neural Networks with Low Rank Expansions

PlumX Metrics

Social media, news, blog, policy document mentions

Citation Metrics

1130

OpenAlex

Cite This

APA Style

                            
                                    Max Jaderberg, 
                                
                                    Andrea Vedaldi, 
                                
                                    Andrew Zisserman
                                
                            (2014). 
                            Speeding up Convolutional Neural Networks with Low Rank Expansions. 
                            
                            , 88.1-88.13.
                            https://doi.org/10.5244/c.28.88

Identifiers

DOI: 10.5244/c.28.88