Abstract

Although Deep Convolutional Neural Networks (CNNs) have liberated their power in various computer vision tasks, the most important components of CNN, convolutional layers and fully connected layers, are still limited to linear transformations. In this paper, we propose a novel Factorized Bilinear (FB) layer to model the pairwise feature interactions by considering the quadratic terms in the transformations. Compared with existing methods that tried to incorporate complex non-linearity structures into CNNs, the factorized parameterization makes our FB layer only require a linear increase of parameters and affordable computational cost. To further reduce the risk of overfitting of the FB layer, a specific remedy called DropFactor is devised during the training process. We also analyze the connection between FB layer and some existing models, and show FB layer is a generalization to them. Finally, we validate the effectiveness of FB layer on several widely adopted datasets including CIFAR-10, CIFAR-100 and ImageNet, and demonstrate superior results compared with various state-of-the-art deep models.

Keywords

OverfittingBilinear interpolationComputer scienceConvolutional neural networkLayer (electronics)Pairwise comparisonGeneralizationFeature (linguistics)Artificial intelligencePattern recognition (psychology)Image (mathematics)Process (computing)Machine learningAlgorithmArtificial neural networkMathematicsComputer vision

Affiliated Institutions

Related Publications

Network In Network

Abstract: We propose a novel deep network structure called In Network (NIN) to enhance model discriminability for local patches within the receptive field. The conventional con...

2014 arXiv (Cornell University) 1037 citations

Selective Kernel Networks

In standard Convolutional Neural Networks (CNNs), the receptive fields of artificial neurons in each layer are designed to share the same size. It is well-known in the neuroscie...

2019 2019 IEEE/CVF Conference on Computer ... 2769 citations

Publication Info

Year
2017
Type
article
Pages
2098-2106
Citations
103
Access
Closed

External Links

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

103
OpenAlex

Cite This

Yanghao Li, Naiyan Wang, Jiaying Liu et al. (2017). Factorized Bilinear Models for Image Recognition. , 2098-2106. https://doi.org/10.1109/iccv.2017.229

Identifiers

DOI
10.1109/iccv.2017.229