Abstract

A wavelet scattering network computes a translation invariant image representation which is stable to deformations and preserves high-frequency information for classification. It cascades wavelet transform convolutions with nonlinear modulus and averaging operators. The first network layer outputs SIFT-type descriptors, whereas the next layers provide complementary invariant information that improves classification. The mathematical analysis of wavelet scattering networks explains important properties of deep convolution networks for classification. A scattering representation of stationary processes incorporates higher order moments and can thus discriminate textures having the same Fourier power spectrum. State-of-the-art classification results are obtained for handwritten digits and texture discrimination, with a Gaussian kernel SVM and a generative PCA classifier.

Keywords

Pattern recognition (psychology)Artificial intelligenceWaveletInvariant (physics)Wavelet transformConvolution (computer science)MathematicsContextual image classificationFourier transformFeature extractionKernel (algebra)Computer scienceArtificial neural networkMathematical analysisImage (mathematics)Pure mathematics

Affiliated Institutions

Related Publications

Publication Info

Year
2013
Type
article
Volume
35
Issue
8
Pages
1872-1886
Citations
1576
Access
Closed

External Links

Social Impact

Social media, news, blog, policy document mentions

Citation Metrics

1576
OpenAlex

Cite This

Joan Bruna, Stéphane Mallat (2013). Invariant Scattering Convolution Networks. IEEE Transactions on Pattern Analysis and Machine Intelligence , 35 (8) , 1872-1886. https://doi.org/10.1109/tpami.2012.230

Identifiers

DOI
10.1109/tpami.2012.230