Abstract
This paper concerns the effect of noise on the performance of feedforward neural nets. We introduce and analyze various methods of injecting synaptic noise into dynamically driven recurrent nets during training. Theoretical results show that applying a controlled amount of noise during training may improve convergence and generalization performance. We analyze the effects of various noise parameters and predict that the best overall performance can be achieved by injecting additive noise at each time step. Noise contributes a second-order gradient term to the error function, which can be viewed as an anticipatory agent that aids convergence. This term appears to find promising regions of weight space in the early stages of training, when the training error is large, and should improve convergence on error surfaces with local minima. The first-order term is a regularization term that can improve generalization. Specifically, it can encourage internal representations where the state nodes operate in the saturated regions of the sigmoid discriminant function. While this effect can improve performance on automata-inference problems with binary inputs and target outputs, it is unclear what effect it will have on other types of problems. To substantiate these predictions, we present simulations on learning the dual parity grammar from temporal strings for all noise models, and simulations on learning a randomly generated six-state grammar using the predicted best noise model.
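The training recipe the abstract singles out, additive synaptic noise redrawn at every time step of a recurrent net's forward pass, can be illustrated with a small sketch. The code below is a hypothetical illustration rather than the authors' implementation: it uses a plain Elman-style network trained by backpropagation through time on a single-bit parity toy task instead of the dual-parity and randomly generated six-state grammars studied in the paper, and the names (NoisyRNN, noise_std, train_step) are assumptions made for the example.

```python
import numpy as np

rng = np.random.default_rng(0)

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

class NoisyRNN:
    """Elman-style RNN whose weights receive fresh additive noise at
    every time step of the forward pass (training only)."""

    def __init__(self, n_in, n_hid, noise_std=0.05, lr=0.5):
        self.W_in = rng.normal(0.0, 0.5, (n_hid, n_in))
        self.W_rec = rng.normal(0.0, 0.5, (n_hid, n_hid))
        self.w_out = rng.normal(0.0, 0.5, n_hid)
        self.noise_std = noise_std
        self.lr = lr

    def forward(self, xs, train=True):
        """xs: array of shape (T, n_in). Returns hidden states, per-step
        caches for backprop, and the scalar output after the last step."""
        hs = [np.zeros(self.W_rec.shape[0])]
        caches = []
        for x in xs:
            W_in, W_rec = self.W_in, self.W_rec
            if train and self.noise_std > 0:
                # Additive synaptic noise, sampled anew at each time step.
                W_in = W_in + rng.normal(0.0, self.noise_std, W_in.shape)
                W_rec = W_rec + rng.normal(0.0, self.noise_std, W_rec.shape)
            h = sigmoid(W_in @ x + W_rec @ hs[-1])
            caches.append((x, hs[-1], h, W_rec))
            hs.append(h)
        y = sigmoid(self.w_out @ hs[-1])
        return hs, caches, y

    def train_step(self, xs, target):
        """One backprop-through-time step on a single sequence,
        squared-error loss on the final output."""
        hs, caches, y = self.forward(xs, train=True)
        err = y - target
        dy = err * y * (1.0 - y)              # through the output sigmoid
        g_out = dy * hs[-1]
        dh = dy * self.w_out                  # gradient entering the last hidden state
        gW_in = np.zeros_like(self.W_in)
        gW_rec = np.zeros_like(self.W_rec)
        for x, h_prev, h, W_rec_t in reversed(caches):
            da = dh * h * (1.0 - h)           # through the hidden sigmoid
            gW_in += np.outer(da, x)
            gW_rec += np.outer(da, h_prev)
            dh = W_rec_t.T @ da               # back through the (noisy) recurrent weights
        self.w_out -= self.lr * g_out
        self.W_in -= self.lr * gW_in
        self.W_rec -= self.lr * gW_rec
        return 0.5 * err ** 2

# Toy usage: learn the parity of variable-length binary strings.
net = NoisyRNN(n_in=1, n_hid=4, noise_std=0.05)
for _ in range(5000):
    bits = rng.integers(0, 2, size=int(rng.integers(2, 9)))
    net.train_step(bits.reshape(-1, 1).astype(float), float(bits.sum() % 2))
```

Setting noise_std to zero recovers ordinary training, and calling forward with train=False disables the perturbations, matching the usual practice of injecting noise only while learning.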
Publication Info
- Year: 1996
- Type: article
- Volume: 7
- Issue: 6
- Pages: 1424-1438
- Citations: 138
- Access: Closed
Identifiers
- DOI: 10.1109/72.548170