Generalization and Scaling in Reinforcement Learning

Abstract

In associative reinforcement learning, an environment generates input vectors, a learning system generates possible output vectors, and a reinforcement function computes feedback signals from the input-output pairs. The task is to discover and remember input-output pairs that generate rewards. Especially difficult cases occur when rewards are rare, since the expected time for any algorithm can grow exponentially with the size of the problem. Nonetheless, if a reinforcement function possesses regularities, and a learning algorithm exploits them, learning time can be reduced below that of non-generalizing algorithms. This paper describes a neural network algorithm called complementary reinforcement back-propagation (CRBP), and reports simulation results on problems designed to offer differing opportunities for generalization.

Keywords

Reinforcement learningGeneralizationComputer scienceArtificial intelligenceReinforcementAssociative propertyFunction (biology)ExploitLearning classifier systemFunction approximationArtificial neural networkTask (project management)Machine learningMathematicsEngineering

Related Publications

Neural computation by concentrating information in time.

David W. Tank , J. J. Hopfield

An analog model neural network that can solve a general problem of recognizing patterns in a time-dependent signal is presented. The networks use a patterned set of delays to co...

1987 Proceedings of the National Academy o... 357 citations

Single-Image Super-Resolution Using Sparse Regression and Natural Image Prior

Kwang In Kim , Younghee Kwon

This paper proposes a framework for single-image super-resolution. The underlying idea is to learn a map from input low-resolution images to target high-resolution images based ...

2010 IEEE Transactions on Pattern Analysis... 957 citations

A new optimizer using particle swarm theory

R.C. Eberhart , James Kennedy

The optimization of nonlinear functions using particle swarm methodology is described. Implementations of two paradigms are discussed and compared, including a recently develope...

2002 14646 citations

Network In Network

Min Lin , Qiang Chen , Shuicheng Yan

Abstract: We propose a novel deep network structure called In Network (NIN) to enhance model discriminability for local patches within the receptive field. The conventional con...

2014 arXiv (Cornell University) 1037 citations

Random-Walk Computation of Similarities between Nodes of a Graph with Application to Collaborative Recommendation

François Fouss , Alain Pirotte , Jean-Michel Renders +1 more

This work presents a new perspective on characterizing the similarity between elements of a database or, more generally, nodes of a weighted and undirected graph. It is based on...

2007 IEEE Transactions on Knowledge and Da... 1256 citations

Publication Info

Year: 1989
Type: article
Volume: 2
Pages: 550-557
Citations: 54
Access: Closed

External Links

Citation Metrics

OpenAlex

Cite This

APA Style

                            
                                    David H. Ackley, 
                                
                                    Michael L. Littman
                                
                            (1989). 
                            Generalization and Scaling in Reinforcement Learning. 
                            
                            , 2
                            
                            , 550-557.