A new approach to the design of reinforcement schemes for learning automata
A new class of reinforcement schemes for learning automata that makes use of estimates of the random characteristics of the environment is introduced. Both a single automaton an...