By Topic

The absolutely expedient nonlinear reinforcement schemes under the unknown multiteacher environment

Sign In

Cookies must be enabled to login.After enabling cookies , please use refresh or reload or ctrl+f5 on the browser for the login options.

Formats Non-Member Member
$31 $13
Learn how you can qualify for the best price for this item!
Become an IEEE Member or Subscribe to
IEEE Xplore for exclusive pricing!
close button

puzzle piece

IEEE membership options for an individual and IEEE Xplore subscriptions for an organization offer the most affordable access to essential journal articles, conference papers, standards, eBooks, and eLearning courses.

Learn more about:

IEEE membership

IEEE Xplore subscriptions

1 Author(s)
Baba, N. ; Faculty of Engng., Tokushima Univ., Tokushima City, Japan

Learning behaviours of variable-structure stochastic automata under a multiteacher environment are considered. The concepts of absolute expediency and ε-optimality in a single-teacher environment are extended by the introduction of an average weighted reward and are redefined for a multiteacher environment. As an extended form of the absolutely expedient learning algorithm, a general class of nonlinear learning algorithm, called the GAE scheme, is proposed as a reinforcement scheme in a multiteacher environment. It is shown that the GAE scheme is absolutely expedient and ε-optimal in the general n-teacher environment. Learning behaviours of the GAE scheme in various multiteacher environments are simulated by computer and the results indicate the effectiveness of the GAE scheme.

Published in:

Systems, Man and Cybernetics, IEEE Transactions on  (Volume:SMC-13 ,  Issue: 1 )