For real applications of reinforcement learning, it is necessary to reduce the number of trial-and-errors. The paper proposes a method to use knowledge in reinforcement learning. We have regarded a Bayesian network as a stochastic policy, and adapted a rigid propagation procedure for a stochastic gradient method. We made preliminary experiments to demonstrate our method in a robot navigation task
Published in:
Neural Networks Proceedings, 1998. IEEE World Congress on Computational Intelligence. The 1998 IEEE International Joint Conference on
(Volume:3
)
Date of Conference: 4-9 May 1998