A fuzzy reinforcement learning (FRL) scheme which is based on the principles of sliding-mode control and fuzzy logic is proposed. The FRL uses only immediate reward. Sufficient conditions for the convergence of the FRL to the optimal task performance are studied. The validity of the method is tested through simulation examples of a robot which deburrs a metal surface
Published in:
Systems, Man, and Cybernetics, Part B: Cybernetics, IEEE Transactions on
(Volume:32
,
Issue:
1
)
Date of Publication: Feb 2002