Skip to Main Content
An extended algorithm of the relative reward strength algorithm is proposed. It is shown that the proposed algorithm ensures the convergence with probability I to the optimal path under the certain type of nonstationary environment. Several computer simulation results confirm the effectiveness of the proposed algorithm.
Systems, Man, and Cybernetics, Part B: Cybernetics, IEEE Transactions on (Volume:32 , Issue: 6 )
Date of Publication: Dec 2002