Abstract:
This paper proposes a new algorithm for Temporal-Difference (TD) learning using online support vector regression. It benefits from the good generalization properties supp...Show MoreMetadata
Abstract:
This paper proposes a new algorithm for Temporal-Difference (TD) learning using online support vector regression. It benefits from the good generalization properties support vector regression (SVR) has, and also can do incremental learning and automatically track variation of environment with time-varying characteristics. Using the online SVR we can obtain good estimation of value function in TD learning in linear and nonlinear prediction problems. Experimental results demonstrate the effectiveness of the proposed method by comparison with others methods.
Published in: 2015 12th International Conference on Informatics in Control, Automation and Robotics (ICINCO)
Date of Conference: 21-23 July 2015
Date Added to IEEE Xplore: 10 December 2015
Electronic ISBN:978-9-8975-8149-6
Conference Location: Colmar, France