Abstract:
A multi-resolution state-space discretization method with pseudo-random griding is developed for the episodic unsupervised learning method of Q-Learning. It is used as th...Show MoreMetadata
Abstract:
A multi-resolution state-space discretization method with pseudo-random griding is developed for the episodic unsupervised learning method of Q-Learning. It is used as the learning agent for closed-loop control of morphing or highly reconfigurable systems. This paper develops a method whereby a state-space is adaptively discretized by progressively finer pseudo-random grids around the Regions Of Interest within the state or learning space in an effort to break the Curse of Dimensionality. Utility of the method is demonstrated with application to the problem of a morphing airfoil, which is simulated by a computationally intensive computational fluid dynamics model. By setting the multi-resolution method to define the Region Of Interest by the goal the agent seeks, it is shown that this method with the pseudo-random grid can learn a specific goal within ±0.001, while reducing the total number of state-action pairs needed to achieve this level of specificity to less than 3000.
Date of Conference: 18-23 July 2010
Date Added to IEEE Xplore: 14 October 2010
ISBN Information:
ISSN Information:
Keywords assist with retrieval of results and provide a means to discovering other relevant content. Learn more.
- IEEE Keywords
- Index Terms
- Discrete State Space ,
- Learning Spaces ,
- Curse Of Dimensionality ,
- Discrete Method ,
- State-action Pair ,
- State-space Method ,
- Monte Carlo Simulation ,
- Value Function ,
- Policy Change ,
- Shape Changes ,
- Convergence Rate ,
- Random Generation ,
- Aerodynamic ,
- Series Of Steps ,
- Stopping Criterion ,
- Markov Decision Process ,
- Small Problems ,
- Policy Learning ,
- Discrete Levels ,
- Reinforcement Learning Problem ,
- Action-value Function ,
- Greedy Policy ,
- Temporal Difference Learning ,
- Environmental Details ,
- CFD Model ,
- Flight Phase ,
- Learning Problem ,
- Percentage Of Success ,
- Optimal Control ,
- Optimal Shape
Keywords assist with retrieval of results and provide a means to discovering other relevant content. Learn more.
- IEEE Keywords
- Index Terms
- Discrete State Space ,
- Learning Spaces ,
- Curse Of Dimensionality ,
- Discrete Method ,
- State-action Pair ,
- State-space Method ,
- Monte Carlo Simulation ,
- Value Function ,
- Policy Change ,
- Shape Changes ,
- Convergence Rate ,
- Random Generation ,
- Aerodynamic ,
- Series Of Steps ,
- Stopping Criterion ,
- Markov Decision Process ,
- Small Problems ,
- Policy Learning ,
- Discrete Levels ,
- Reinforcement Learning Problem ,
- Action-value Function ,
- Greedy Policy ,
- Temporal Difference Learning ,
- Environmental Details ,
- CFD Model ,
- Flight Phase ,
- Learning Problem ,
- Percentage Of Success ,
- Optimal Control ,
- Optimal Shape