An Actor-Critic-Identifier Architecture for Adaptive Approximate Optimal Control | part of Reinforcement Learning and Approximate Dynamic Programming for Feedback Control | Wiley-IEEE Press books