Centralized Training and Decentralized Control through the Actor-Critic Paradigm for Highly Optimized Multicores | IEEE Conference Publication | IEEE Xplore