Efficient Off-Policy Q-Learning for Data-Based Discrete-Time LQR Problems | IEEE Journals & Magazine | IEEE Xplore