Optimal Trajectory Tracking Control for a Quadrotor UAV Based on Off-Policy Reinforcement Learning | IEEE Conference Publication | IEEE Xplore