Data-Driven Policy Gradient Method for Optimal Output Feedback Control of LQR | IEEE Conference Publication | IEEE Xplore