Learning an End-To-End Policy for AUV Control Within Just Forty Minutes Using Parallel Simulation | IEEE Conference Publication | IEEE Xplore