Reinforcement Learning-Based Trajectory Optimization for Data Muling With Underwater Mobile Nodes | IEEE Journals & Magazine | IEEE Xplore