Learning Oscillator-Based Gait Controller for String-Form Soft Robots Using Parameter-Exploring Policy Gradients | IEEE Conference Publication | IEEE Xplore