Learning CPG-based biped locomotion with a policy gradient method | IEEE Conference Publication | IEEE Xplore