Phase-dependent trajectory optimization for CPG-based biped walking using path integral reinforcement learning | IEEE Conference Publication