Bipedal walking energy minimization by reinforcement learning with evolving policy parameterization | IEEE Conference Publication | IEEE Xplore