Acquiring of walking behavior for four-legged robots using actor-critic method based on policy gradient | IEEE Conference Publication | IEEE Xplore