Reinforcement Learning in a Safety-Embedded MDP with Trajectory Optimization | IEEE Conference Publication | IEEE Xplore