Model-free safe policy learning via hard action barrier functions | IEEE Conference Publication | IEEE Xplore