Tractable Reinforcement Learning for Signal Temporal Logic Tasks With Counterfactual Experience Replay | IEEE Journals & Magazine | IEEE Xplore