Risk Aversion Operator for Addressing Maximization Bias in Q-Learning | IEEE Journals & Magazine | IEEE Xplore