Extending the Peak Bandwidth of Parameters for Softmax Selection in Reinforcement Learning | IEEE Journals & Magazine | IEEE Xplore