A reinforcement learning framework based on regret minimization for approximating best response in fictitious self-play | IEEE Conference Publication | IEEE Xplore