Loading [MathJax]/extensions/MathMenu.js
D2CFR: Minimize Counterfactual Regret With Deep Dueling Neural Network | IEEE Journals & Magazine | IEEE Xplore

D2CFR: Minimize Counterfactual Regret With Deep Dueling Neural Network


Abstract:

Counterfactual regret minimization (CFR) is a popular method for finding approximate Nash equilibrium in two-player zero-sum games with imperfect information. Solving lar...Show More

Abstract:

Counterfactual regret minimization (CFR) is a popular method for finding approximate Nash equilibrium in two-player zero-sum games with imperfect information. Solving large-scale games with CFR needs a combination of abstraction techniques and certain expert knowledge, which constrains its scalability. Recent neural-based CFR methods mitigate the need for abstraction and expert knowledge by training an efficient network to directly obtain counterfactual regret without abstraction. However, these methods only consider estimating regret values for individual actions, neglecting the evaluation of state values, which are significant for decision-making. In this article, we introduce deep dueling CFR (D2CFR), which emphasizes the state value estimation by employing a novel value network with a dueling structure. Moreover, a rectification module based on a time-shifted Monte Carlo simulation is designed to rectify the inaccurate state value estimation. Extensive experimental results are conducted to show that D2CFR converges faster and outperforms comparison methods on test games.
Published in: IEEE Transactions on Neural Networks and Learning Systems ( Volume: 35, Issue: 12, December 2024)
Page(s): 18343 - 18356
Date of Publication: 06 October 2023

ISSN Information:

PubMed ID: 37801386

Funding Agency:


Contact IEEE to Subscribe

References

References is not available for this document.