I. Introduction
Simultaneous games have been a challenging topic in artificial intelligence and are particularly significant in the study of non-cooperative games [1], [2]. Many well-known games are instances of this model, including Goofspiel, StarCraft, and pursuit-evasion games. Compared with most algorithms for solving simultaneous games, counterfactual regret minimization (CFR) models the characteristics of simultaneous decision-making accurately and, over time, finds the optimal solution with high probability [3], [4].