I. Introduction
Despite numerous advances in deep reinforcement learning (DRL), the majority of successes have been in two-player, zero-sum games, where it is guaranteed to converge to an optimal policy [1], such as Chess and Go [2]. Rare (and relatively recent) exceptions include Blade & Soul [3], no-press diplomacy [4], Poker
We note that, even in this case, a two-player version of Texas Hold ’em was initially assumed [5] but later superseded by a multi-player system.
[6], and StarCraft [7], [8]. In particular, there has been little work on agent development for the full 4-player game of Monopoly, despite it being one of the most popular strategic board games in the last 85 years.