I. Introduction
Efficient decision-making algorithms that can operate under computational constraints are essential in developing robotic systems. These systems need to select reasonable decisions from an extensive range of options within a limited time frame while constrained by finite computational resources. In multi-agent scenarios, such as generating motion trajectories that coordinate multiple agents, the challenge is further amplified, as the trajectories of different agents are interdependent, and the decisions made by one agent can impact the decisions of others. Moreover, when agents have different goals, balancing individual and collective decision-making objectives while considering each agent's computational limitations can make the problem even more complex.