By Topic

Asymptotically Optimal Decentralized Control for Large Population Stochastic Multiagent Systems

Sign In

Cookies must be enabled to login.After enabling cookies , please use refresh or reload or ctrl+f5 on the browser for the login options.

Formats Non-Member Member
$31 $13
Learn how you can qualify for the best price for this item!
Become an IEEE Member or Subscribe to
IEEE Xplore for exclusive pricing!
close button

puzzle piece

IEEE membership options for an individual and IEEE Xplore subscriptions for an organization offer the most affordable access to essential journal articles, conference papers, standards, eBooks, and eLearning courses.

Learn more about:

IEEE membership

IEEE Xplore subscriptions

2 Author(s)
Tao Li ; Lab. of Syst. & Control, Acad. of Math. & Syst. Sci., Beijing ; Ji-Feng Zhang

The interaction of interest-coupled decision-makers and the uncertainty of individual behavior are prominent characteristics of multiagent systems (MAS). How to break through the framework of conventional control theory, which aims at single decision-maker and single decision objective, and to extend the methodology and tools in the stochastic adaptive control theory to analyze MAS are of great significance. In this paper, a preliminary exploration is made in this direction, and the decentralized control problem is considered for large population stochastic MAS with coupled cost functions. Different from the deterministic discounted costs in the existing differential game models, a time-averaged stochastic cost function is adopted for each agent. The decentralized control law is constructed based on the state aggregation method and tracking-like quadratic optimal control. By using probability limit theory, the stability and optimality of the closed-loop system are analyzed. The main contributions of this paper include the following points. 1) The concepts of asymptotic Nash-equilibrium in probability and almost surely, respectively, are introduced and the relationship between these concepts is illuminated, which provide necessary tools for analyzing the optimality of the decentralized control laws. 2) The closed-loop system is shown to be almost surely uniformly stable, and bounded independently of the number of agents N . 3) The population state average (PSA) is shown to converge to the infinite population mean (IPM) trajectory in the sense of both L2-norm and time average almost surely, as N increases to infinity. 4) The decentralized control law is designed and shown to be almost surely asymptotically optimal; the cost of each agent based on local measurements converges to that based on global measurements almost surely, as N increases to infinity.

Published in:

Automatic Control, IEEE Transactions on  (Volume:53 ,  Issue: 7 )