**4**Author(s)

The purpose of this paper is to synthesize initial mean consensus behavior of a set of agents from the fundamental optimization principles of i) stochastic dynamic games, and ii) optimal control. In the stochastic dynamic game model each agent seeks to minimize its individual quadratic discounted cost function involving the mean of the states of all other agents. In this formulation we derive the limiting infinite population mean field equation system and explicitly compute its unique solution. The resulting mean field (MF) control strategies drive each agent to track the overall population's initial state distribution mean, and by applying these decentralized strategies, any finite population system reaches mean consensus asymptotically as time goes to infinity. Furthermore, these control laws possess an ε*N*-Nash equilibrium property where ε*N* goes to zero as the population size N goes to infinity. Finally, the analysis is extended to the case of random mean field couplings. In the social cooperative formulation the basic objective is to minimize a social cost as the sum of the individual cost functions containing mean field coupling. In this formulation we show that for any individual agent the decentralized mean field social (MF Social) control strategy is the same as the mean field Nash (MF Nash) equilibrium strategy. Hence MF-Nash Controls *U*_{Nash}*N*=MF - Social Controls *U*_{Soc}*N*. On the other hand, the solution to the centralized LQR optimal control formulation yields the Standard Consensus (SC) algorithm whenever the graph representing the corresponding topology of the network is Completely Connected (CC). Hence Cent. LQR Controls *U*_{Cent}*N*=SC-CC Controls *U*_{SC}*N*. Moreover, a system with centralized control laws reaches consensus on the initial state distribution mean as time and population size N go to infinity. Hence, asymptotically in time M- -Nash Controls *U*_{Nash}*N*=MF-Social Controls *U*_{Soc}*N* = Cent. LQR Controls *U*_{Cent}^{∞} = SC-CC Controls *U*_{SC}^{∞}. Finally, the analysis is extended to the long time average (LTA) cost functions case.

- Page(s):
- 639 - 653
- ISSN :
- 0018-9286
- INSPEC Accession Number:
- 13324274
- DOI:
- 10.1109/TAC.2012.2215399

- Date of Publication :
- 24 August 2012
- Date of Current Version :
- 18 February 2013
- Issue Date :
- March 2013
- Sponsored by :
- IEEE Control Systems Society
- Publisher:
- IEEE