Approximate Equilibrium Computation for Discrete-Time Linear-Quadratic Mean-Field Games

被引:0
作者
Zaman, Muhammad Aneeq Uz [1 ]
Zhang, Kaiqing [1 ]
Miehling, Erik [1 ]
Basar, Tamer [1 ]
机构
[1] Univ Illinois, Coordinated Sci Lab, Urbana, IL 61801 USA
来源
2020 AMERICAN CONTROL CONFERENCE (ACC) | 2020年
关键词
TRACKING CONTROL; SYSTEMS;
D O I
10.23919/acc45564.2020.9147474
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
While the topic of mean-field games (MFGs) has a relatively long history, heretofore there has been limited work concerning algorithms for the computation of equilibrium control policies. In this paper, we develop a computable policy iteration algorithm for approximating the mean-field equilibrium in linear-quadratic MFGs with discounted cost. Given the mean-field, each agent faces a linear-quadratic tracking problem, the solution of which involves a dynamical system evolving in retrograde time. This makes the development of forward-in-time algorithm updates challenging. By identifying a structural property of the mean-field update operator, namely that it preserves sequences of a particular form, we develop a forward-in-time equilibrium computation algorithm. Bounds that quantify the accuracy of the computed mean-field equilibrium as a function of the algorithm's stopping condition are provided. The optimality of the computed equilibrium is validated numerically. In contrast to the most recent/concurrent results, our algorithm appears to be the first to study infinite-horizon MFGs with non-stationary mean-field equilibria, though with focus on the linear quadratic setting.
引用
收藏
页码:333 / 339
页数:7
相关论文
共 31 条
[1]  
[Anonymous], 2009, An introduction to multiagent systems
[2]  
[Anonymous], 1997, OPTIMIZATION VECTOR
[3]  
[Anonymous], 2003, THESIS
[4]  
Basar T., 1999, DYNAMIC NONCOOPERATI, V23
[5]   Linear-Quadratic Mean Field Games [J].
Bensoussan, A. ;
Sung, K. C. J. ;
Yam, S. C. P. ;
Yung, S. P. .
JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 2016, 169 (02) :496-529
[6]  
Bertsekas D., 2012, Dynamic programming and optimal control, V1
[7]  
Bradtke S. J., 1993, P ADV NEUR INF PROC, P295
[8]   Mean-field analysis of an inductive reasoning game: Application to influenza vaccination [J].
Breban, Romulus ;
Vardavas, Raffaele ;
Blower, Sally .
PHYSICAL REVIEW E, 2007, 76 (03)
[9]   Mean field game of controls and an application to trade crowding [J].
Cardaliaguet, Pierre ;
Lehalle, Charles-Albert .
MATHEMATICS AND FINANCIAL ECONOMICS, 2018, 12 (03) :335-363
[10]   Electrical Vehicles in the Smart Grid: A Mean Field Game Analysis [J].
Couillet, Romain ;
Perlaza, Samir M. ;
Tembine, Hamidou ;
Debbah, Merouane .
IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2012, 30 (06) :1086-1096