Approximate Equilibrium Computation for Discrete-Time Linear-Quadratic Mean-Field Games

被引：0

作者：

Zaman, Muhammad Aneeq Uz ^{[1
]}

Zhang, Kaiqing ^{[1
]}

Miehling, Erik ^{[1
]}

Basar, Tamer ^{[1
]}

机构：

[1] Univ Illinois, Coordinated Sci Lab, Urbana, IL 61801 USA

来源：

2020 AMERICAN CONTROL CONFERENCE (ACC) | 2020年

关键词：

TRACKING CONTROL; SYSTEMS;

D O I：

10.23919/acc45564.2020.9147474

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

While the topic of mean-field games (MFGs) has a relatively long history, heretofore there has been limited work concerning algorithms for the computation of equilibrium control policies. In this paper, we develop a computable policy iteration algorithm for approximating the mean-field equilibrium in linear-quadratic MFGs with discounted cost. Given the mean-field, each agent faces a linear-quadratic tracking problem, the solution of which involves a dynamical system evolving in retrograde time. This makes the development of forward-in-time algorithm updates challenging. By identifying a structural property of the mean-field update operator, namely that it preserves sequences of a particular form, we develop a forward-in-time equilibrium computation algorithm. Bounds that quantify the accuracy of the computed mean-field equilibrium as a function of the algorithm's stopping condition are provided. The optimality of the computed equilibrium is validated numerically. In contrast to the most recent/concurrent results, our algorithm appears to be the first to study infinite-horizon MFGs with non-stationary mean-field equilibria, though with focus on the linear quadratic setting.

引用

页码：333 / 339

页数：7

共 31 条

[1]

[Anonymous], 2009, An introduction to multiagent systems

[2]

[Anonymous], 1997, OPTIMIZATION VECTOR

[3]

[Anonymous], 2003, THESIS

[4]

Basar T., 1999, DYNAMIC NONCOOPERATI, V23

[5] Linear-Quadratic Mean Field Games [J].

Bensoussan, A. ;

Sung, K. C. J. ;

Yam, S. C. P. ;

Yung, S. P. .

JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 2016, 169 (02) :496-529

[6]

Bertsekas D., 2012, Dynamic programming and optimal control, V1

[7]

Bradtke S. J., 1993, P ADV NEUR INF PROC, P295

[8] Mean-field analysis of an inductive reasoning game: Application to influenza vaccination [J].

Breban, Romulus ;

Vardavas, Raffaele ;

Blower, Sally .

PHYSICAL REVIEW E, 2007, 76 (03)

[9] Mean field game of controls and an application to trade crowding [J].

Cardaliaguet, Pierre ;

Lehalle, Charles-Albert .

MATHEMATICS AND FINANCIAL ECONOMICS, 2018, 12 (03) :335-363

[10] Electrical Vehicles in the Smart Grid: A Mean Field Game Analysis [J].

Couillet, Romain ;

Perlaza, Samir M. ;

Tembine, Hamidou ;

Debbah, Merouane .

IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2012, 30 (06) :1086-1096

← 1 2 3 4 →