Learning Discrete-Time Major-Minor Mean Field Games

Cited by: 0

Authors:

Cui, Kai [1]

Dayanikli, Gokce [2]

Lauriere, Mathieu [3]

Geist, Matthieu [4]

Pietquin, Olivier [5]

Koeppl, Heinz [1]

Affiliations:

[1] Tech Univ Darmstadt, Darmstadt, Germany

[2] Univ Illinois, Urbana, IL USA

[3] NYU Shanghai, Shanghai, Peoples R China

[4] Google DeepMind, London, England

[5] Cohere, Toronto, ON, Canada

Source:

THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 9 | 2024

Keywords:

NASH EQUILIBRIA; CONVERGENCE; COMPLEXITY; SYSTEMS

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Recent techniques based on Mean Field Games (MFGs) allow the scalable analysis of multi-player games with many similar, rational agents. However, standard MFGs remain limited to homogeneous players that weakly influence each other, and cannot model major players that strongly influence other players, severely limiting the class of problems that can be handled. We propose a novel discrete-time version of major-minor MFGs (M3FGs), along with a learning algorithm based on fictitious play and partitioning the probability simplex. Importantly, M3FGs generalize MFGs with common noise and can handle not only random exogenous environment states but also major players. A key challenge is that the mean field is stochastic and not deterministic as in standard MFGs. Our theoretical investigation verifies both the M3FG model and its algorithmic solution, showing firstly the well-posedness of the M3FG model starting from a finite game of interest, and secondly convergence and approximation guarantees of the fictitious play algorithm. Then, we empirically verify the obtained theoretical results, ablating some of the theoretical assumptions made, and show successful equilibrium learning in three example problems. Overall, we establish a learning framework for a novel and broad class of tractable games.
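
The abstract mentions partitioning the probability simplex but this record does not say how the partition is built. As a rough illustration only, the sketch below assumes a uniform grid: it enumerates every probability vector whose entries are multiples of 1/resolution and assigns a mean field to its nearest grid point in L1 distance. The function names `simplex_grid` and `nearest_cell`, the uniform grid, and the L1 metric are all assumptions made here for illustration; they are not taken from the paper.

```python
from itertools import combinations


def simplex_grid(num_states, resolution):
    """Enumerate all probability vectors over num_states states whose
    entries are multiples of 1/resolution (hypothetical uniform partition)."""
    points = []
    # Stars and bars: place (num_states - 1) bars among the available slots;
    # the gaps between consecutive bars give the integer counts per state.
    slots = resolution + num_states - 1
    for bars in combinations(range(slots), num_states - 1):
        edges = (-1,) + bars + (slots,)
        counts = [edges[i + 1] - edges[i] - 1 for i in range(num_states)]
        points.append(tuple(c / resolution for c in counts))
    return points


def nearest_cell(mean_field, grid):
    """Return the index of the grid point closest to mean_field in L1 distance."""
    def l1(k):
        return sum(abs(a - b) for a, b in zip(mean_field, grid[k]))
    return min(range(len(grid)), key=l1)


# Example: 3 states, grid step 1/4.
grid = simplex_grid(num_states=3, resolution=4)
cell = nearest_cell((0.5, 0.3, 0.2), grid)
```

Under this assumed discretization, a policy that depends on the stochastic mean field could be indexed by `cell` instead of by the continuous distribution. The grid size grows combinatorially with the number of states and the resolution, which is one reason the choice of partition matters.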

Pages: 9616-9625

Number of pages: 10
