Learning Discrete-Time Major-Minor Mean Field Games

被引:0
|
作者
Cui, Kai [1 ]
Dayanikli, Gokce [2 ]
Lauriere, Mathieu [3 ]
Geist, Matthieu [4 ]
Pietquin, Olivier [5 ]
Koeppl, Heinz [1 ]
机构
[1] Tech Univ Darmstadt, Darmstadt, Germany
[2] Univ Illinois, Urbana, IL USA
[3] NYU Shanghai, Shanghai, Peoples R China
[4] Google DeepMind, London, England
[5] Cohere, Toronto, ON, Canada
关键词
NASH EQUILIBRIA; CONVERGENCE; COMPLEXITY; SYSTEMS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent techniques based on Mean Field Games (MFGs) allow the scalable analysis of multi-player games with many similar, rational agents. However, standard MFGs remain limited to homogeneous players that weakly influence each other, and cannot model major players that strongly influence other players, severely limiting the class of problems that can be handled. We propose a novel discrete time version of major-minor MFGs (M3FGs), along with a learning algorithm based on fictitious play and partitioning the probability simplex. Importantly, M3FGs generalize MFGs with common noise and can handle not only random exogeneous environment states but also major players. A key challenge is that the mean field is stochastic and not deterministic as in standard MFGs. Our theoretical investigation verifies both the M3FG model and its algorithmic solution, showing firstly the well-posedness of the M3FG model starting from a finite game of interest, and secondly convergence and approximation guarantees of the fictitious play algorithm. Then, we empirically verify the obtained theoretical results, ablating some of the theoretical assumptions made, and show successful equilibrium learning in three example problems. Overall, we establish a learning framework for a novel and broad class of tractable games.
引用
收藏
页码:9616 / 9625
页数:10
相关论文
共 50 条
  • [11] Partially Observed Discrete-Time Risk-Sensitive Mean Field Games
    Naci Saldi
    Tamer Başar
    Maxim Raginsky
    Dynamic Games and Applications, 2023, 13 : 929 - 960
  • [12] Partially Observed Discrete-Time Risk-Sensitive Mean Field Games
    Saldi, Naci
    Basar, Tamer
    Raginsky, Maxim
    DYNAMIC GAMES AND APPLICATIONS, 2023, 13 (03) : 929 - 960
  • [13] Reinforcement Learning in Non-Stationary Discrete-Time Linear-Quadratic Mean-Field Games
    Zaman, Muhammad Aneeq Uz
    Zhang, Kaiqing
    Miehling, Erik
    Basar, Tamer
    2020 59TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2020, : 2278 - 2284
  • [14] Discrete-time average-cost mean-field games on Polish spaces
    Saldi, Naci
    TURKISH JOURNAL OF MATHEMATICS, 2020, 44 (02) : 463 - 480
  • [15] Discrete-Time Ergodic Mean-Field Games with Average Reward on Compact Spaces
    Wiecek, Piotr
    DYNAMIC GAMES AND APPLICATIONS, 2020, 10 (01) : 222 - 256
  • [16] A PROBABILISTIC APPROACH TO MEAN FIELD GAMES WITH MAJOR AND MINOR PLAYERS
    Carmona, Rene
    Zhu, Xiuneng
    ANNALS OF APPLIED PROBABILITY, 2016, 26 (03): : 1535 - 1580
  • [17] Discrete-Time Ergodic Mean-Field Games with Average Reward on Compact Spaces
    Piotr Więcek
    Dynamic Games and Applications, 2020, 10 : 222 - 256
  • [18] Reinforcement Learning for Non-stationary Discrete-Time Linear–Quadratic Mean-Field Games in Multiple Populations
    Muhammad Aneeq uz Zaman
    Erik Miehling
    Tamer Başar
    Dynamic Games and Applications, 2023, 13 : 118 - 164
  • [19] Approximate Equilibrium Computation for Discrete-Time Linear-Quadratic Mean-Field Games
    Zaman, Muhammad Aneeq Uz
    Zhang, Kaiqing
    Miehling, Erik
    Basar, Tamer
    2020 AMERICAN CONTROL CONFERENCE (ACC), 2020, : 333 - 339
  • [20] Convex analysis for LQG systems with applications to major-minor LQG mean-field game systems
    Firoozi, Dena
    Jaimungal, Sebastian
    Caines, Peter E.
    SYSTEMS & CONTROL LETTERS, 2020, 142