Multiagent Online Learning in Time-Varying Games

被引:5
|
作者
Duvocelle, Benoit [1 ]
Mertikopoulos, Panayotis [2 ,3 ]
Staudigl, Mathias [4 ]
Vermeulen, Dries [1 ]
机构
[1] Maastricht Univ, Dept Quantitat Econ, NL-6200 MD Maastricht, Netherlands
[2] Univ Grenoble Alpes, LIG, Grenoble INP, CNRS,Inria, F-38000 Grenoble, France
[3] Criteo AI Lab, F-38130 Echirolles, France
[4] Maastricht Univ, Dept Adv Comp Sci, NL-6200 MD Maastricht, Netherlands
关键词
dynamic regret; Nash equilibrium; mirror descent; time-varying games; STOCHASTIC-APPROXIMATION; OPTIMIZATION; DYNAMICS; CONVERGENCE; GRADIENT; DESCENT; PLAY; FORM;
D O I
10.1287/moor.2022.1283
中图分类号
C93 [管理学]; O22 [运筹学];
学科分类号
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
摘要
We examine the long-run behavior of multiagent online learning in games that evolve over time. Specifically, we focus on a wide class of policies based on mirror descent, and we show that the induced sequence of play (a) converges to a Nash equilibrium in time-varying games that stabilize in the long run to a strictly monotone limit, and (b) it stays asymptotically close to the evolving equilibrium of the sequence of stage games (assuming they are strongly monotone). Our results apply to both gradient- and payoff-based feedback-that is, when players only get to observe the payoffs of their chosen actions.
引用
收藏
页码:914 / 941
页数:28
相关论文
共 50 条
  • [21] Continuous Distributed Robust Optimization of Multiagent Systems With Time-Varying Cost
    Zhang, Renyongkang
    Guo, Ge
    IEEE TRANSACTIONS ON CONTROL OF NETWORK SYSTEMS, 2024, 11 (02): : 586 - 598
  • [22] Online Proximal-ADMM for Time-Varying Constrained Convex Optimization
    Zhang, Yijian
    Dall'Anese, Emiliano
    Hong, Mingyi
    IEEE TRANSACTIONS ON SIGNAL AND INFORMATION PROCESSING OVER NETWORKS, 2021, 7 : 144 - 155
  • [23] Prescribed-Time Time-Varying Output Formation Tracking for Heterogeneous Multiagent Systems
    Shi, Zhexin
    Feng, Zhi
    Wang, Qing
    Dong, Xiwang
    Lu, Jinhu
    Ren, Zhang
    Wang, Danwei
    IEEE INTERNET OF THINGS JOURNAL, 2025, 12 (09): : 11622 - 11632
  • [24] Distributed Online Learning over Time-varying Graphs via Proximal Gradient Descent
    Dixit, Rishabh
    Bedi, Amrit Singh
    Rajawat, Ketan
    Koppel, Alec
    2019 IEEE 58TH CONFERENCE ON DECISION AND CONTROL (CDC), 2019, : 2745 - 2751
  • [25] Event-Triggered Consensus of Multiagent Systems With Time-Varying Communication Delay
    Chen, Mengshen
    Yan, Huaicheng
    Zhang, Hao
    Chen, Shiming
    Li, Zhichen
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2022, 52 (05): : 2706 - 2720
  • [26] Wave Equation-Based Time-Varying Formation Control of Multiagent Systems
    Qi, Jie
    Zhang, Jing
    Ding, Yongsheng
    IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2018, 26 (05) : 1578 - 1591
  • [27] Decentralized Fictitious Play in Near-Potential Games With Time-Varying Communication Networks
    Aydin, Sarper
    Arefizadeh, Sina
    Eksin, Ceyhun
    IEEE CONTROL SYSTEMS LETTERS, 2022, 6 : 1226 - 1231
  • [28] A time-varying iterative learning control scheme
    Tharayil, M
    Alleyne, A
    PROCEEDINGS OF THE 2004 AMERICAN CONTROL CONFERENCE, VOLS 1-6, 2004, : 3782 - 3787
  • [29] Gradient Dynamics in Linear Quadratic Network Games with Time-Varying Connectivity and Population Fluctuation
    Al Taha, Feras
    Rokade, Kiran
    Parise, Francesca
    2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023, : 1991 - 1996
  • [30] An Augmented Game Approach for Design and Analysis of Distributed Learning Dynamics in Multiagent Games
    Tan, Shaolin
    Fang, Zhihong
    Wang, Yaonan
    Lu, Jinhu
    IEEE TRANSACTIONS ON CYBERNETICS, 2023, 53 (11) : 6951 - 6962