Multiagent Online Learning in Time-Varying Games

Cited by: 5
Authors
Duvocelle, Benoit [1]
Mertikopoulos, Panayotis [2, 3]
Staudigl, Mathias [4]
Vermeulen, Dries [1]
Affiliations
[1] Maastricht Univ, Dept Quantitat Econ, NL-6200 MD Maastricht, Netherlands
[2] Univ Grenoble Alpes, LIG, Grenoble INP, CNRS, Inria, F-38000 Grenoble, France
[3] Criteo AI Lab, F-38130 Echirolles, France
[4] Maastricht Univ, Dept Adv Comp Sci, NL-6200 MD Maastricht, Netherlands
Keywords
dynamic regret; Nash equilibrium; mirror descent; time-varying games; stochastic approximation; optimization; dynamics; convergence; gradient; descent; play; form
DOI
10.1287/moor.2022.1283
Chinese Library Classification (CLC) number
C93 [Management Science]; O22 [Operations Research]
Discipline classification code
070105; 12; 1201; 1202; 120202
Abstract
We examine the long-run behavior of multiagent online learning in games that evolve over time. Specifically, we focus on a wide class of policies based on mirror descent, and we show that the induced sequence of play (a) converges to a Nash equilibrium in time-varying games that stabilize in the long run to a strictly monotone limit, and (b) stays asymptotically close to the evolving equilibrium of the sequence of stage games (assuming they are strongly monotone). Our results apply to both gradient-based and payoff-based feedback; in the latter case, players only get to observe the payoffs of their chosen actions.
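As a rough illustration of claim (a), the sketch below runs mirror descent with the Euclidean regularizer (that is, projected gradient descent) for two players on the interval [0, 1]. Everything specific in it is an assumption made for illustration and is not taken from the paper: the quadratic stage losses, the coupling constant 0.25, the drift term 0.3 / (t + 1), the step size t^(-0.6), and the function names. The stage games are strongly monotone and settle to a fixed limit game, so the iterates should approach the limit game's Nash equilibrium.

```python
def clip(value, low=0.0, high=1.0):
    """Euclidean projection onto the interval [low, high]."""
    return max(low, min(high, value))


COUPLING = 0.25          # hypothetical interaction strength between players
BASE_TARGETS = (0.6, 0.4)  # hypothetical targets of the limit game


def stage_gradient(x, t):
    """Individual loss gradients in the stage game played at round t.

    Assumed loss of player i: 0.5 * (x_i - target_i(t))**2 + COUPLING * x_i * x_j.
    The targets drift with t but stabilize as t grows.
    """
    drift = 0.3 / (t + 1)
    targets = (BASE_TARGETS[0] + drift, BASE_TARGETS[1] - drift)
    return [
        (x[0] - targets[0]) + COUPLING * x[1],
        (x[1] - targets[1]) + COUPLING * x[0],
    ]


def run(num_rounds=2000):
    """Simultaneous projected gradient play with a vanishing step size."""
    x = [0.0, 1.0]  # arbitrary initial actions
    for t in range(1, num_rounds + 1):
        step = t ** -0.6
        g = stage_gradient(x, t)
        x = [clip(x[i] - step * g[i]) for i in range(2)]
    return x


def limit_equilibrium():
    """Interior Nash equilibrium of the limit game (first-order conditions)."""
    b1, b2 = BASE_TARGETS
    x1 = (b1 - COUPLING * b2) / (1 - COUPLING ** 2)
    x2 = b2 - COUPLING * x1
    return [x1, x2]


if __name__ == "__main__":
    print("last iterate:     ", run())
    print("limit equilibrium:", limit_equilibrium())
```

This version uses full gradient feedback and no noise; the payoff-based setting mentioned in the abstract would replace `stage_gradient` with an estimate built from observed payoffs only, which is not attempted here.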
Pages: 914-941
Number of pages: 28