Multiagent Online Learning in Time-Varying Games

被引：5

作者：

Duvocelle, Benoit ^{[1
]}

Mertikopoulos, Panayotis ^{[2
,3
]}

Staudigl, Mathias ^{[4
]}

Vermeulen, Dries ^{[1
]}

机构：

[1] Maastricht Univ, Dept Quantitat Econ, NL-6200 MD Maastricht, Netherlands

[2] Univ Grenoble Alpes, LIG, Grenoble INP, CNRS,Inria, F-38000 Grenoble, France

[3] Criteo AI Lab, F-38130 Echirolles, France

[4] Maastricht Univ, Dept Adv Comp Sci, NL-6200 MD Maastricht, Netherlands

来源：

MATHEMATICS OF OPERATIONS RESEARCH | 2023年 / 48卷 / 02期

关键词：

dynamic regret; Nash equilibrium; mirror descent; time-varying games; STOCHASTIC-APPROXIMATION; OPTIMIZATION; DYNAMICS; CONVERGENCE; GRADIENT; DESCENT; PLAY; FORM;

D O I：

10.1287/moor.2022.1283

中图分类号：

C93 [管理学]; O22 [运筹学];

学科分类号：

070105 ; 12 ; 1201 ; 1202 ; 120202 ;

摘要：

We examine the long-run behavior of multiagent online learning in games that evolve over time. Specifically, we focus on a wide class of policies based on mirror descent, and we show that the induced sequence of play (a) converges to a Nash equilibrium in time-varying games that stabilize in the long run to a strictly monotone limit, and (b) it stays asymptotically close to the evolving equilibrium of the sequence of stage games (assuming they are strongly monotone). Our results apply to both gradient- and payoff-based feedback-that is, when players only get to observe the payoffs of their chosen actions.

引用

页码：914 / 941

页数：28

共 50 条

[1] Asynchronous and Time-Varying Proximal Type Dynamics in Multiagent Network Games
Cenedese, Carlo
Belgioioso, Giuseppe
Kawano, Yu
Grammatico, Sergio
Cao, Ming
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2021, 66 (06) : 2861 - 2867
[2] Centralized and Distributed Online Learning for Sparse Time-Varying Optimization
Fosson, Sophie M.
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2021, 66 (06) : 2542 - 2557
[3] Learning Time-Varying Graphs From Online Data
Natali, Alberto
Isufi, Elvin
Coutino, Mario
Leus, Geert
IEEE OPEN JOURNAL OF SIGNAL PROCESSING, 2022, 3 : 212 - 228
[4] Distributed Adaptive Subgradient Algorithms for Online Learning Over Time-Varying Networks
Zhang, Mingchuan
Hao, Bowei
Ge, Quanbo
Zhu, Junlong
Zheng, Ruijuan
Wu, Qingtao
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2022, 52 (07): : 4518 - 4529
[5] Stability of Evolutionary Games with Time-varying Payoffs
Wang, Yuanhua
Cheng, Daizhan
PROCEEDINGS OF THE 35TH CHINESE CONTROL CONFERENCE 2016, 2016, : 579 - 584
[6] Payoff Distribution in Robust Coalitional Games on Time-Varying Networks
Raja, Aitazaz Ali
Grammatico, Sergio
IEEE TRANSACTIONS ON CONTROL OF NETWORK SYSTEMS, 2022, 9 (01): : 511 - 520
[7] Model and Control for a Class of Networked Evolutionary Games with Finite Memories and Time-Varying Networks
Fu, Shihua
Zhao, Guodong
Li, Haitao
Alsaedi, Ahmed
Alsaadi, Fuad E.
CIRCUITS SYSTEMS AND SIGNAL PROCESSING, 2018, 37 (07) : 3093 - 3114
[8] SOCIAL LEARNING IN NETWORKS WITH TIME-VARYING TOPOLOGIES
Liu, Qipeng
Wang, Xiaofan
ASIAN JOURNAL OF CONTROL, 2014, 16 (05) : 1342 - 1349
[9] Distributed Nonconvex Multiagent Optimization Over Time-Varying Networks
Sun, Ying
Scutari, Gesualdo
Palomar, Daniel
2016 50TH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS AND COMPUTERS, 2016, : 788 - 794
[10] A Varying-Gain Recurrent Neural Network and Its Application to Solving Online Time-Varying Matrix Equation
Zhang, Zhijun
Deng, Xianzhi
Qu, Xilong
Liao, Bolin
Kong, Ling-Dong
Li, Lulan
IEEE ACCESS, 2018, 6 : 77940 - 77952

← 1 2 3 4 5 →