On Last-Iterate Convergence Beyond Zero-Sum Games

被引：0

作者：

Anagnostides, Ioannis ^{[1
]}

Panageas, Ioannis ^{[2
]}

Farina, Gabriele ^{[1
]}

Sandholm, Tuomas ^{[1
,3
,4
,5
]}

机构：

[1] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA

[2] Univ Calif Irvine, Irvine, CA 92717 USA

[3] Strategy Robot Inc, Pittsburgh, PA USA

[4] Optimized Markets Inc, Pittsburgh, PA USA

[5] Strateg Machine Inc, Charlotte, NC USA

来源：

INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162 | 2022年

基金：

美国国家科学基金会;

关键词：

PROPORTIONAL RESPONSE DYNAMICS; SADDLE-POINT PROBLEMS; OPTIMISTIC GRADIENT; NASH EQUILIBRIA; COMPLEXITY;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Most existing results about last-iterate convergence of learning dynamics are limited to twoplayer zero-sum games, and only apply under rigid assumptions about what dynamics the players follow. In this paper we provide new results and techniques that apply to broader families of games and learning dynamics. First, we show that in a class of games that includes constant-sum polymatrix and strategically zero-sum games, the trajectories of dynamics such as optimistic mirror descent (OMD) exhibit a boundedness property, which holds even when players employ different algorithms and prediction mechanisms. This property enables us to obtain O(1/root T) rates and optimal O(1) regret bounds. Our analysis also reveals a surprising property: OMD either reaches arbitrarily close to a Nash equilibrium or it outperforms the robust price of anarchy in efficiency. Moreover, for potential games we establish convergence to an c-equilibrium after O(1/c(2)) iterations for mirror descent under a broad class of regularizers, as well as optimal O(1) regret bounds for OMD variants. Our framework also extends to near-potential games, and unifies known analyses for distributed learning in Fisher's market model. Finally, we analyze the convergence, efficiency, and robustness of optimistic gradient descent (OGD) in general-sum continuous games.

引用

页码：536 / 581

页数：46

共 87 条

[1] Abernethy J. D., 2011, P 24 ANN C LEARNING, V19, P27
[2] Adler I, 2009, LECT NOTES COMPUT SC, V5929, P471, DOI 10.1007/978-3-642-10841-9_44
[3] Aumann Robert J., 1974, J MATH ECON, V1, P67, DOI DOI 10.1016/0304-4068(74)90037-8
[4] THE PRICE OF ROUTING UNSPLITTABLE FLOW
Awerbuch, Baruch
Azar, Yossi
Epstein, Amir
[J]. SIAM JOURNAL ON COMPUTING, 2013, 42 (01) : 160 - 177
[5] Azizian W, 2021, PR MACH LEARN RES, V134, P326
[6] Query Complexity of Approximate Nash Equilibria
Babichenko, Yakov
[J]. STOC'14: PROCEEDINGS OF THE 46TH ANNUAL 2014 ACM SYMPOSIUM ON THEORY OF COMPUTING, 2014, : 535 - 544
[7] Bailey James P., 2019, ADV NEURAL INFORM PR, V32
[8] Bielawski Jakub, 2021, PR MACH LEARN RES, V139
[9] Billings D., 2005, P 21 C UNC ART INT, P550
[10] Birnbaum B. E., 2011, P 12 ACM C EL COMM, P127

← 1 2 3 4 5 6 7 8 9 →