Faster Optimistic Online Mirror Descent for Extensive-Form Games

被引:0
|
作者
Jiang, Huacong [1 ]
Liu, Weiming [1 ]
Li, Bin [1 ]
机构
[1] Univ Sci & Technol China, Hefei, Peoples R China
基金
中国国家自然科学基金;
关键词
Adaptive optimistic online mirror descent; Extensive-form games; Nash equilibrium; Counterfactual regret minimization; POKER;
D O I
10.1007/978-3-031-20862-1_7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Online Mirror Descent (OMD) is a kind of regret minimization algorithms for Online Convex Optimization (OCO). Recently, they are applied to solve Extensive-Form Games (EFGs) for approximating Nash equilibrium. Especially, optimistic variants of OMD are developed, which have a better theoretical convergence rate compared to common regret minimization algorithms, e.g., Counterfactual Regret Minimization (CFR), for EFGs. However, despite the theoretical advantage, existing OMD and their optimistic variants have been shown to converge to a Nash equilibrium slower than the state-of-the-art (SOTA) CFR variants in practice. The reason for the inferior performance may be that they usually use constant regularizers whose parameters have to be chosen at the beginning. Inspired by the adaptive nature of CFRs, in this paper, an adaptive method is presented to speed up the optimistic variants of OMD. Based on this method, Adaptive Optimistic OMD (Ada-OOMD) for EFGs is proposed. In this algorithm, the regularizers can adapt to real-time regrets, thus the algorithm may converge faster in practice. Experimental results show that Ada-OOMD is at least two orders of magnitude faster than existing optimistic OMD algorithms. In some extensive-form games, such as Kuhn poker and Goofspiel, the convergence speed of Ada-OOMD even exceeds the SOTA CFRs. https://github.com/github-jhc/ada-oomd
引用
收藏
页码:90 / 103
页数:14
相关论文
共 50 条
  • [1] Efficient Phi-Regret Minimization in Extensive-Form Games via Online Mirror Descent
    Bai, Yu
    Jin, Chi
    Mei, Song
    Song, Ziang
    Yu, Tiancheng
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
  • [2] RATIONALITY IN EXTENSIVE-FORM GAMES
    RENY, PJ
    JOURNAL OF ECONOMIC PERSPECTIVES, 1992, 6 (04): : 103 - 118
  • [3] Timeability of Extensive-Form Games
    Jakobsen, Sune K.
    Sorensen, Troels B.
    Conitzer, Vincent
    ITCS'16: PROCEEDINGS OF THE 2016 ACM CONFERENCE ON INNOVATIONS IN THEORETICAL COMPUTER SCIENCE, 2016, : 191 - 199
  • [4] Computational Extensive-Form Games
    Halpern, Joseph Y.
    Pass, Rafael
    Seeman, Lior
    EC'16: PROCEEDINGS OF THE 2016 ACM CONFERENCE ON ECONOMICS AND COMPUTATION, 2016, : 681 - 698
  • [5] Quantum extensive-form games
    Kazuki Ikeda
    Quantum Information Processing, 22
  • [6] Quantum extensive-form games
    Ikeda, Kazuki
    QUANTUM INFORMATION PROCESSING, 2023, 22 (01)
  • [7] Coarse Correlation in Extensive-Form Games
    Farina, Gabriele
    Bianchi, Tommaso
    Sandholm, Tuomas
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 1934 - 1941
  • [8] Strategic negotiations for extensive-form games
    Dave de Jonge
    Dongmo Zhang
    Autonomous Agents and Multi-Agent Systems, 2020, 34
  • [9] Strategic negotiations for extensive-form games
    de Jonge, Dave
    Zhang, Dongmo
    AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS, 2020, 34 (01)
  • [10] Extensive-form games and strategic complementarities
    Echenique, F
    GAMES AND ECONOMIC BEHAVIOR, 2004, 46 (02) : 348 - 364