On convergence rates of game theoretic reinforcement learning algorithms

Cited by: 5
Authors
Hu, Zhisheng [1 ]
Zhu, Minghui [1 ]
Chen, Ping [2 ]
Liu, Peng [3 ]
Affiliations
[1] Penn State Univ, Sch Elect Engn & Comp Sci, 201 Old Main, University Pk, PA 16802 USA
[2] BDA, JD Com, 18 Kechuang 11 St, Beijing 10111, Peoples R China
[3] Penn State Univ, Coll Informat Sci & Technol, University Pk, PA 16802 USA
Funding
US National Science Foundation;
Keywords
Distributed control; Game theory; Learning algorithms; Nash equilibrium seeking; Behavior;
DOI
10.1016/j.automatica.2019.02.032
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
This paper investigates a class of multi-player discrete games where each player aims to maximize its own utility function. Each player does not know the other players' action sets, their deployed actions or the structures of its own or the others' utility functions. Instead, each player only knows its own deployed actions and its received utility values in recent history. We propose a reinforcement learning algorithm which converges to the set of action profiles which have maximal stochastic potential with probability one. Furthermore, an upper bound on the convergence rate is derived and is minimized when the exploration rates are restricted to p-series. The algorithm performance is verified using a case study in the smart grid. (C) 2019 Elsevier Ltd. All rights reserved.
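The setting in the abstract can be illustrated with a minimal sketch. This is not the paper's algorithm: the two-action coordination game, the `Player` class, the "keep the better of the last two actions" rule, and the exploration rate `t ** (-p)` (one reading of "p-series") are all assumptions made for illustration. The only property it borrows from the abstract is that each player sees just its own actions and received utility values from recent history.

```python
import random


def exploration_rate(t, p=0.5):
    """Assumed p-series exploration rate: eps(t) = t^(-p) for t >= 1."""
    return t ** (-p)


def utility(a_self, a_other):
    """Hypothetical coordination game, unknown to the players themselves."""
    if a_self != a_other:
        return 0.0
    return 2.0 if a_self == 1 else 1.0


class Player:
    """Knows only its own recent actions and the utility values it received."""

    def __init__(self, actions, rng):
        self.actions = list(actions)
        self.rng = rng
        self.history = []  # at most the two most recent (action, utility) pairs

    def act(self, t, p):
        # Explore with probability eps(t), or while history is still too short.
        if len(self.history) < 2 or self.rng.random() < exploration_rate(t, p):
            return self.rng.choice(self.actions)
        # Otherwise replay whichever recent action earned the higher utility.
        return max(self.history, key=lambda pair: pair[1])[0]

    def observe(self, action, value):
        self.history = (self.history + [(action, value)])[-2:]


def simulate(steps=2000, p=0.5, seed=0):
    """Run two players and return the final action profile."""
    rng = random.Random(seed)
    players = [Player([0, 1], rng), Player([0, 1], rng)]
    profile = None
    for t in range(1, steps + 1):
        a0 = players[0].act(t, p)
        a1 = players[1].act(t, p)
        players[0].observe(a0, utility(a0, a1))
        players[1].observe(a1, utility(a1, a0))
        profile = (a0, a1)
    return profile
```

Calling `simulate(steps=2000, p=0.5, seed=0)` returns the last action profile played. Because exploration decays with `t`, later profiles are increasingly driven by the utilities each player has observed, but this toy makes no claim about convergence or its rate.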
Pages: 90-101
Page count: 12
Related papers
50 in total
  • [21] Adaptive strategy optimization in game-theoretic paradigm using reinforcement learning
    Cheong, Kang Hao
    Zhao, Jie
    Physical Review Research, 6 (03):
  • [22] On the role of reinforcement learning in experimental games: The cognitive game-theoretic approach
    Erev, I
    Roth, AE
    GAMES AND HUMAN BEHAVIOR: ESSAYS IN HONOR OF AMNON RAPOPORT, 1999, : 53 - 77
  • [23] A Game-Theoretic Framework with Reinforcement Learning for Multinode Cooperation in Wireless Networks
    Baidas, Mohammed W.
    2013 IEEE 24TH INTERNATIONAL SYMPOSIUM ON PERSONAL, INDOOR, AND MOBILE RADIO COMMUNICATIONS (PIMRC), 2013, : 981 - 986
  • [24] Reinforcement Learning Based Incentive Mechanism for Federated Meta Learning: A Game-Theoretic Perspective
    Zhang, Shenglv
    Zhou, Yuren
    Qu, Haohao
    Zhu, Yiting
    You, Linlin
    2022 IEEE 34TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, ICTAI, 2022, : 1152 - 1159
  • [25] Parallelization of game theoretic centrality algorithms
    Sankar, M. Vishnu
    Ravindran, Balaraman
    SADHANA-ACADEMY PROCEEDINGS IN ENGINEERING SCIENCES, 2015, 40 (06): : 1821 - 1843
  • [26] Comparing reinforcement learning approaches for solving game theoretic models: a dynamic airline pricing game example
    Collins, A.
    Thomas, L.
    JOURNAL OF THE OPERATIONAL RESEARCH SOCIETY, 2012, 63 (08) : 1165 - 1173
  • [27] A general class of no-regret learning algorithms and game-theoretic equilibria
    Greenwald, A
    Jafari, A
    LEARNING THEORY AND KERNEL MACHINES, 2003, 2777 : 2 - 12
  • [28] Improving multi-robot coordination by game-theoretic learning algorithms
    Smyrnakis, Michalis
    Qu, Hongyang
    Veres, Sandor
    2017 IEEE 29TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2017), 2017, : 417 - 424
  • [29] Improving Multi-Robot Coordination by Game-Theoretic Learning Algorithms
    Smyrnakis, Michalis
    Qu, Hongyang
    Veres, Sandor M.
    INTERNATIONAL JOURNAL ON ARTIFICIAL INTELLIGENCE TOOLS, 2018, 27 (07)
  • [30] Innovative Application of Computer Game Algorithms of Surakarta Based on Reinforcement Learning
    Tao, Jun
    Wu, Gui
    Zeng, Peng
    2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2023, : 2312 - 2315