On convergence rates of game theoretic reinforcement learning algorithms

被引:5
|
作者
Hu, Zhisheng [1 ]
Zhu, Minghui [1 ]
Chen, Ping [2 ]
Liu, Peng [3 ]
机构
[1] Penn State Univ, Sch Elect Engn & Comp Sci, 201 Old Main, University Pk, PA 16802 USA
[2] BDA, JD Com, 18 Kechuang 11 St, Beijing 10111, Peoples R China
[3] Penn State Univ, Coll Informat Sci & Technol, University Pk, PA 16802 USA
基金
美国国家科学基金会;
关键词
Distributed control; Game theory; Learning algorithms; NASH EQUILIBRIUM SEEKING; BEHAVIOR;
D O I
10.1016/j.automatica.2019.02.032
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper investigates a class of multi-player discrete games where each player aims to maximize its own utility function. Each player does not know the other players' action sets, their deployed actions or the structures of its own or the others' utility functions. Instead, each player only knows its own deployed actions and its received utility values in recent history. We propose a reinforcement learning algorithm which converges to the set of action profiles which have maximal stochastic potential with probability one. Furthermore, an upper bound on the convergence rate is derived and is minimized when the exploration rates are restricted to p-series. The algorithm performance is verified using a case study in the smart grid. (C) 2019 Elsevier Ltd. All rights reserved.
引用
收藏
页码:90 / 101
页数:12
相关论文
共 50 条
  • [1] Stackelberg Actor-Critic: Game-Theoretic Reinforcement Learning Algorithms
    Zheng, Liyuan
    Fiez, Tanner
    Alumbaugh, Zane
    Chasnov, Benjamin
    Ratliff, Lillian J.
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 9217 - 9224
  • [2] A game theoretic approach to curriculum reinforcement learning
    Smyrnakis, Michalis
    Hoang, Lan
    2022 IEEE 34TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, ICTAI, 2022, : 1212 - 1217
  • [3] Convergence of reinforcement learning algorithms and acceleration of learning
    Potapov, A
    Ali, MK
    PHYSICAL REVIEW E, 2003, 67 (02):
  • [4] CONVERGENCE OF LEARNING ALGORITHMS WITH CONSTANT LEARNING RATES
    KUAN, CM
    HORNIK, K
    IEEE TRANSACTIONS ON NEURAL NETWORKS, 1991, 2 (05): : 484 - 489
  • [5] Robust Reinforcement Learning: A Constrained Game-theoretic Approach
    Yu, Jing
    Gehring, Clement
    Schafer, Florian
    Anandkumar, Animashree
    LEARNING FOR DYNAMICS AND CONTROL, VOL 144, 2021, 144
  • [6] Game Theoretic Reinforcement Learning Framework For Industrial Internet of Things
    Tai Manh Ho
    Kim-Khoa Nguyen
    Cheriet, Mohamed
    2022 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2022, : 2112 - 2117
  • [7] A Unified Game-Theoretic Approach to Multiagent Reinforcement Learning
    Lanctot, Marc
    Zambaldi, Vinicius
    Gruslys, Audrunas
    Lazaridou, Angeliki
    Tuyls, Karl
    Perolat, Julien
    Silver, David
    Graepel, Thore
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
  • [8] The convergence rates of Shannon sampling learning algorithms
    SHENG BaoHuai Department of Mathematics
    Science China(Mathematics), 2012, 55 (06) : 1243 - 1256
  • [9] The convergence rates of Shannon sampling learning algorithms
    BaoHuai Sheng
    Science China Mathematics, 2012, 55 : 1243 - 1256
  • [10] Convergence rates of learning, algorithms, by random projection
    Chen, Di-Rong
    Li, Han
    APPLIED AND COMPUTATIONAL HARMONIC ANALYSIS, 2014, 37 (01) : 36 - 51