On convergence rates of game theoretic reinforcement learning algorithms

被引:5
作者
Hu, Zhisheng [1 ]
Zhu, Minghui [1 ]
Chen, Ping [2 ]
Liu, Peng [3 ]
机构
[1] Penn State Univ, Sch Elect Engn & Comp Sci, 201 Old Main, University Pk, PA 16802 USA
[2] BDA, JD Com, 18 Kechuang 11 St, Beijing 10111, Peoples R China
[3] Penn State Univ, Coll Informat Sci & Technol, University Pk, PA 16802 USA
基金
美国国家科学基金会;
关键词
Distributed control; Game theory; Learning algorithms; NASH EQUILIBRIUM SEEKING; BEHAVIOR;
D O I
10.1016/j.automatica.2019.02.032
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper investigates a class of multi-player discrete games where each player aims to maximize its own utility function. Each player does not know the other players' action sets, their deployed actions or the structures of its own or the others' utility functions. Instead, each player only knows its own deployed actions and its received utility values in recent history. We propose a reinforcement learning algorithm which converges to the set of action profiles which have maximal stochastic potential with probability one. Furthermore, an upper bound on the convergence rate is derived and is minimized when the exploration rates are restricted to p-series. The algorithm performance is verified using a case study in the smart grid. (C) 2019 Elsevier Ltd. All rights reserved.
引用
收藏
页码:90 / 101
页数:12
相关论文
共 50 条
  • [31] Distributed joint rate and power control game-theoretic algorithms for wireless data
    Hayajneh, M
    Abdallah, CT
    IEEE COMMUNICATIONS LETTERS, 2004, 8 (08) : 511 - 513
  • [32] An Evolutionary Game Theoretic Perspective on Learning in Multi-Agent Systems
    Karl Tuyls
    Ann Nowe
    Tom Lenaerts
    Bernard Manderick
    Synthese, 2004, 139 : 297 - 330
  • [33] Learning Robust Predictive Control: A Spatial–Temporal Game Theoretic Approach
    Yang, Xindi
    Zhang, Hao
    Wang, Zhuping
    Su, Shun-Feng
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, : 2869 - 2880
  • [34] An evolutionary game theoretic perspective on learning in multi-agent systems
    Tuyls, K
    Nowe, A
    Lenaerts, T
    Manderick, B
    SYNTHESE, 2004, 139 (02) : 297 - 330
  • [35] GAME-THEORETIC LEARNING FOR ACTIVATION OF DIFFUSION LEAST MEAN SQUARES
    Gharehshiran, Omid Namvar
    Krishnamurthy, Vikram
    Yin, George
    2013 PROCEEDINGS OF THE 21ST EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2013,
  • [36] Neighbor intervention: A game-theoretic model
    Mesterton-Gibbons, Mike
    Sherratt, Tom N.
    JOURNAL OF THEORETICAL BIOLOGY, 2009, 256 (02) : 263 - 275
  • [37] Game-Theoretic Communication Structures in Microgrids
    Ekneligoda, Nishantha C.
    Weaver, Wayne W.
    IEEE TRANSACTIONS ON POWER DELIVERY, 2012, 27 (04) : 2334 - 2341
  • [38] A Review of Multi-Agent Reinforcement Learning Algorithms
    Liang, Jiaxin
    Miao, Haotian
    Li, Kai
    Tan, Jianheng
    Wang, Xi
    Luo, Rui
    Jiang, Yueqiu
    ELECTRONICS, 2025, 14 (04):
  • [39] Game Theory and Reinforcement Learning in Cognitive Radar Game Modeling and Algorithm Research: A Review
    He, Bin
    Yang, Ning
    Zhang, Xulong
    Wang, Wenjun
    IEEE SENSORS JOURNAL, 2024, 24 (20) : 31696 - 31711
  • [40] A GAME THEORETIC ANALYSIS OF THE COPS AND ROBBER GAME
    Konstantinidis, Georgios
    JOURNAL OF DYNAMICS AND GAMES, 2014, 1 (04): : 599 - 619