On convergence rates of game theoretic reinforcement learning algorithms

被引:5
作者
Hu, Zhisheng [1 ]
Zhu, Minghui [1 ]
Chen, Ping [2 ]
Liu, Peng [3 ]
机构
[1] Penn State Univ, Sch Elect Engn & Comp Sci, 201 Old Main, University Pk, PA 16802 USA
[2] BDA, JD Com, 18 Kechuang 11 St, Beijing 10111, Peoples R China
[3] Penn State Univ, Coll Informat Sci & Technol, University Pk, PA 16802 USA
基金
美国国家科学基金会;
关键词
Distributed control; Game theory; Learning algorithms; NASH EQUILIBRIUM SEEKING; BEHAVIOR;
D O I
10.1016/j.automatica.2019.02.032
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper investigates a class of multi-player discrete games where each player aims to maximize its own utility function. Each player does not know the other players' action sets, their deployed actions or the structures of its own or the others' utility functions. Instead, each player only knows its own deployed actions and its received utility values in recent history. We propose a reinforcement learning algorithm which converges to the set of action profiles which have maximal stochastic potential with probability one. Furthermore, an upper bound on the convergence rate is derived and is minimized when the exploration rates are restricted to p-series. The algorithm performance is verified using a case study in the smart grid. (C) 2019 Elsevier Ltd. All rights reserved.
引用
收藏
页码:90 / 101
页数:12
相关论文
共 50 条
  • [41] A Novel Scheduling Algorithm based on Game Theory and Reinforcement Learning
    Zou Wensheng
    [J]. MECHANICAL, MATERIALS AND MANUFACTURING ENGINEERING, PTS 1-3, 2011, 66-68 : 1948 - 1953
  • [42] A Multistage Game in Smart Grid Security: A Reinforcement Learning Solution
    Ni, Zhen
    Paul, Shuva
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2019, 30 (09) : 2684 - 2695
  • [43] Reinforcement learning and decision making in monkeys during a competitive game
    Lee, D
    Conroy, ML
    McGreevy, BP
    Barraclough, DJ
    [J]. COGNITIVE BRAIN RESEARCH, 2004, 22 (01): : 45 - 58
  • [44] Dealer markets: A reinforcement learning mean field game approach
    Bernasconi, Martino
    Vittori, E.
    Trovo, F.
    Restelli, M.
    [J]. NORTH AMERICAN JOURNAL OF ECONOMICS AND FINANCE, 2023, 68
  • [45] Efficient wireless packet scheduling in a non-cooperative environment: Game theoretic analysis and algorithms
    Kong, Zhen
    Kwok, Yu-Kwong
    [J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2010, 70 (08) : 790 - 799
  • [46] Maximizing QoS in Heterogeneous Wireless Sensor Networks Using Game Theory and Learning Algorithms
    El Hammouti, Hajar
    Echabbi, Loubna
    Ben Maissa, Yann
    [J]. ADVANCES IN UBIQUITOUS NETWORKING, 2016, 366 : 225 - 236
  • [47] Game Theoretic Distributed Power Control Algorithms for Uplink Wireless Data in Flat Fading Channels
    Hayajneh, Mohammad
    Abdallah, Chaouki
    [J]. INTERNATIONAL JOURNAL OF COMPUTERS COMMUNICATIONS & CONTROL, 2015, 10 (04) : 520 - 538
  • [48] Game Theoretic and Auction-based Algorithms towards Opportunistic Communications in LPWA LoRa Networks
    Haghighi, Mo
    Qin, Zhijin
    Carboni, Davide
    Adeel, Usman
    Shi, Fengrui
    McCann, Julie A.
    [J]. 2016 IEEE 3RD WORLD FORUM ON INTERNET OF THINGS (WF-IOT), 2016, : 735 - 740
  • [49] Hierarchical Game-Theoretic and Reinforcement Learning Framework for Computational Offloading in UAV-Enabled Mobile Edge Computing Networks With Multiple Service Providers
    Asheralieva, Alia
    Niyato, Dusit
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2019, 6 (05): : 8753 - 8769
  • [50] Validating Game-Theoretic Models of Terrorism: Insights from Machine Learning
    Bang, James T.
    Basuchoudhary, Atin
    Mitra, Aniruddha
    [J]. GAMES, 2021, 12 (03):