On convergence rates of game theoretic reinforcement learning algorithms

被引：5

作者：

Hu, Zhisheng ^{[1
]}

Zhu, Minghui ^{[1
]}

Chen, Ping ^{[2
]}

Liu, Peng ^{[3
]}

机构：

[1] Penn State Univ, Sch Elect Engn & Comp Sci, 201 Old Main, University Pk, PA 16802 USA

[2] BDA, JD Com, 18 Kechuang 11 St, Beijing 10111, Peoples R China

[3] Penn State Univ, Coll Informat Sci & Technol, University Pk, PA 16802 USA

来源：

AUTOMATICA | 2019年 / 104卷

基金：

美国国家科学基金会;

关键词：

Distributed control; Game theory; Learning algorithms; NASH EQUILIBRIUM SEEKING; BEHAVIOR;

D O I：

10.1016/j.automatica.2019.02.032

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper investigates a class of multi-player discrete games where each player aims to maximize its own utility function. Each player does not know the other players' action sets, their deployed actions or the structures of its own or the others' utility functions. Instead, each player only knows its own deployed actions and its received utility values in recent history. We propose a reinforcement learning algorithm which converges to the set of action profiles which have maximal stochastic potential with probability one. Furthermore, an upper bound on the convergence rate is derived and is minimized when the exploration rates are restricted to p-series. The algorithm performance is verified using a case study in the smart grid. (C) 2019 Elsevier Ltd. All rights reserved.

引用

页码：90 / 101

页数：12

共 50 条

[41] A Novel Scheduling Algorithm based on Game Theory and Reinforcement Learning
Zou Wensheng
[J]. MECHANICAL, MATERIALS AND MANUFACTURING ENGINEERING, PTS 1-3, 2011, 66-68 : 1948 - 1953
[42] A Multistage Game in Smart Grid Security: A Reinforcement Learning Solution
Ni, Zhen
Paul, Shuva
[J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2019, 30 (09) : 2684 - 2695
[43] Reinforcement learning and decision making in monkeys during a competitive game
Lee, D
Conroy, ML
McGreevy, BP
Barraclough, DJ
[J]. COGNITIVE BRAIN RESEARCH, 2004, 22 (01): : 45 - 58
[44] Dealer markets: A reinforcement learning mean field game approach
Bernasconi, Martino
Vittori, E.
Trovo, F.
Restelli, M.
[J]. NORTH AMERICAN JOURNAL OF ECONOMICS AND FINANCE, 2023, 68
[45] Efficient wireless packet scheduling in a non-cooperative environment: Game theoretic analysis and algorithms
Kong, Zhen
Kwok, Yu-Kwong
[J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2010, 70 (08) : 790 - 799
[46] Maximizing QoS in Heterogeneous Wireless Sensor Networks Using Game Theory and Learning Algorithms
El Hammouti, Hajar
Echabbi, Loubna
Ben Maissa, Yann
[J]. ADVANCES IN UBIQUITOUS NETWORKING, 2016, 366 : 225 - 236
[47] Game Theoretic Distributed Power Control Algorithms for Uplink Wireless Data in Flat Fading Channels
Hayajneh, Mohammad
Abdallah, Chaouki
[J]. INTERNATIONAL JOURNAL OF COMPUTERS COMMUNICATIONS & CONTROL, 2015, 10 (04) : 520 - 538
[48] Game Theoretic and Auction-based Algorithms towards Opportunistic Communications in LPWA LoRa Networks
Haghighi, Mo
Qin, Zhijin
Carboni, Davide
Adeel, Usman
Shi, Fengrui
McCann, Julie A.
[J]. 2016 IEEE 3RD WORLD FORUM ON INTERNET OF THINGS (WF-IOT), 2016, : 735 - 740
[49] Hierarchical Game-Theoretic and Reinforcement Learning Framework for Computational Offloading in UAV-Enabled Mobile Edge Computing Networks With Multiple Service Providers
Asheralieva, Alia
Niyato, Dusit
[J]. IEEE INTERNET OF THINGS JOURNAL, 2019, 6 (05): : 8753 - 8769
[50] Validating Game-Theoretic Models of Terrorism: Insights from Machine Learning
Bang, James T.
Basuchoudhary, Atin
Mitra, Aniruddha
[J]. GAMES, 2021, 12 (03):

← 1 2 3 4 5 →