On convergence rates of game theoretic reinforcement learning algorithms

被引：5

作者：

Hu, Zhisheng ^{[1
]}

Zhu, Minghui ^{[1
]}

Chen, Ping ^{[2
]}

Liu, Peng ^{[3
]}

机构：

[1] Penn State Univ, Sch Elect Engn & Comp Sci, 201 Old Main, University Pk, PA 16802 USA

[2] BDA, JD Com, 18 Kechuang 11 St, Beijing 10111, Peoples R China

[3] Penn State Univ, Coll Informat Sci & Technol, University Pk, PA 16802 USA

来源：

AUTOMATICA | 2019年 / 104卷

基金：

美国国家科学基金会;

关键词：

Distributed control; Game theory; Learning algorithms; NASH EQUILIBRIUM SEEKING; BEHAVIOR;

D O I：

10.1016/j.automatica.2019.02.032

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper investigates a class of multi-player discrete games where each player aims to maximize its own utility function. Each player does not know the other players' action sets, their deployed actions or the structures of its own or the others' utility functions. Instead, each player only knows its own deployed actions and its received utility values in recent history. We propose a reinforcement learning algorithm which converges to the set of action profiles which have maximal stochastic potential with probability one. Furthermore, an upper bound on the convergence rate is derived and is minimized when the exploration rates are restricted to p-series. The algorithm performance is verified using a case study in the smart grid. (C) 2019 Elsevier Ltd. All rights reserved.

引用

页码：90 / 101

页数：12

共 50 条

[31] Distributed joint rate and power control game-theoretic algorithms for wireless data
Hayajneh, M
Abdallah, CT
IEEE COMMUNICATIONS LETTERS, 2004, 8 (08) : 511 - 513
[32] An Evolutionary Game Theoretic Perspective on Learning in Multi-Agent Systems
Karl Tuyls
Ann Nowe
Tom Lenaerts
Bernard Manderick
Synthese, 2004, 139 : 297 - 330
[33] Learning Robust Predictive Control: A Spatial–Temporal Game Theoretic Approach
Yang, Xindi
Zhang, Hao
Wang, Zhuping
Su, Shun-Feng
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, : 2869 - 2880
[34] An evolutionary game theoretic perspective on learning in multi-agent systems
Tuyls, K
Nowe, A
Lenaerts, T
Manderick, B
SYNTHESE, 2004, 139 (02) : 297 - 330
[35] GAME-THEORETIC LEARNING FOR ACTIVATION OF DIFFUSION LEAST MEAN SQUARES
Gharehshiran, Omid Namvar
Krishnamurthy, Vikram
Yin, George
2013 PROCEEDINGS OF THE 21ST EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2013,
[36] Neighbor intervention: A game-theoretic model
Mesterton-Gibbons, Mike
Sherratt, Tom N.
JOURNAL OF THEORETICAL BIOLOGY, 2009, 256 (02) : 263 - 275
[37] Game-Theoretic Communication Structures in Microgrids
Ekneligoda, Nishantha C.
Weaver, Wayne W.
IEEE TRANSACTIONS ON POWER DELIVERY, 2012, 27 (04) : 2334 - 2341
[38] A Review of Multi-Agent Reinforcement Learning Algorithms
Liang, Jiaxin
Miao, Haotian
Li, Kai
Tan, Jianheng
Wang, Xi
Luo, Rui
Jiang, Yueqiu
ELECTRONICS, 2025, 14 (04):
[39] Game Theory and Reinforcement Learning in Cognitive Radar Game Modeling and Algorithm Research: A Review
He, Bin
Yang, Ning
Zhang, Xulong
Wang, Wenjun
IEEE SENSORS JOURNAL, 2024, 24 (20) : 31696 - 31711
[40] A GAME THEORETIC ANALYSIS OF THE COPS AND ROBBER GAME
Konstantinidis, Georgios
JOURNAL OF DYNAMICS AND GAMES, 2014, 1 (04): : 599 - 619

← 1 2 3 4 5 →