On convergence rates of game theoretic reinforcement learning algorithms

Cited by: 5

Authors:

Hu, Zhisheng [1]

Zhu, Minghui [1]

Chen, Ping [2]

Liu, Peng [3]

Affiliations:

[1] Penn State Univ, Sch Elect Engn & Comp Sci, 201 Old Main, University Pk, PA 16802 USA

[2] BDA, JD Com, 18 Kechuang 11 St, Beijing 10111, Peoples R China

[3] Penn State Univ, Coll Informat Sci & Technol, University Pk, PA 16802 USA

Source:

AUTOMATICA | 2019 / Vol. 104

Funding:

U.S. National Science Foundation;

Keywords:

Distributed control; Game theory; Learning algorithms; NASH EQUILIBRIUM SEEKING; BEHAVIOR;

DOI:

10.1016/j.automatica.2019.02.032

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Subject classification code:

0812 ;

Abstract:

This paper investigates a class of multi-player discrete games where each player aims to maximize its own utility function. Each player does not know the other players' action sets, their deployed actions or the structures of its own or the others' utility functions. Instead, each player only knows its own deployed actions and its received utility values in recent history. We propose a reinforcement learning algorithm which converges to the set of action profiles which have maximal stochastic potential with probability one. Furthermore, an upper bound on the convergence rate is derived and is minimized when the exploration rates are restricted to p-series. The algorithm performance is verified using a case study in the smart grid. (C) 2019 Elsevier Ltd. All rights reserved.

Pages: 90-101

Number of pages: 12

50 records in total

[1] Stackelberg Actor-Critic: Game-Theoretic Reinforcement Learning Algorithms
Zheng, Liyuan
Fiez, Tanner
Alumbaugh, Zane
Chasnov, Benjamin
Ratliff, Lillian J.
THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELFTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 9217 - 9224
[2] A game theoretic approach to curriculum reinforcement learning
Smyrnakis, Michalis
Hoang, Lan
2022 IEEE 34TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, ICTAI, 2022, : 1212 - 1217
[3] Convergence of reinforcement learning algorithms and acceleration of learning
Potapov, A
Ali, MK
PHYSICAL REVIEW E, 2003, 67 (02):
[4] CONVERGENCE OF LEARNING ALGORITHMS WITH CONSTANT LEARNING RATES
KUAN, CM
HORNIK, K
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1991, 2 (05): : 484 - 489
[5] Robust Reinforcement Learning: A Constrained Game-theoretic Approach
Yu, Jing
Gehring, Clement
Schafer, Florian
Anandkumar, Animashree
LEARNING FOR DYNAMICS AND CONTROL, VOL 144, 2021, 144
[6] Game Theoretic Reinforcement Learning Framework For Industrial Internet of Things
Tai Manh Ho
Kim-Khoa Nguyen
Cheriet, Mohamed
2022 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2022, : 2112 - 2117
[7] A Unified Game-Theoretic Approach to Multiagent Reinforcement Learning
Lanctot, Marc
Zambaldi, Vinicius
Gruslys, Audrunas
Lazaridou, Angeliki
Tuyls, Karl
Perolat, Julien
Silver, David
Graepel, Thore
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
[8] The convergence rates of Shannon sampling learning algorithms
Sheng, BaoHuai
Science China(Mathematics), 2012, 55 (06) : 1243 - 1256
[9] The convergence rates of Shannon sampling learning algorithms
BaoHuai Sheng
Science China Mathematics, 2012, 55 : 1243 - 1256
[10] Convergence rates of learning algorithms by random projection
Chen, Di-Rong
Li, Han
APPLIED AND COMPUTATIONAL HARMONIC ANALYSIS, 2014, 37 (01) : 36 - 51