On convergence rates of game theoretic reinforcement learning algorithms

Cited by: 5

Authors:

Hu, Zhisheng [1]

Zhu, Minghui [1]

Chen, Ping [2]

Liu, Peng [3]

Affiliations:

[1] Penn State Univ, Sch Elect Engn & Comp Sci, 201 Old Main, University Pk, PA 16802 USA

[2] BDA, JD Com, 18 Kechuang 11 St, Beijing 10111, Peoples R China

[3] Penn State Univ, Coll Informat Sci & Technol, University Pk, PA 16802 USA

Source:

AUTOMATICA | 2019 / Vol. 104

Funding:

National Science Foundation (USA);

Keywords:

Distributed control; Game theory; Learning algorithms; Nash equilibrium seeking; Behavior;

DOI:

10.1016/j.automatica.2019.02.032

Chinese Library Classification:

TP [Automation technology, computer technology];

Discipline classification code:

0812;

Abstract:

This paper investigates a class of multi-player discrete games where each player aims to maximize its own utility function. Each player does not know the other players' action sets, their deployed actions or the structures of its own or the others' utility functions. Instead, each player only knows its own deployed actions and its received utility values in recent history. We propose a reinforcement learning algorithm which converges to the set of action profiles which have maximal stochastic potential with probability one. Furthermore, an upper bound on the convergence rate is derived and is minimized when the exploration rates are restricted to p-series. The algorithm performance is verified using a case study in the smart grid. (C) 2019 Elsevier Ltd. All rights reserved.
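The information structure described in the abstract (each player sees only its own action and its own received utility) and the idea of a decaying exploration rate can be illustrated with a small sketch. This is not the paper's algorithm: the update rule, the two-player matching game, and all names (`exploration_rate`, `utilities`, `simulate`) are hypothetical, and the sketch assumes that "p-series" refers to exploration rates of the form 1/t^p.

```python
import random


def exploration_rate(t, p=0.5):
    """Assumed p-series form: epsilon(t) = 1 / t**p for t >= 1."""
    return 1.0 / (t ** p)


def utilities(actions):
    """Hypothetical game: both players get 1.0 if their actions match, else 0.0.

    The players never inspect this function; they only see their own payoff.
    """
    u = 1.0 if actions[0] == actions[1] else 0.0
    return [u, u]


def simulate(steps=2000, n_actions=3, p=0.5, seed=0):
    """Generic payoff-based learning loop (illustrative, not the paper's method)."""
    rng = random.Random(seed)
    n_players = 2
    # Each player stores only its own baseline action and last utility for it.
    baseline = [rng.randrange(n_actions) for _ in range(n_players)]
    baseline_u = utilities(baseline)
    for t in range(1, steps + 1):
        eps = exploration_rate(t, p)
        # With probability eps a player tries a random action, else its baseline.
        played = [
            rng.randrange(n_actions) if rng.random() < eps else baseline[i]
            for i in range(n_players)
        ]
        received = utilities(played)
        for i in range(n_players):
            # Refresh the stored utility when the baseline was replayed, or
            # adopt the explored action if it paid strictly more.
            if played[i] == baseline[i] or received[i] > baseline_u[i]:
                baseline[i] = played[i]
                baseline_u[i] = received[i]
    return baseline


if __name__ == "__main__":
    print(simulate())
```

A larger `p` makes exploration die out faster, which trades off how quickly play settles against how thoroughly the action space is sampled; the paper's bound concerns that trade-off for its own algorithm, not for this sketch.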


Pages: 90-101

Page count: 12

