A New Multi-Agent Reinforcement Learning Method Based on Evolving Dynamic Correlation Matrix

被引：8

作者：

Gan, Xingli ^{[1
,2
]}

Guo, Hongliang ^{[3
]}

Li, Zhan ^{[3
]}

机构：

[1] China Elect Technol Grp Corp, Res Inst 54, Shijiazhuang 050081, Hebei, Peoples R China

[2] State Key Lab Satellite Nav Syst & Equipment Tech, Shijiazhuang 050000, Hebei, Peoples R China

[3] Univ Elect Sci & Technol China, Sch Automat Engn, Chengdu 610054, Sichuan, Peoples R China

来源：

IEEE ACCESS | 2019年 / 7卷

基金：

中国国家自然科学基金;

关键词：

Reinforcement learning; Heuristic algorithms; Correlation; Evolutionary computation; Convergence; Tuning; Roads; Multi-agent reinforcement learning; dynamic correlation matrix; convergence; meta-parameter evolution;

D O I：

10.1109/ACCESS.2019.2946848

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Multi-agent reinforcement learning approaches can be roughly classified into two categories. One is the agent-based approach which can be implemented in real distributed systems, though most approaches of this type cannot provide meaningful theoretical verifications. The other can be seen as the more formalized approach, which can provide theoretical results. However, most of current algorithms usually require unrealistic global communication, which makes them impractical for real distributed systems. In this article, we propose a dynamic correlation matrix based multi-agent reinforcement learning approach where the meta-parameters are evolved using an evolutionary algorithm. We believe that our approach is able to fill the gap between the two kinds of traditional multi-agent reinforcement learning approaches by providing both agent-level implementation and system-level convergence verification. The basic idea of this approach is that agents learn not only from local environmental feedback, i.e., their own experiences and rewards, but also from other agents experiences. In this way, the agents learning speed can be increased significantly. The performance of the proposed algorithm is demonstrated on a number of application scenarios, including blackjack games, urban traffic control systems and multi-robot foraging.

引用

页码：162127 / 162138

页数：12

共 50 条

[1] Review of multi-agent reinforcement learning based dynamic spectrum allocation method
Song B.
Ye W.
Meng X.
Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics, 2021, 43 (11): : 3338 - 3351
[2] Multi-agent reinforcement learning based textile dyeing workshop dynamic scheduling method
He J.
Zhang J.
Zhang P.
Zheng P.
Wang M.
Jisuanji Jicheng Zhizao Xitong/Computer Integrated Manufacturing Systems, CIMS, 2023, 29 (01): : 61 - 74
[3] Train rescheduling method based on multi-agent reinforcement learning
Cao, Yuli
Xu, Zhongwei
Mei, Meng
2022 IEEE 6TH ADVANCED INFORMATION TECHNOLOGY, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (IAEAC), 2022, : 301 - 305
[4] Dynamic Arterial Coordinated Control Based on Multi-agent Reinforcement Learning
Fang, Liangliang
Zhang, Weibin
PROCEEDINGS OF THE 33RD CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2021), 2021, : 2716 - 2721
[5] Multi-Agent Reinforcement Learning for Dynamic Spectrum Access
Jiang, Huijuan
Wang, Tianyu
Wang, Shaowei
ICC 2019 - 2019 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2019,
[6] Multi-Agent Hierarchical Reinforcement Learning with Dynamic Termination
Han, Dongge
Boehmer, Wendelin
Wooldridge, Michael
Rogers, Alex
AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2019, : 2006 - 2008
[7] Dynamic Multi-Agent Reinforcement Learning for Control Optimization
Fagan, Derek
Meier, Rene
PROCEEDINGS FIFTH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS, MODELLING AND SIMULATION, 2014, : 99 - 104
[8] Multi-agent Hierarchical Reinforcement Learning with Dynamic Termination
Han, Dongge
Bohmer, Wendelin
Wooldridge, Michael
Rogers, Alex
PRICAI 2019: TRENDS IN ARTIFICIAL INTELLIGENCE, PT II, 2019, 11671 : 80 - 92
[9] Multi-Agent Reinforcement Learning in Dynamic Industrial Context
Zhang, Hongyi
Li, Jingya
Qi, Zhiqiang
Aronsson, Anders
Bosch, Jan
Olsson, Helena Holmstrom
2023 IEEE 47TH ANNUAL COMPUTERS, SOFTWARE, AND APPLICATIONS CONFERENCE, COMPSAC, 2023, : 448 - 457
[10] Reinforcement learning based on multi-agent in RoboCup
Zhang, W
Li, JG
Ruan, XG
ADVANCES IN INTELLIGENT COMPUTING, PT 1, PROCEEDINGS, 2005, 3644 : 967 - 975

← 1 2 3 4 5 →