A New Multi-Agent Reinforcement Learning Method Based on Evolving Dynamic Correlation Matrix

被引:8
|
作者
Gan, Xingli [1 ,2 ]
Guo, Hongliang [3 ]
Li, Zhan [3 ]
机构
[1] China Elect Technol Grp Corp, Res Inst 54, Shijiazhuang 050081, Hebei, Peoples R China
[2] State Key Lab Satellite Nav Syst & Equipment Tech, Shijiazhuang 050000, Hebei, Peoples R China
[3] Univ Elect Sci & Technol China, Sch Automat Engn, Chengdu 610054, Sichuan, Peoples R China
基金
中国国家自然科学基金;
关键词
Reinforcement learning; Heuristic algorithms; Correlation; Evolutionary computation; Convergence; Tuning; Roads; Multi-agent reinforcement learning; dynamic correlation matrix; convergence; meta-parameter evolution;
D O I
10.1109/ACCESS.2019.2946848
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Multi-agent reinforcement learning approaches can be roughly classified into two categories. One is the agent-based approach which can be implemented in real distributed systems, though most approaches of this type cannot provide meaningful theoretical verifications. The other can be seen as the more formalized approach, which can provide theoretical results. However, most of current algorithms usually require unrealistic global communication, which makes them impractical for real distributed systems. In this article, we propose a dynamic correlation matrix based multi-agent reinforcement learning approach where the meta-parameters are evolved using an evolutionary algorithm. We believe that our approach is able to fill the gap between the two kinds of traditional multi-agent reinforcement learning approaches by providing both agent-level implementation and system-level convergence verification. The basic idea of this approach is that agents learn not only from local environmental feedback, i.e., their own experiences and rewards, but also from other agents experiences. In this way, the agents learning speed can be increased significantly. The performance of the proposed algorithm is demonstrated on a number of application scenarios, including blackjack games, urban traffic control systems and multi-robot foraging.
引用
收藏
页码:162127 / 162138
页数:12
相关论文
共 50 条
  • [1] Review of multi-agent reinforcement learning based dynamic spectrum allocation method
    Song B.
    Ye W.
    Meng X.
    Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics, 2021, 43 (11): : 3338 - 3351
  • [2] Multi-agent reinforcement learning based textile dyeing workshop dynamic scheduling method
    He J.
    Zhang J.
    Zhang P.
    Zheng P.
    Wang M.
    Jisuanji Jicheng Zhizao Xitong/Computer Integrated Manufacturing Systems, CIMS, 2023, 29 (01): : 61 - 74
  • [3] Train rescheduling method based on multi-agent reinforcement learning
    Cao, Yuli
    Xu, Zhongwei
    Mei, Meng
    2022 IEEE 6TH ADVANCED INFORMATION TECHNOLOGY, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (IAEAC), 2022, : 301 - 305
  • [4] Dynamic Arterial Coordinated Control Based on Multi-agent Reinforcement Learning
    Fang, Liangliang
    Zhang, Weibin
    PROCEEDINGS OF THE 33RD CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2021), 2021, : 2716 - 2721
  • [5] Multi-Agent Reinforcement Learning for Dynamic Spectrum Access
    Jiang, Huijuan
    Wang, Tianyu
    Wang, Shaowei
    ICC 2019 - 2019 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2019,
  • [6] Multi-Agent Hierarchical Reinforcement Learning with Dynamic Termination
    Han, Dongge
    Boehmer, Wendelin
    Wooldridge, Michael
    Rogers, Alex
    AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2019, : 2006 - 2008
  • [7] Dynamic Multi-Agent Reinforcement Learning for Control Optimization
    Fagan, Derek
    Meier, Rene
    PROCEEDINGS FIFTH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS, MODELLING AND SIMULATION, 2014, : 99 - 104
  • [8] Multi-agent Hierarchical Reinforcement Learning with Dynamic Termination
    Han, Dongge
    Bohmer, Wendelin
    Wooldridge, Michael
    Rogers, Alex
    PRICAI 2019: TRENDS IN ARTIFICIAL INTELLIGENCE, PT II, 2019, 11671 : 80 - 92
  • [9] Multi-Agent Reinforcement Learning in Dynamic Industrial Context
    Zhang, Hongyi
    Li, Jingya
    Qi, Zhiqiang
    Aronsson, Anders
    Bosch, Jan
    Olsson, Helena Holmstrom
    2023 IEEE 47TH ANNUAL COMPUTERS, SOFTWARE, AND APPLICATIONS CONFERENCE, COMPSAC, 2023, : 448 - 457
  • [10] Reinforcement learning based on multi-agent in RoboCup
    Zhang, W
    Li, JG
    Ruan, XG
    ADVANCES IN INTELLIGENT COMPUTING, PT 1, PROCEEDINGS, 2005, 3644 : 967 - 975