A leader-following paradigm based deep reinforcement learning method for multi-agent cooperation games

被引：10

作者：

Zhang, Feiye ^{[1
]}

Yang, Qingyu ^{[1
,2
]}

An, Dou ^{[1
,2
]}

机构：

[1] Xi An Jiao Tong Univ, Fac Elect & Informat Engn, 28, West Xianning Rd, Xian 710049, Shaanxi, Peoples R China

[2] Xi An Jiao Tong Univ, MOE Key Lab Intelligent Networks & Network Secur, 28, West Xianning Rd, Xian 710049, Shaanxi, Peoples R China

来源：

NEURAL NETWORKS | 2022年 / 156卷

基金：

美国国家科学基金会; 中国国家自然科学基金;

关键词：

Multi-agent systems; Deep reinforcement learning; Centralized training with decentralized; execution; Cooperative games; LEVEL;

D O I：

10.1016/j.neunet.2022.09.012

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Multi-agent deep reinforcement learning algorithms with centralized training with decentralized execution (CTDE) paradigm has attracted growing attention in both industry and research community. However, the existing CTDE methods follow the action selection paradigm that all agents choose actions at the same time, which ignores the heterogeneous roles of different agents. Motivated by the human wisdom in cooperative behaviors, we present a novel leader-following paradigm based deep multi-agent cooperation method (LFMCO) for multi-agent cooperative games. Specifically, we define a leader as someone who broadcasts a message representing the selected action to all subordinates. After that, the followers choose their individual action based on the received message from the leader. To measure the influence of leader's action on followers, we introduced a concept of information gain, i.e., the change of followers' value function entropy, which is positively correlated with the influence of leader's action. We evaluate the LFMCO on several cooperation scenarios of StarCraft2. Simulation results confirm the significant performance improvements of LFMCO compared with four state-of-the-art benchmarks on the challenging cooperative environment.(c) 2022 Elsevier Ltd. All rights reserved.

引用

页码：1 / 12

页数：12

共 50 条

[21] Leader-following consensus of nonlinear multi-agent systems based on position and velocity estimations
Mohammadi, Mahsa
Baradarannia, Mahdi
Farzamnia, Ali
2021 7TH INTERNATIONAL CONFERENCE ON CONTROL, INSTRUMENTATION AND AUTOMATION (ICCIA), 2021, : 240 - 246
[22] Observer-based leader-following consensus of uncertain nonlinear multi-agent systems
Shi, P.
Shen, Q. K.
INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2017, 27 (17) : 3794 - 3811
[23] Leader-following consensus protocols for formation control of multi-agent network
Xiaoyuan Luo 1
2.Institute of Electrical Engineering
Journal of Systems Engineering and Electronics, 2011, 22 (06) : 991 - 997
[24] Noise-induced consensus of leader-following multi-agent systems
Li, Wang
Dai, Haifeng
Zhao, Lingzhi
Zhao, Donghua
Sun, Yongzheng
MATHEMATICS AND COMPUTERS IN SIMULATION, 2023, 203 : 1 - 11
[25] Containment Tracking of Leader-following Multi-agent Systems with Measurement Noise
Li Wuquan
Xie Lihua
Zhang Ji-Feng
2013 32ND CHINESE CONTROL CONFERENCE (CCC), 2013, : 7330 - 7334
[26] Leader-following consensus of second-order multi-agent systems with a smart leader
Liu, Honggang
Liu, Zhongxin
Chen, Zengqiang
PROCEEDINGS OF THE 36TH CHINESE CONTROL CONFERENCE (CCC 2017), 2017, : 8090 - 8095
[27] Regional Leader-following Consensus of Multi-agent Systems with Saturating Actuators
Li, Yuanlong
Lin, Zongli
PROCEEDINGS OF THE 36TH CHINESE CONTROL CONFERENCE (CCC 2017), 2017, : 8401 - 8406
[28] Quantized impulsive consensus of nonlinear Leader-following multi-agent systems
Li, Gang
Jiang, Xiaowei
You, Le
Zhang, Xianhe
2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 4557 - 4561
[29] Leader-following consensus of heterogeneous multi-agent systems with input delays
Chen, Jie
Guan, Zhi-Hong
Zhang, Ding-Xue
Ding, Li
2013 32ND CHINESE CONTROL CONFERENCE (CCC), 2013, : 6815 - 6819
[30] Leader-Following Output Formation Control of Heterogeneous Multi-Agent Systems
Wang, Jian
Shi, Liangren
2019 CHINESE AUTOMATION CONGRESS (CAC2019), 2019, : 2689 - 2694

← 1 2 3 4 5 →