A leader-following paradigm based deep reinforcement learning method for multi-agent cooperation games

被引:10
|
作者
Zhang, Feiye [1 ]
Yang, Qingyu [1 ,2 ]
An, Dou [1 ,2 ]
机构
[1] Xi An Jiao Tong Univ, Fac Elect & Informat Engn, 28, West Xianning Rd, Xian 710049, Shaanxi, Peoples R China
[2] Xi An Jiao Tong Univ, MOE Key Lab Intelligent Networks & Network Secur, 28, West Xianning Rd, Xian 710049, Shaanxi, Peoples R China
基金
美国国家科学基金会; 中国国家自然科学基金;
关键词
Multi-agent systems; Deep reinforcement learning; Centralized training with decentralized; execution; Cooperative games; LEVEL;
D O I
10.1016/j.neunet.2022.09.012
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multi-agent deep reinforcement learning algorithms with centralized training with decentralized execution (CTDE) paradigm has attracted growing attention in both industry and research community. However, the existing CTDE methods follow the action selection paradigm that all agents choose actions at the same time, which ignores the heterogeneous roles of different agents. Motivated by the human wisdom in cooperative behaviors, we present a novel leader-following paradigm based deep multi-agent cooperation method (LFMCO) for multi-agent cooperative games. Specifically, we define a leader as someone who broadcasts a message representing the selected action to all subordinates. After that, the followers choose their individual action based on the received message from the leader. To measure the influence of leader's action on followers, we introduced a concept of information gain, i.e., the change of followers' value function entropy, which is positively correlated with the influence of leader's action. We evaluate the LFMCO on several cooperation scenarios of StarCraft2. Simulation results confirm the significant performance improvements of LFMCO compared with four state-of-the-art benchmarks on the challenging cooperative environment.(c) 2022 Elsevier Ltd. All rights reserved.
引用
收藏
页码:1 / 12
页数:12
相关论文
共 50 条
  • [21] Leader-following consensus of nonlinear multi-agent systems based on position and velocity estimations
    Mohammadi, Mahsa
    Baradarannia, Mahdi
    Farzamnia, Ali
    2021 7TH INTERNATIONAL CONFERENCE ON CONTROL, INSTRUMENTATION AND AUTOMATION (ICCIA), 2021, : 240 - 246
  • [22] Observer-based leader-following consensus of uncertain nonlinear multi-agent systems
    Shi, P.
    Shen, Q. K.
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2017, 27 (17) : 3794 - 3811
  • [23] Leader-following consensus protocols for formation control of multi-agent network
    Xiaoyuan Luo 1
    2.Institute of Electrical Engineering
    Journal of Systems Engineering and Electronics, 2011, 22 (06) : 991 - 997
  • [24] Noise-induced consensus of leader-following multi-agent systems
    Li, Wang
    Dai, Haifeng
    Zhao, Lingzhi
    Zhao, Donghua
    Sun, Yongzheng
    MATHEMATICS AND COMPUTERS IN SIMULATION, 2023, 203 : 1 - 11
  • [25] Containment Tracking of Leader-following Multi-agent Systems with Measurement Noise
    Li Wuquan
    Xie Lihua
    Zhang Ji-Feng
    2013 32ND CHINESE CONTROL CONFERENCE (CCC), 2013, : 7330 - 7334
  • [26] Leader-following consensus of second-order multi-agent systems with a smart leader
    Liu, Honggang
    Liu, Zhongxin
    Chen, Zengqiang
    PROCEEDINGS OF THE 36TH CHINESE CONTROL CONFERENCE (CCC 2017), 2017, : 8090 - 8095
  • [27] Regional Leader-following Consensus of Multi-agent Systems with Saturating Actuators
    Li, Yuanlong
    Lin, Zongli
    PROCEEDINGS OF THE 36TH CHINESE CONTROL CONFERENCE (CCC 2017), 2017, : 8401 - 8406
  • [28] Quantized impulsive consensus of nonlinear Leader-following multi-agent systems
    Li, Gang
    Jiang, Xiaowei
    You, Le
    Zhang, Xianhe
    2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 4557 - 4561
  • [29] Leader-following consensus of heterogeneous multi-agent systems with input delays
    Chen, Jie
    Guan, Zhi-Hong
    Zhang, Ding-Xue
    Ding, Li
    2013 32ND CHINESE CONTROL CONFERENCE (CCC), 2013, : 6815 - 6819
  • [30] Leader-Following Output Formation Control of Heterogeneous Multi-Agent Systems
    Wang, Jian
    Shi, Liangren
    2019 CHINESE AUTOMATION CONGRESS (CAC2019), 2019, : 2689 - 2694