Improving multi-target cooperative tracking guidance for UAV swarms using multi-agent reinforcement learning

被引:45
|
作者
Zhou, Wenhong [1 ]
LI, Jie [1 ]
Liu, Zhihong [1 ]
Shen, Lincheng [1 ]
机构
[1] Natl Univ Def Technol, Coll Intelligence Sci & Technol, Changsha 410073, Peoples R China
基金
中国国家自然科学基金;
关键词
Decentralized cooperation; Maximum reciprocal reward; Multi-agent actor-critic; Pointwise mutual informa-; Reinforcement learning; ALGORITHMS; SEARCH; ROBOTS; GAMES;
D O I
10.1016/j.cja.2021.09.008
中图分类号
V [航空、航天];
学科分类号
08 ; 0825 ;
摘要
Multi-Target Tracking Guidance (MTTG) in unknown environments has great potential values in applications for Unmanned Aerial Vehicle (UAV) swarms. Although Multi-Agent Deep Reinforcement Learning (MADRL) is a promising technique for learning cooperation, most of the existing methods cannot scale well to decentralized UAV swarms due to their computational complexity or global information requirement. This paper proposes a decentralized MADRL method using the maximum reciprocal reward to learn cooperative tracking policies for UAV swarms. This method reshapes each UAV's reward with a regularization term that is defined as the dot product of the reward vector of all neighbor UAVs and the corresponding dependency vector between the UAV and the neighbors. And the dependence between UAVs can be directly captured by the Pointwise Mutual Information (PMI) neural network without complicated aggregation statistics. Then, the experience sharing Reciprocal Reward Multi-Agent Actor-Critic (MAAC-R) algorithm is proposed to learn the cooperative sharing policy for all homogeneous UAVs. Experiments demonstrate that the proposed algorithm can improve the UAVs' cooperation more effectively than the baseline algorithms, and can stimulate a rich form of cooperative tracking behaviors of UAV swarms. Besides, the learned policy can better scale to other scenarios with more UAVs and targets. (c) 2021 Chinese Society of Aeronautics and Astronautics. Production and hosting by Elsevier Ltd. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
引用
收藏
页码:100 / 112
页数:13
相关论文
共 50 条
  • [21] Flocking algorithm with multi-target tracking for multi-agent systems
    Luo, Xiaoyuan
    Li, Shaobao
    Guan, Xinping
    PATTERN RECOGNITION LETTERS, 2010, 31 (09) : 800 - 805
  • [22] Improving Computational Complexity of Multi-Target Multi-Agent Reinforcement for Hyperspectral Satellite Sensor Tasking
    Saeed, Amir K.
    Yasin, Alhassan S.
    Johnson, Benjamin A.
    Holguin, Francisco
    Rodriguez, Benjamin M.
    PATTERN RECOGNITION AND PREDICTION XXXV, 2024, 13040
  • [23] Maintaining Connectivity for Multi-UAV Multi-Target Search Using Reinforcement Learning
    Guven, Islam
    Yanmaz, Evsen
    PROCEEDINGS OF THE INT'L ACM SYMPOSIUM ON DESIGN AND ANALYSIS OF INTELLIGENT VEHICULAR NETWORKS AND APPLICATIONS, DIVANET 2023, 2023, : 109 - 114
  • [24] The research on intelligent cooperative combat of UAV cluster with multi-agent reinforcement learning
    Xu D.
    Chen G.
    Aerospace Systems, 2022, 5 (1) : 107 - 121
  • [25] Route Guidance System Using Multi-agent Reinforcement Learning
    Arokhlo, Mortaza Zolfpour
    Selamat, Ali
    Hashim, Siti Zaiton Mohd
    Selamat, Md Hafiz
    2011 7TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY IN ASIA (CITA 11), 2011,
  • [26] On the Robustness of Cooperative Multi-Agent Reinforcement Learning
    Lin, Jieyu
    Dzeparoska, Kristina
    Zhang, Sai Qian
    Leon-Garcia, Alberto
    Papernot, Nicolas
    2020 IEEE SYMPOSIUM ON SECURITY AND PRIVACY WORKSHOPS (SPW 2020), 2020, : 62 - 68
  • [27] Extrinsic-and-Intrinsic Reward-Based Multi-Agent Reinforcement Learning for Multi-UAV Cooperative Target Encirclement
    Chen, Jinchao
    Wang, Yang
    Zhang, Ying
    Lu, Yantao
    Shu, Qiuhao
    Hu, Yujiao
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2025,
  • [28] Consensus Learning for Cooperative Multi-Agent Reinforcement Learning
    Xu, Zhiwei
    Zhang, Bin
    Li, Dapeng
    Zhang, Zeren
    Zhou, Guangchong
    Chen, Hao
    Fan, Guoliang
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 10, 2023, : 11726 - 11734
  • [29] Multi-Target Tracking Resources Allocation Using Multi-Agent Modeling and Auction Algorithm
    de Rochechouart, Maxence
    Seghrouchni, Amal El Fallah
    Barbaresco, Frederic
    Abu Zitar, Raed
    2023 24TH INTERNATIONAL RADAR SYMPOSIUM, IRS, 2023,
  • [30] Combined Macroscopic and Microscopic Multi-Agent Control For Multi-Target Tracking
    Abdulghafoor, Alaa Z.
    Bakolas, Efstathios
    IFAC PAPERSONLINE, 2022, 55 (37): : 669 - 674