Improving multi-target cooperative tracking guidance for UAV swarms using multi-agent reinforcement learning

Cited by: 45

Authors:

Zhou, Wenhong [1]

Li, Jie [1]

Liu, Zhihong [1]

Shen, Lincheng [1]

Affiliations:

[1] Natl Univ Def Technol, Coll Intelligence Sci & Technol, Changsha 410073, Peoples R China

Source:

CHINESE JOURNAL OF AERONAUTICS | 2022 / Vol. 35 / Issue 07

Funding:

National Natural Science Foundation of China;

Keywords:

Decentralized cooperation; Maximum reciprocal reward; Multi-agent actor-critic; Pointwise mutual information; Reinforcement learning; ALGORITHMS; SEARCH; ROBOTS; GAMES;

DOI:

10.1016/j.cja.2021.09.008

Chinese Library Classification (CLC):

V [Aeronautics, Astronautics];

Discipline classification code:

08 ; 0825 ;

Abstract:

Multi-Target Tracking Guidance (MTTG) in unknown environments has great potential value for Unmanned Aerial Vehicle (UAV) swarm applications. Although Multi-Agent Deep Reinforcement Learning (MADRL) is a promising technique for learning cooperation, most existing methods scale poorly to decentralized UAV swarms because of their computational complexity or their need for global information. This paper proposes a decentralized MADRL method that uses the maximum reciprocal reward to learn cooperative tracking policies for UAV swarms. The method reshapes each UAV's reward with a regularization term defined as the dot product of the reward vector of all neighbor UAVs and the corresponding dependency vector between the UAV and its neighbors. The dependence between UAVs is captured directly by a Pointwise Mutual Information (PMI) neural network, without complicated aggregation statistics. An experience-sharing Reciprocal Reward Multi-Agent Actor-Critic (MAAC-R) algorithm is then proposed to learn a shared cooperative policy for all homogeneous UAVs. Experiments demonstrate that the proposed algorithm improves the UAVs' cooperation more effectively than the baseline algorithms and elicits a rich variety of cooperative tracking behaviors in UAV swarms. In addition, the learned policy scales better to other scenarios with more UAVs and targets. (c) 2021 Chinese Society of Aeronautics and Astronautics. Production and hosting by Elsevier Ltd. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
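
As a rough illustration of the reward reshaping described in the abstract, the sketch below adds a regularization term (the dot product of the neighbors' rewards and a dependency vector) to a UAV's own reward. The function name `reshape_reward`, the `weight` coefficient, and the sample numbers are assumptions made for illustration only. The abstract does not say how the term is scaled, and in the paper the dependency values are produced by a PMI neural network rather than supplied by hand.

```python
from typing import Sequence


def reshape_reward(
    own_reward: float,
    neighbor_rewards: Sequence[float],
    dependencies: Sequence[float],
    weight: float = 1.0,  # hypothetical scaling coefficient, not from the abstract
) -> float:
    """Return own_reward plus a weighted dot product of neighbor rewards
    and the matching dependency values (one value per neighbor)."""
    if len(neighbor_rewards) != len(dependencies):
        raise ValueError("need exactly one dependency value per neighbor")
    # Dot product between the neighbors' reward vector and the dependency vector.
    reciprocal_term = sum(r * d for r, d in zip(neighbor_rewards, dependencies))
    return own_reward + weight * reciprocal_term


# Hand-picked example values; a learned PMI estimator would supply
# the dependency vector in practice.
shaped = reshape_reward(1.0, [2.0, 4.0], [0.5, 0.25], weight=0.5)
```

With no neighbors in range, both vectors are empty and the function returns the UAV's own reward unchanged, which is one plausible way for a decentralized agent to behave when it is isolated.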


Pages: 100-112

Page count: 13

50 items in total

[21] Flocking algorithm with multi-target tracking for multi-agent systems
Luo, Xiaoyuan
Li, Shaobao
Guan, Xinping
PATTERN RECOGNITION LETTERS, 2010, 31 (09) : 800 - 805
[22] Improving Computational Complexity of Multi-Target Multi-Agent Reinforcement for Hyperspectral Satellite Sensor Tasking
Saeed, Amir K.
Yasin, Alhassan S.
Johnson, Benjamin A.
Holguin, Francisco
Rodriguez, Benjamin M.
PATTERN RECOGNITION AND PREDICTION XXXV, 2024, 13040
[23] Maintaining Connectivity for Multi-UAV Multi-Target Search Using Reinforcement Learning
Guven, Islam
Yanmaz, Evsen
PROCEEDINGS OF THE INT'L ACM SYMPOSIUM ON DESIGN AND ANALYSIS OF INTELLIGENT VEHICULAR NETWORKS AND APPLICATIONS, DIVANET 2023, 2023, : 109 - 114
[24] The research on intelligent cooperative combat of UAV cluster with multi-agent reinforcement learning
Xu D.
Chen G.
Aerospace Systems, 2022, 5 (1) : 107 - 121
[25] Route Guidance System Using Multi-agent Reinforcement Learning
Arokhlo, Mortaza Zolfpour
Selamat, Ali
Hashim, Siti Zaiton Mohd
Selamat, Md Hafiz
2011 7TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY IN ASIA (CITA 11), 2011,
[26] On the Robustness of Cooperative Multi-Agent Reinforcement Learning
Lin, Jieyu
Dzeparoska, Kristina
Zhang, Sai Qian
Leon-Garcia, Alberto
Papernot, Nicolas
2020 IEEE SYMPOSIUM ON SECURITY AND PRIVACY WORKSHOPS (SPW 2020), 2020, : 62 - 68
[27] Extrinsic-and-Intrinsic Reward-Based Multi-Agent Reinforcement Learning for Multi-UAV Cooperative Target Encirclement
Chen, Jinchao
Wang, Yang
Zhang, Ying
Lu, Yantao
Shu, Qiuhao
Hu, Yujiao
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2025,
[28] Consensus Learning for Cooperative Multi-Agent Reinforcement Learning
Xu, Zhiwei
Zhang, Bin
Li, Dapeng
Zhang, Zeren
Zhou, Guangchong
Chen, Hao
Fan, Guoliang
THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 10, 2023, : 11726 - 11734
[29] Multi-Target Tracking Resources Allocation Using Multi-Agent Modeling and Auction Algorithm
de Rochechouart, Maxence
Seghrouchni, Amal El Fallah
Barbaresco, Frederic
Abu Zitar, Raed
2023 24TH INTERNATIONAL RADAR SYMPOSIUM, IRS, 2023,
[30] Combined Macroscopic and Microscopic Multi-Agent Control For Multi-Target Tracking
Abdulghafoor, Alaa Z.
Bakolas, Efstathios
IFAC PAPERSONLINE, 2022, 55 (37) : 669 - 674
