Improving multi-target cooperative tracking guidance for UAV swarms using multi-agent reinforcement learning

Cited by: 45

Authors:

Zhou, Wenhong [1]

Li, Jie [1]

Liu, Zhihong [1]

Shen, Lincheng [1]

Affiliations:

[1] Natl Univ Def Technol, Coll Intelligence Sci & Technol, Changsha 410073, Peoples R China

Source:

CHINESE JOURNAL OF AERONAUTICS | 2022 / Vol. 35 / No. 07

Funding:

National Natural Science Foundation of China;

Keywords:

Decentralized cooperation; Maximum reciprocal reward; Multi-agent actor-critic; Pointwise mutual information; Reinforcement learning; ALGORITHMS; SEARCH; ROBOTS; GAMES;

DOI:

10.1016/j.cja.2021.09.008

Chinese Library Classification (CLC) number:

V [Aeronautics, Astronautics];

Discipline classification code:

08; 0825;

Abstract:

Multi-Target Tracking Guidance (MTTG) in unknown environments has great potential value in applications for Unmanned Aerial Vehicle (UAV) swarms. Although Multi-Agent Deep Reinforcement Learning (MADRL) is a promising technique for learning cooperation, most existing methods do not scale well to decentralized UAV swarms because of their computational complexity or their need for global information. This paper proposes a decentralized MADRL method that uses the maximum reciprocal reward to learn cooperative tracking policies for UAV swarms. The method reshapes each UAV's reward with a regularization term defined as the dot product of the reward vector of all neighbor UAVs and the corresponding dependency vector between the UAV and its neighbors. The dependence between UAVs is captured directly by a Pointwise Mutual Information (PMI) neural network, without complicated aggregation statistics. The experience-sharing Reciprocal Reward Multi-Agent Actor-Critic (MAAC-R) algorithm is then proposed to learn the cooperative sharing policy for all homogeneous UAVs. Experiments demonstrate that the proposed algorithm improves the UAVs' cooperation more effectively than the baseline algorithms and stimulates a rich variety of cooperative tracking behaviors in UAV swarms. In addition, the learned policy scales better to other scenarios with more UAVs and targets. (c) 2021 Chinese Society of Aeronautics and Astronautics. Production and hosting by Elsevier Ltd. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
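
The reward reshaping described in the abstract can be illustrated with a minimal sketch. The abstract only states that the regularization term is a dot product of the neighbors' reward vector and a dependency vector, so everything else here is an assumption: the function name `reciprocal_reward`, the `weight` coefficient, and the hand-supplied dependency values (in the paper these would come from the PMI neural network) are hypothetical, and the "maximum" aspect of the method is not modeled.

```python
import numpy as np


def reciprocal_reward(own_reward, neighbor_rewards, dependencies, weight=1.0):
    """Reshape one UAV's reward with a dot-product regularization term.

    Hypothetical helper: `weight` and the argument names are illustrative
    assumptions, not taken from the paper.
    """
    neighbor_rewards = np.asarray(neighbor_rewards, dtype=float)
    dependencies = np.asarray(dependencies, dtype=float)
    if neighbor_rewards.shape != dependencies.shape:
        raise ValueError("one dependency value is needed per neighbor")
    # Regularization term: dot product of the neighbors' reward vector
    # and the dependency vector between this UAV and those neighbors.
    regularizer = float(np.dot(neighbor_rewards, dependencies))
    return own_reward + weight * regularizer


# Example with two neighbors; dependency values are made up for illustration.
shaped = reciprocal_reward(1.0, [0.5, 2.0], [0.2, 0.1], weight=0.5)
print(shaped)
```

With no neighbors in range, both vectors are empty and the shaped reward reduces to the UAV's own reward, which matches the decentralized setting where each UAV only uses local information.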

Pages: 100-112

Page count: 13
