Improving multi-target cooperative tracking guidance for UAV swarms using multi-agent reinforcement learning

被引:45
|
作者
Zhou, Wenhong [1 ]
LI, Jie [1 ]
Liu, Zhihong [1 ]
Shen, Lincheng [1 ]
机构
[1] Natl Univ Def Technol, Coll Intelligence Sci & Technol, Changsha 410073, Peoples R China
基金
中国国家自然科学基金;
关键词
Decentralized cooperation; Maximum reciprocal reward; Multi-agent actor-critic; Pointwise mutual informa-; Reinforcement learning; ALGORITHMS; SEARCH; ROBOTS; GAMES;
D O I
10.1016/j.cja.2021.09.008
中图分类号
V [航空、航天];
学科分类号
08 ; 0825 ;
摘要
Multi-Target Tracking Guidance (MTTG) in unknown environments has great potential values in applications for Unmanned Aerial Vehicle (UAV) swarms. Although Multi-Agent Deep Reinforcement Learning (MADRL) is a promising technique for learning cooperation, most of the existing methods cannot scale well to decentralized UAV swarms due to their computational complexity or global information requirement. This paper proposes a decentralized MADRL method using the maximum reciprocal reward to learn cooperative tracking policies for UAV swarms. This method reshapes each UAV's reward with a regularization term that is defined as the dot product of the reward vector of all neighbor UAVs and the corresponding dependency vector between the UAV and the neighbors. And the dependence between UAVs can be directly captured by the Pointwise Mutual Information (PMI) neural network without complicated aggregation statistics. Then, the experience sharing Reciprocal Reward Multi-Agent Actor-Critic (MAAC-R) algorithm is proposed to learn the cooperative sharing policy for all homogeneous UAVs. Experiments demonstrate that the proposed algorithm can improve the UAVs' cooperation more effectively than the baseline algorithms, and can stimulate a rich form of cooperative tracking behaviors of UAV swarms. Besides, the learned policy can better scale to other scenarios with more UAVs and targets. (c) 2021 Chinese Society of Aeronautics and Astronautics. Production and hosting by Elsevier Ltd. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
引用
收藏
页码:100 / 112
页数:13
相关论文
共 50 条
  • [31] Multi-Agent Reinforcement Learning for Multi-Object Tracking
    Rosello, Pol
    Kochenderfer, Mykel J.
    PROCEEDINGS OF THE 17TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS (AAMAS' 18), 2018, : 1397 - 1404
  • [32] The Application of Multi-Agent Reinforcement Learning in UAV Networks
    Cui, Jingjing
    Liu, Yuanwei
    Nallanathan, Arumugam
    2019 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS WORKSHOPS (ICC WORKSHOPS), 2019,
  • [33] UAV cooperative air combat maneuver decision based on multi-agent reinforcement learning
    ZHANG Jiandong
    YANG Qiming
    SHI Guoqing
    LU Yi
    WU Yong
    Journal of Systems Engineering and Electronics, 2021, 32 (06) : 1421 - 1438
  • [34] UAV cooperative air combat maneuver decision based on multi-agent reinforcement learning
    Zhang Jiandong
    Yang Qiming
    Shi Guoqing
    Lu Yi
    Wu Yong
    JOURNAL OF SYSTEMS ENGINEERING AND ELECTRONICS, 2021, 32 (06) : 1421 - 1438
  • [35] Multi-mode filter target tracking method for mobile robot using multi-agent reinforcement learning
    Li, Xiaofeng
    Ren, Jie
    Li, Yunbo
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2024, 127
  • [36] UAV Cooperative Air Combat Maneuvering Confrontation Based on Multi-agent Reinforcement Learning
    Gong, Zihao
    Xu, Yang
    Luo, Delin
    UNMANNED SYSTEMS, 2023, 11 (03) : 273 - 286
  • [37] Multi-Agent Graphic Reinforcement Learning for Real-Time UAV Video Transmission with Predictive Target Tracking
    Duan, Fan
    Zhu, Kun
    2023 IEEE INTERNATIONAL CONFERENCES ON INTERNET OF THINGS, ITHINGS IEEE GREEN COMPUTING AND COMMUNICATIONS, GREENCOM IEEE CYBER, PHYSICAL AND SOCIAL COMPUTING, CPSCOM IEEE SMART DATA, SMARTDATA AND IEEE CONGRESS ON CYBERMATICS,CYBERMATICS, 2024, : 287 - 293
  • [38] Distributed Coordination Guidance in Multi-Agent Reinforcement Learning
    Lau, Qiangfeng Peter
    Lee, Mong Li
    Hsu, Wynne
    2011 23RD IEEE INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2011), 2011, : 456 - 463
  • [39] Cooperative Multi-agent Systems for the Multi-target K-Coverage Problem
    Frasheri, Mirgita
    Esterle, Lukas
    Papadopoulos, Alessandro Vittorio
    AGENTS AND ARTIFICIAL INTELLIGENCE, ICAART 2020, 2021, 12613 : 106 - 131
  • [40] Preference-based experience sharing scheme for multi-agent reinforcement learning in multi-target environments
    Zuo, Xuan
    Zhang, Pu
    Li, Hui-Yan
    Liu, Zhun-Ga
    EVOLVING SYSTEMS, 2024, 15 (05) : 1681 - 1699