Multi-target tracking for unmanned aerial vehicle swarms using deep reinforcement learning

被引:50
作者
Zhou, Wenhong [1 ]
Liu, Zhihong [1 ]
Li, Jie [1 ]
Xu, Xin [1 ]
Shen, Lincheng [1 ]
机构
[1] Natl Univ Def Technol, Coll Intelligence Sci & Technol, Changsha 410073, Peoples R China
基金
中国国家自然科学基金;
关键词
UAV swarms; Multi-target tracking; Multi-agent reinforcement learning; Scalability; Feature representation; ROBOTS; ALGORITHMS; SEARCH;
D O I
10.1016/j.neucom.2021.09.044
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, deep reinforcement learning (DRL) has proved its great potential in multi-agent cooper-ation. However, how to apply DRL to multi-target tracking (MTT) problem for unmanned aerial vehicle (UAV) swarms is challenging: 1) the scale of UAVs may be large, but the existing multi-agent reinforce-ment learning (MARL) methods that rely on global or joint information of all agents suffer from the dimensionality curse; 2) the dimension of each UAV's received information is variable, which is incom-patible with the neural networks with fixed input dimensions; 3) the UAVs are homogeneous and inter-changeable that each UAV's policy should be irrelevant to the permutation of its received information. To this end, we propose a DRL method for UAV swarms to solve the MTT problem. Firstly, a decentralized swarm-oriented Markov Decision Process (MDP) model is presented for UAV swarms, which involves each UAV's local communication and partial observation. Secondly, to achieve better scalability, a car-togram feature representation (FR) is proposed to integrate the variable-dimensional information set into a fixed-shape input variable, and the cartogram FR can also maintain the permutation irrelevance to the information. Then, the double deep Q-learning network with dueling architecture is adapted to the MTT problem, and the experience-sharing training mechanism is adopted to learn the shared cooperative pol-icy for UAV swarms. Extensive experiments are provided and the results show that our method can suc-cessfully learn a cooperative tracking policy for UAV swarms and outperforms the baseline method in the tracking ratio and scalability. (c) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页码:285 / 297
页数:13
相关论文
共 50 条
[41]   Distributed multi-target search and tracking using the PHD filter [J].
Dames, Philip M. .
AUTONOMOUS ROBOTS, 2020, 44 (3-4) :673-689
[42]   Distributed multi-target search and tracking using the PHD filter [J].
Philip M. Dames .
Autonomous Robots, 2020, 44 :673-689
[43]   SPARSITY BASED MULTI-TARGET TRACKING USING MOBILE SENSORS [J].
Ren, Guohua ;
Schizas, Ioannis D. ;
Maroulas, Vasileios .
2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, :4578-4582
[44]   Online Discriminative Structured Output SVM Learning for Multi-Target Tracking [J].
Xu, Yingkun ;
Qin, Lei ;
Li, Guorong ;
Huang, Qingming .
IEEE SIGNAL PROCESSING LETTERS, 2014, 21 (02) :190-194
[45]   A Data-Driven Packet Routing Algorithm for an Unmanned Aerial Vehicle Swarm: A Multi-Agent Reinforcement Learning Approach [J].
Qiu, Xiulin ;
Xu, Lei ;
Wang, Ping ;
Yang, Yuwang ;
Liao, Zhenqiang .
IEEE WIRELESS COMMUNICATIONS LETTERS, 2022, 11 (10) :2160-2164
[46]   Multi-Agent Reinforcement Learning-Based Computation Offloading for Unmanned Aerial Vehicle Post-Disaster Rescue [J].
Wang, Lixing ;
Jiao, Huirong .
SENSORS, 2024, 24 (24)
[47]   Ballistic missile tracking in the presence of clutter using multi-target tracking algorithm [J].
Asad, Muhammad ;
Khan, Sumair ;
Arif, Muhammad ;
Mehmood, Zahid ;
Durrani, Sajjad ;
Khan, Uzair .
PROCEEDINGS OF 2017 14TH INTERNATIONAL BHURBAN CONFERENCE ON APPLIED SCIENCES AND TECHNOLOGY (IBCAST), 2017, :357-360
[48]   Cooperative Multi-Target Positioning for Cell-Free Massive MIMO With Multi-Agent Reinforcement Learning [J].
Liu, Ziheng ;
Zhang, Jiayi ;
Shi, Enyu ;
Zhu, Yiyang ;
Ng, Derrick Wing Kwan ;
Ai, Bo .
IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2024, 23 (12) :19034-19049
[49]   Hierarchical Adaptation of Multiagent Deep Reinforcement Learning for Multi-Domain Uncrewed Aerial and Ground Vehicle Coordination [J].
Hulede, Ian Ellis L. ;
Mukherjee, Amitav ;
Ashdown, Jonathan .
MILCOM 2024-2024 IEEE MILITARY COMMUNICATIONS CONFERENCE, MILCOM, 2024, :487-492
[50]   Multi-AUV Cooperative Underwater Multi-Target Tracking Based on Dynamic-Switching-Enabled Multi-Agent Reinforcement Learning [J].
Wang, Shengbo ;
Lin, Chuan ;
Han, Guangjie ;
Zhu, Shengchao ;
Li, Zhixian ;
Wang, Zhenyu ;
Ma, Yunpeng .
IEEE TRANSACTIONS ON MOBILE COMPUTING, 2025, 24 (05) :4296-4311