Multi-target tracking for unmanned aerial vehicle swarms using deep reinforcement learning

被引：50

作者：

Zhou, Wenhong ^{[1
]}

Liu, Zhihong ^{[1
]}

Li, Jie ^{[1
]}

Xu, Xin ^{[1
]}

Shen, Lincheng ^{[1
]}

机构：

[1] Natl Univ Def Technol, Coll Intelligence Sci & Technol, Changsha 410073, Peoples R China

来源：

NEUROCOMPUTING | 2021年 / 466卷

基金：

中国国家自然科学基金;

关键词：

UAV swarms; Multi-target tracking; Multi-agent reinforcement learning; Scalability; Feature representation; ROBOTS; ALGORITHMS; SEARCH;

D O I：

10.1016/j.neucom.2021.09.044

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In recent years, deep reinforcement learning (DRL) has proved its great potential in multi-agent cooper-ation. However, how to apply DRL to multi-target tracking (MTT) problem for unmanned aerial vehicle (UAV) swarms is challenging: 1) the scale of UAVs may be large, but the existing multi-agent reinforce-ment learning (MARL) methods that rely on global or joint information of all agents suffer from the dimensionality curse; 2) the dimension of each UAV's received information is variable, which is incom-patible with the neural networks with fixed input dimensions; 3) the UAVs are homogeneous and inter-changeable that each UAV's policy should be irrelevant to the permutation of its received information. To this end, we propose a DRL method for UAV swarms to solve the MTT problem. Firstly, a decentralized swarm-oriented Markov Decision Process (MDP) model is presented for UAV swarms, which involves each UAV's local communication and partial observation. Secondly, to achieve better scalability, a car-togram feature representation (FR) is proposed to integrate the variable-dimensional information set into a fixed-shape input variable, and the cartogram FR can also maintain the permutation irrelevance to the information. Then, the double deep Q-learning network with dueling architecture is adapted to the MTT problem, and the experience-sharing training mechanism is adopted to learn the shared cooperative pol-icy for UAV swarms. Extensive experiments are provided and the results show that our method can suc-cessfully learn a cooperative tracking policy for UAV swarms and outperforms the baseline method in the tracking ratio and scalability. (c) 2021 Elsevier B.V. All rights reserved.

引用

页码：285 / 297

页数：13

共 50 条

[41] Distributed multi-target search and tracking using the PHD filter [J].

Dames, Philip M. .

AUTONOMOUS ROBOTS, 2020, 44 (3-4) :673-689

[42] Distributed multi-target search and tracking using the PHD filter [J].

Philip M. Dames .

Autonomous Robots, 2020, 44 :673-689

[43] SPARSITY BASED MULTI-TARGET TRACKING USING MOBILE SENSORS [J].

Ren, Guohua ;

Schizas, Ioannis D. ;

Maroulas, Vasileios .

2016 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING PROCEEDINGS, 2016, :4578-4582

[44] Online Discriminative Structured Output SVM Learning for Multi-Target Tracking [J].

Xu, Yingkun ;

Qin, Lei ;

Li, Guorong ;

Huang, Qingming .

IEEE SIGNAL PROCESSING LETTERS, 2014, 21 (02) :190-194

[45] A Data-Driven Packet Routing Algorithm for an Unmanned Aerial Vehicle Swarm: A Multi-Agent Reinforcement Learning Approach [J].

Qiu, Xiulin ;

Xu, Lei ;

Wang, Ping ;

Yang, Yuwang ;

Liao, Zhenqiang .

IEEE WIRELESS COMMUNICATIONS LETTERS, 2022, 11 (10) :2160-2164

[46] Multi-Agent Reinforcement Learning-Based Computation Offloading for Unmanned Aerial Vehicle Post-Disaster Rescue [J].

Wang, Lixing ;

Jiao, Huirong .

SENSORS, 2024, 24 (24)

[47] Ballistic missile tracking in the presence of clutter using multi-target tracking algorithm [J].

Asad, Muhammad ;

Khan, Sumair ;

Arif, Muhammad ;

Mehmood, Zahid ;

Durrani, Sajjad ;

Khan, Uzair .

PROCEEDINGS OF 2017 14TH INTERNATIONAL BHURBAN CONFERENCE ON APPLIED SCIENCES AND TECHNOLOGY (IBCAST), 2017, :357-360

[48] Cooperative Multi-Target Positioning for Cell-Free Massive MIMO With Multi-Agent Reinforcement Learning [J].

Liu, Ziheng ;

Zhang, Jiayi ;

Shi, Enyu ;

Zhu, Yiyang ;

Ng, Derrick Wing Kwan ;

Ai, Bo .

IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2024, 23 (12) :19034-19049

[49] Hierarchical Adaptation of Multiagent Deep Reinforcement Learning for Multi-Domain Uncrewed Aerial and Ground Vehicle Coordination [J].

Hulede, Ian Ellis L. ;

Mukherjee, Amitav ;

Ashdown, Jonathan .

MILCOM 2024-2024 IEEE MILITARY COMMUNICATIONS CONFERENCE, MILCOM, 2024, :487-492

[50] Multi-AUV Cooperative Underwater Multi-Target Tracking Based on Dynamic-Switching-Enabled Multi-Agent Reinforcement Learning [J].

Wang, Shengbo ;

Lin, Chuan ;

Han, Guangjie ;

Zhu, Shengchao ;

Li, Zhixian ;

Wang, Zhenyu ;

Ma, Yunpeng .

IEEE TRANSACTIONS ON MOBILE COMPUTING, 2025, 24 (05) :4296-4311

← 1 2 3 4 5 →