Multi-target tracking for unmanned aerial vehicle swarms using deep reinforcement learning

被引：40

作者：

Zhou, Wenhong ^{[1
]}

Liu, Zhihong ^{[1
]}

Li, Jie ^{[1
]}

Xu, Xin ^{[1
]}

Shen, Lincheng ^{[1
]}

机构：

[1] Natl Univ Def Technol, Coll Intelligence Sci & Technol, Changsha 410073, Peoples R China

来源：

NEUROCOMPUTING | 2021年 / 466卷

基金：

中国国家自然科学基金;

关键词：

UAV swarms; Multi-target tracking; Multi-agent reinforcement learning; Scalability; Feature representation; ROBOTS; ALGORITHMS; SEARCH;

D O I：

10.1016/j.neucom.2021.09.044

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In recent years, deep reinforcement learning (DRL) has proved its great potential in multi-agent cooper-ation. However, how to apply DRL to multi-target tracking (MTT) problem for unmanned aerial vehicle (UAV) swarms is challenging: 1) the scale of UAVs may be large, but the existing multi-agent reinforce-ment learning (MARL) methods that rely on global or joint information of all agents suffer from the dimensionality curse; 2) the dimension of each UAV's received information is variable, which is incom-patible with the neural networks with fixed input dimensions; 3) the UAVs are homogeneous and inter-changeable that each UAV's policy should be irrelevant to the permutation of its received information. To this end, we propose a DRL method for UAV swarms to solve the MTT problem. Firstly, a decentralized swarm-oriented Markov Decision Process (MDP) model is presented for UAV swarms, which involves each UAV's local communication and partial observation. Secondly, to achieve better scalability, a car-togram feature representation (FR) is proposed to integrate the variable-dimensional information set into a fixed-shape input variable, and the cartogram FR can also maintain the permutation irrelevance to the information. Then, the double deep Q-learning network with dueling architecture is adapted to the MTT problem, and the experience-sharing training mechanism is adopted to learn the shared cooperative pol-icy for UAV swarms. Extensive experiments are provided and the results show that our method can suc-cessfully learn a cooperative tracking policy for UAV swarms and outperforms the baseline method in the tracking ratio and scalability. (c) 2021 Elsevier B.V. All rights reserved.

引用

页码：285 / 297

页数：13

共 50 条

[1] Joint Communication and Action Learning in Multi-Target Tracking of UAV Swarms with Deep Reinforcement Learning
Zhou, Wenhong
Li, Jie
Zhang, Qingjie
DRONES, 2022, 6 (11)
[2] Improving multi-target cooperative tracking guidance for UAV swarms using multi-agent reinforcement learning
Zhou, Wenhong
LI, Jie
Liu, Zhihong
Shen, Lincheng
CHINESE JOURNAL OF AERONAUTICS, 2022, 35 (07) : 100 - 112
[3] Improving multi-target cooperative tracking guidance for UAV swarms using multi-agent reinforcement learning
Wenhong ZHOU
Jie LI
Zhihong LIU
Lincheng SHEN
Chinese Journal of Aeronautics, 2022, (07) : 100 - 112
[4] Improving multi-target cooperative tracking guidance for UAV swarms using multi-agent reinforcement learning
Wenhong ZHOU
Jie LI
Zhihong LIU
Lincheng SHEN
Chinese Journal of Aeronautics, 2022, 35 (07) : 100 - 112
[5] Lightweight multi-target detection algorithm for unmanned aerial vehicle aerial imagery
Liu, Yang
Ma, Ding
Wang, Yongfu
JOURNAL OF APPLIED REMOTE SENSING, 2023, 17 (04)
[6] Real-time deep learning for moving target detection and tracking using unmanned aerial vehicle
Doukhi O.
Hossain S.
Lee D.-J.
Journal of Institute of Control, Robotics and Systems, 2020, 26 (05) : 295 - 301
[7] Multi-target Tracking and Data Association on Road Networks Using Unmanned Aerial Vehicles
Barkley, Brett E.
Paley, Derek A.
2017 IEEE AEROSPACE CONFERENCE, 2017,
[8] Trajectory tracking control of an unmanned aerial vehicle with deep reinforcement learning for tasks inside the EAST
Yu, Chao
Yang, Yang
Cheng, Yong
Wang, Zheng
Shi, Mingming
FUSION ENGINEERING AND DESIGN, 2023, 194
[9] A Multi-target Passive Tracking Algorithm Based on Unmanned Underwater Vehicle
Wang Y.
Li Y.
Ju D.
Huang H.
Li, Yu (hhn@mail.ioa.ac.cn), 2013, Science Press (42): : 2013 - 2020
[10] A Multi-target Passive Tracking Algorithm Based on Unmanned Underwater Vehicle
Wang Yujie
Li Yu
Ju Donghao
Huang Haining
JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2020, 42 (08) : 2013 - 2020

← 1 2 3 4 5 →