Multi-target tracking for unmanned aerial vehicle swarms using deep reinforcement learning

被引:40
|
作者
Zhou, Wenhong [1 ]
Liu, Zhihong [1 ]
Li, Jie [1 ]
Xu, Xin [1 ]
Shen, Lincheng [1 ]
机构
[1] Natl Univ Def Technol, Coll Intelligence Sci & Technol, Changsha 410073, Peoples R China
基金
中国国家自然科学基金;
关键词
UAV swarms; Multi-target tracking; Multi-agent reinforcement learning; Scalability; Feature representation; ROBOTS; ALGORITHMS; SEARCH;
D O I
10.1016/j.neucom.2021.09.044
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In recent years, deep reinforcement learning (DRL) has proved its great potential in multi-agent cooper-ation. However, how to apply DRL to multi-target tracking (MTT) problem for unmanned aerial vehicle (UAV) swarms is challenging: 1) the scale of UAVs may be large, but the existing multi-agent reinforce-ment learning (MARL) methods that rely on global or joint information of all agents suffer from the dimensionality curse; 2) the dimension of each UAV's received information is variable, which is incom-patible with the neural networks with fixed input dimensions; 3) the UAVs are homogeneous and inter-changeable that each UAV's policy should be irrelevant to the permutation of its received information. To this end, we propose a DRL method for UAV swarms to solve the MTT problem. Firstly, a decentralized swarm-oriented Markov Decision Process (MDP) model is presented for UAV swarms, which involves each UAV's local communication and partial observation. Secondly, to achieve better scalability, a car-togram feature representation (FR) is proposed to integrate the variable-dimensional information set into a fixed-shape input variable, and the cartogram FR can also maintain the permutation irrelevance to the information. Then, the double deep Q-learning network with dueling architecture is adapted to the MTT problem, and the experience-sharing training mechanism is adopted to learn the shared cooperative pol-icy for UAV swarms. Extensive experiments are provided and the results show that our method can suc-cessfully learn a cooperative tracking policy for UAV swarms and outperforms the baseline method in the tracking ratio and scalability. (c) 2021 Elsevier B.V. All rights reserved.
引用
收藏
页码:285 / 297
页数:13
相关论文
共 50 条
  • [1] Joint Communication and Action Learning in Multi-Target Tracking of UAV Swarms with Deep Reinforcement Learning
    Zhou, Wenhong
    Li, Jie
    Zhang, Qingjie
    DRONES, 2022, 6 (11)
  • [2] Improving multi-target cooperative tracking guidance for UAV swarms using multi-agent reinforcement learning
    Zhou, Wenhong
    LI, Jie
    Liu, Zhihong
    Shen, Lincheng
    CHINESE JOURNAL OF AERONAUTICS, 2022, 35 (07) : 100 - 112
  • [3] Improving multi-target cooperative tracking guidance for UAV swarms using multi-agent reinforcement learning
    Wenhong ZHOU
    Jie LI
    Zhihong LIU
    Lincheng SHEN
    Chinese Journal of Aeronautics, 2022, (07) : 100 - 112
  • [4] Improving multi-target cooperative tracking guidance for UAV swarms using multi-agent reinforcement learning
    Wenhong ZHOU
    Jie LI
    Zhihong LIU
    Lincheng SHEN
    Chinese Journal of Aeronautics, 2022, 35 (07) : 100 - 112
  • [5] Lightweight multi-target detection algorithm for unmanned aerial vehicle aerial imagery
    Liu, Yang
    Ma, Ding
    Wang, Yongfu
    JOURNAL OF APPLIED REMOTE SENSING, 2023, 17 (04)
  • [6] Real-time deep learning for moving target detection and tracking using unmanned aerial vehicle
    Doukhi O.
    Hossain S.
    Lee D.-J.
    Journal of Institute of Control, Robotics and Systems, 2020, 26 (05) : 295 - 301
  • [7] Multi-target Tracking and Data Association on Road Networks Using Unmanned Aerial Vehicles
    Barkley, Brett E.
    Paley, Derek A.
    2017 IEEE AEROSPACE CONFERENCE, 2017,
  • [8] Trajectory tracking control of an unmanned aerial vehicle with deep reinforcement learning for tasks inside the EAST
    Yu, Chao
    Yang, Yang
    Cheng, Yong
    Wang, Zheng
    Shi, Mingming
    FUSION ENGINEERING AND DESIGN, 2023, 194
  • [9] A Multi-target Passive Tracking Algorithm Based on Unmanned Underwater Vehicle
    Wang Y.
    Li Y.
    Ju D.
    Huang H.
    Li, Yu (hhn@mail.ioa.ac.cn), 2013, Science Press (42): : 2013 - 2020
  • [10] A Multi-target Passive Tracking Algorithm Based on Unmanned Underwater Vehicle
    Wang Yujie
    Li Yu
    Ju Donghao
    Huang Haining
    JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2020, 42 (08) : 2013 - 2020