UAV Maneuvering Target Tracking in Uncertain Environments Based on Deep Reinforcement Learning and Meta-Learning

被引:56
作者
Li, Bo [1 ]
Gan, Zhigang [1 ]
Chen, Daqing [2 ]
Sergey Aleksandrovich, Dyachenko [3 ]
机构
[1] Northwestern Polytech Univ, Sch Elect & Informat, Xian 710072, Peoples R China
[2] London South Bank Univ, Sch Engn, London SE1 0AA, England
[3] Moscow Inst Aviat Technol, Sch Robot & Intelligent Syst, Moscow 125993, Russia
关键词
UAV; maneuvering target tracking; deep reinforcement learning; meta-learning; multi-tasks; SYSTEM;
D O I
10.3390/rs12223789
中图分类号
X [环境科学、安全科学];
学科分类号
08 ; 0830 ;
摘要
This paper combines deep reinforcement learning (DRL) with meta-learning and proposes a novel approach, named meta twin delayed deep deterministic policy gradient (Meta-TD3), to realize the control of unmanned aerial vehicle (UAV), allowing a UAV to quickly track a target in an environment where the motion of a target is uncertain. This approach can be applied to a variety of scenarios, such as wildlife protection, emergency aid, and remote sensing. We consider a multi-task experience replay buffer to provide data for the multi-task learning of the DRL algorithm, and we combine meta-learning to develop a multi-task reinforcement learning update method to ensure the generalization capability of reinforcement learning. Compared with the state-of-the-art algorithms, namely the deep deterministic policy gradient (DDPG) and twin delayed deep deterministic policy gradient (TD3), experimental results show that the Meta-TD3 algorithm has achieved a great improvement in terms of both convergence value and convergence rate. In a UAV target tracking problem, Meta-TD3 only requires a few steps to train to enable a UAV to adapt quickly to a new target movement mode more and maintain a better tracking effectiveness.
引用
收藏
页码:1 / 20
页数:20
相关论文
共 50 条
  • [41] A UAV Path Planning Method Based on Deep Reinforcement Learning
    Li, Yibing
    Zhang, Sitong
    Ye, Fang
    Jiang, Tao
    Li, Yingsong
    2020 IEEE USNC-CNC-URSI NORTH AMERICAN RADIO SCIENCE MEETING (JOINT WITH AP-S SYMPOSIUM), 2020, : 93 - 94
  • [42] Autonomous obstacle avoidance of UAV based on deep reinforcement learning
    Yang, Songyue
    Yu, Guizhen
    Meng, Zhijun
    Wang, Zhangyu
    Li, Han
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2022, 42 (04) : 3323 - 3335
  • [43] Task Assignment of UAV Swarms Based on Deep Reinforcement Learning
    Liu, Bo
    Wang, Shulei
    Li, Qinghua
    Zhao, Xinyang
    Pan, Yunqing
    Wang, Changhong
    DRONES, 2023, 7 (05)
  • [44] Tracking Context Changes through Meta-Learning
    Gerhard Widmer
    Machine Learning, 1997, 27 : 259 - 286
  • [45] UAV collection methods for the farmland nodes data based on deep reinforcement learning
    Jie H.
    Yali Z.
    Tuan W.
    Mengcheng W.
    Yubin L.
    Zhixun Z.
    Nongye Gongcheng Xuebao/Transactions of the Chinese Society of Agricultural Engineering, 2022, 38 (22): : 41 - 51
  • [46] Learning to Guide: Guidance Law Based on Deep Meta-Learning and Model Predictive Path Integral Control
    Liang, Chen
    Wang, Weihong
    Liu, Zhenghua
    Lai, Chao
    Zhou, Benchun
    IEEE ACCESS, 2019, 7 : 47353 - 47365
  • [47] Recurrent adaptive maneuvering target tracking algorithm based on online learning
    Xiong W.
    Zhu H.
    Cui Y.
    Hangkong Xuebao/Acta Aeronautica et Astronautica Sinica, 2022, 43 (05):
  • [48] A Deep Learning Approach for Speech Emotion Recognition Optimization Using Meta-Learning
    Ottoni, Lara Toledo Cordeiro
    Ottoni, Andre Luiz Carvalho
    Cerqueira, Jes de Jesus Fiais
    ELECTRONICS, 2023, 12 (23)
  • [49] Tracking context changes through meta-learning
    Widmer, G
    MACHINE LEARNING, 1997, 27 (03) : 259 - 286
  • [50] A Real-Time Tracking Algorithm for Multi-Target UAV Based on Deep Learning
    Hong, Tao
    Liang, Hongming
    Yang, Qiye
    Fang, Linquan
    Kadoch, Michel
    Cheriet, Mohamed
    REMOTE SENSING, 2023, 15 (01)