UAV Maneuvering Target Tracking in Uncertain Environments Based on Deep Reinforcement Learning and Meta-Learning

被引：56

作者：

Li, Bo ^{[1
]}

Gan, Zhigang ^{[1
]}

Chen, Daqing ^{[2
]}

Sergey Aleksandrovich, Dyachenko ^{[3
]}

机构：

[1] Northwestern Polytech Univ, Sch Elect & Informat, Xian 710072, Peoples R China

[2] London South Bank Univ, Sch Engn, London SE1 0AA, England

[3] Moscow Inst Aviat Technol, Sch Robot & Intelligent Syst, Moscow 125993, Russia

来源：

REMOTE SENSING | 2020年 / 12卷 / 22期

关键词：

UAV; maneuvering target tracking; deep reinforcement learning; meta-learning; multi-tasks; SYSTEM;

D O I：

10.3390/rs12223789

中图分类号：

X [环境科学、安全科学];

学科分类号：

08 ; 0830 ;

摘要：

This paper combines deep reinforcement learning (DRL) with meta-learning and proposes a novel approach, named meta twin delayed deep deterministic policy gradient (Meta-TD3), to realize the control of unmanned aerial vehicle (UAV), allowing a UAV to quickly track a target in an environment where the motion of a target is uncertain. This approach can be applied to a variety of scenarios, such as wildlife protection, emergency aid, and remote sensing. We consider a multi-task experience replay buffer to provide data for the multi-task learning of the DRL algorithm, and we combine meta-learning to develop a multi-task reinforcement learning update method to ensure the generalization capability of reinforcement learning. Compared with the state-of-the-art algorithms, namely the deep deterministic policy gradient (DDPG) and twin delayed deep deterministic policy gradient (TD3), experimental results show that the Meta-TD3 algorithm has achieved a great improvement in terms of both convergence value and convergence rate. In a UAV target tracking problem, Meta-TD3 only requires a few steps to train to enable a UAV to adapt quickly to a new target movement mode more and maintain a better tracking effectiveness.

引用

页码：1 / 20

页数：20

共 50 条

[41] A UAV Path Planning Method Based on Deep Reinforcement Learning
Li, Yibing
Zhang, Sitong
Ye, Fang
Jiang, Tao
Li, Yingsong
2020 IEEE USNC-CNC-URSI NORTH AMERICAN RADIO SCIENCE MEETING (JOINT WITH AP-S SYMPOSIUM), 2020, : 93 - 94
[42] Autonomous obstacle avoidance of UAV based on deep reinforcement learning
Yang, Songyue
Yu, Guizhen
Meng, Zhijun
Wang, Zhangyu
Li, Han
JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2022, 42 (04) : 3323 - 3335
[43] Task Assignment of UAV Swarms Based on Deep Reinforcement Learning
Liu, Bo
Wang, Shulei
Li, Qinghua
Zhao, Xinyang
Pan, Yunqing
Wang, Changhong
DRONES, 2023, 7 (05)
[44] Tracking Context Changes through Meta-Learning
Gerhard Widmer
Machine Learning, 1997, 27 : 259 - 286
[45] UAV collection methods for the farmland nodes data based on deep reinforcement learning
Jie H.
Yali Z.
Tuan W.
Mengcheng W.
Yubin L.
Zhixun Z.
Nongye Gongcheng Xuebao/Transactions of the Chinese Society of Agricultural Engineering, 2022, 38 (22): : 41 - 51
[46] Learning to Guide: Guidance Law Based on Deep Meta-Learning and Model Predictive Path Integral Control
Liang, Chen
Wang, Weihong
Liu, Zhenghua
Lai, Chao
Zhou, Benchun
IEEE ACCESS, 2019, 7 : 47353 - 47365
[47] Recurrent adaptive maneuvering target tracking algorithm based on online learning
Xiong W.
Zhu H.
Cui Y.
Hangkong Xuebao/Acta Aeronautica et Astronautica Sinica, 2022, 43 (05):
[48] A Deep Learning Approach for Speech Emotion Recognition Optimization Using Meta-Learning
Ottoni, Lara Toledo Cordeiro
Ottoni, Andre Luiz Carvalho
Cerqueira, Jes de Jesus Fiais
ELECTRONICS, 2023, 12 (23)
[49] Tracking context changes through meta-learning
Widmer, G
MACHINE LEARNING, 1997, 27 (03) : 259 - 286
[50] A Real-Time Tracking Algorithm for Multi-Target UAV Based on Deep Learning
Hong, Tao
Liang, Hongming
Yang, Qiye
Fang, Linquan
Kadoch, Michel
Cheriet, Mohamed
REMOTE SENSING, 2023, 15 (01)

← 1 2 3 4 5 →