UAV Maneuvering Target Tracking in Uncertain Environments Based on Deep Reinforcement Learning and Meta-Learning

被引：56

作者：

Li, Bo ^{[1
]}

Gan, Zhigang ^{[1
]}

Chen, Daqing ^{[2
]}

Sergey Aleksandrovich, Dyachenko ^{[3
]}

机构：

[1] Northwestern Polytech Univ, Sch Elect & Informat, Xian 710072, Peoples R China

[2] London South Bank Univ, Sch Engn, London SE1 0AA, England

[3] Moscow Inst Aviat Technol, Sch Robot & Intelligent Syst, Moscow 125993, Russia

来源：

REMOTE SENSING | 2020年 / 12卷 / 22期

关键词：

UAV; maneuvering target tracking; deep reinforcement learning; meta-learning; multi-tasks; SYSTEM;

D O I：

10.3390/rs12223789

中图分类号：

X [环境科学、安全科学];

学科分类号：

08 ; 0830 ;

摘要：

This paper combines deep reinforcement learning (DRL) with meta-learning and proposes a novel approach, named meta twin delayed deep deterministic policy gradient (Meta-TD3), to realize the control of unmanned aerial vehicle (UAV), allowing a UAV to quickly track a target in an environment where the motion of a target is uncertain. This approach can be applied to a variety of scenarios, such as wildlife protection, emergency aid, and remote sensing. We consider a multi-task experience replay buffer to provide data for the multi-task learning of the DRL algorithm, and we combine meta-learning to develop a multi-task reinforcement learning update method to ensure the generalization capability of reinforcement learning. Compared with the state-of-the-art algorithms, namely the deep deterministic policy gradient (DDPG) and twin delayed deep deterministic policy gradient (TD3), experimental results show that the Meta-TD3 algorithm has achieved a great improvement in terms of both convergence value and convergence rate. In a UAV target tracking problem, Meta-TD3 only requires a few steps to train to enable a UAV to adapt quickly to a new target movement mode more and maintain a better tracking effectiveness.

引用

页码：1 / 20

页数：20

共 50 条

[1] Autonomous Obstacle Avoidance and Target Tracking of UAV Based on Deep Reinforcement Learning
Guoqiang Xu
Weilai Jiang
Zhaolei Wang
Yaonan Wang
Journal of Intelligent & Robotic Systems, 2022, 104
[2] Autonomous Obstacle Avoidance and Target Tracking of UAV Based on Deep Reinforcement Learning
Xu, Guoqiang
Jiang, Weilai
Wang, Zhaolei
Wang, Yaonan
JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2022, 104 (04)
[3] Maneuvering target tracking of UAV based on MN-DDPG and transfer learning
Li, Bo
Yang, Zhi-peng
Chen, Da-qing
Liang, Shi-yang
Ma, Hao
DEFENCE TECHNOLOGY, 2021, 17 (02) : 457 - 466
[4] Path Planning for UAV Ground Target Tracking via Deep Reinforcement Learning
Li, Bohao
Wu, Yunjie
IEEE ACCESS, 2020, 8 (29064-29074) : 29064 - 29074
[5] Robust Motion Control for UAV in Dynamic Uncertain Environments Using Deep Reinforcement Learning
Wan, Kaifang
Gao, Xiaoguang
Hu, Zijian
Wu, Gaofeng
REMOTE SENSING, 2020, 12 (04)
[6] Intercept Strategy for Maneuvering Target Based on Deep Reinforcement Learning
Wang, Xu
Cai, Yuanli
Fang, Yizhong
Deng, Yifan
2021 PROCEEDINGS OF THE 40TH CHINESE CONTROL CONFERENCE (CCC), 2021, : 3547 - 3552
[7] Meta-learning in Reinforcement Learning
Schweighofer, N
Doya, K
NEURAL NETWORKS, 2003, 16 (01) : 5 - 9
[8] A Two-Stage Target Search and Tracking Method for UAV Based on Deep Reinforcement Learning
Liu, Mei
Wei, Jingbo
Liu, Kun
DRONES, 2024, 8 (10)
[9] Model-Based Deep Learning for Distributed Maneuvering Target Tracking
Yang, Feng
Gao, Tongyang
Zheng, Litao
Liao, Pan
INTELLIGENT ROBOTICS AND APPLICATIONS, ICIRA 2024, PT X, 2025, 15210 : 209 - 222
[10] Learn to chill - Intelligent Chiller Scheduling using Meta-learning and Deep Reinforcement Learning
Manoharan, Praveen
Venkat, Malini Pooni
Nagarathinam, Srinarayana
Vasan, Arunchandar
BUILDSYS'21: PROCEEDINGS OF THE 2021 ACM INTERNATIONAL CONFERENCE ON SYSTEMS FOR ENERGY-EFFICIENT BUILT ENVIRONMENTS, 2021, : 21 - 30

← 1 2 3 4 5 →