Autonomous Obstacle Avoidance and Target Tracking of UAV Based on Deep Reinforcement Learning

被引：23

作者：

Xu, Guoqiang ^{[1
]}

Jiang, Weilai ^{[1
]}

Wang, Zhaolei ^{[2
]}

Wang, Yaonan ^{[1
]}

机构：

[1] Hunan Univ, Coll Elect & Informat Engn, Changsha 410082, Hunan, Peoples R China

[2] Beijing Aerosp Automat Control Inst, Sci & Technol Aerosp Intelligent Control Lab, Beijing 100854, Peoples R China

来源：

JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS | 2022年 / 104卷 / 04期

基金：

美国国家科学基金会;

关键词：

Unmanned Aerial Vehicle (UAV); Autonomous obstacle avoidance; Target tracking; Deep reinforcement learning; Continuous control;

D O I：

10.1007/s10846-022-01601-8

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

When using deep reinforcement learning algorithm to complete Unmanned Aerial Vehicle (UAV) autonomous obstacle avoidance and target tracking tasks, there are often some problems such as slow convergence speed and low success rate. Therefore, this paper proposes a new deep reinforcement learning algorithm, namely Multiple Pools Twin Delay Deep Deterministic Policy Gradient (MPTD3) algorithm. Firstly, the state space and action space of UAV are established as continuous models, which is closer to engineering practice than discrete models. Then, multiple experience pools mechanism and gradient truncation are designed to improve the convergence of the algorithm. Furthermore, the generalization ability of the algorithm is obtained by giving UAV environmental perception ability. Experimental results verify the effectiveness of the proposed method.

引用

页数：13

共 50 条

[1] Autonomous Obstacle Avoidance and Target Tracking of UAV Based on Deep Reinforcement Learning
Guoqiang Xu
Weilai Jiang
Zhaolei Wang
Yaonan Wang
Journal of Intelligent & Robotic Systems, 2022, 104
[2] Autonomous Obstacle Avoidance and Target Tracking of UAV Based on Meta-Reinforcement Learning
Jiang W.
Wu J.
Wang Y.
Hunan Daxue Xuebao/Journal of Hunan University Natural Sciences, 2022, 49 (06): : 101 - 109
[3] Autonomous obstacle avoidance of UAV based on deep reinforcement learning
Yang, Songyue
Yu, Guizhen
Meng, Zhijun
Wang, Zhangyu
Li, Han
JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2022, 42 (04) : 3323 - 3335
[4] Autonomous obstacle avoidance and target tracking of UAV: Transformer for observation sequence in reinforcement learning
Jiang, Weilai
Cai, Tianqing
Xu, Guoqiang
Wang, Yaonan
KNOWLEDGE-BASED SYSTEMS, 2024, 290
[5] A UAV Indoor Obstacle Avoidance System Based on Deep Reinforcement Learning
Lo, Chun-Huang
Lee, Chung-Nan
2023 ASIA PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE, APSIPA ASC, 2023, : 2137 - 2143
[6] UAV autonomous obstacle avoidance via causal reinforcement learning
Sun, Tao
Gu, Jiaojiao
Mou, Junjie
DISPLAYS, 2025, 87
[7] Real-time obstacle avoidance with deep reinforcement learning * Three-Dimensional Autonomous Obstacle Avoidance for UAV
Yang, Songyue
Meng, Zhijun
Chen, Xuzhi
Xie, Ronglei
PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON ROBOTICS, INTELLIGENT CONTROL AND ARTIFICIAL INTELLIGENCE (RICAI 2019), 2019, : 324 - 329
[8] Virtual Tube Visual Obstacle Avoidance for UAV Based on Deep Reinforcement Learning
Zhao, Jing
Pei, Zi-Nan
Jiang, Bin
Lu, Ning-Yun
Zhao, Fei
Chen, Shu-Feng
Zidonghua Xuebao/Acta Automatica Sinica, 2024, 50 (11): : 2245 - 2258
[9] Robot target tracking control considering obstacle avoidance based on combination of deep reinforcement learning and PID
Liu, Yong
Jiang, Xiao
Li, Xiang
Sun, Boxi
Qian, Sen
Wu, Yihao
PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART I-JOURNAL OF SYSTEMS AND CONTROL ENGINEERING, 2025, 239 (03)
[10] UAV Autonomous Tracking and Landing Based on Deep Reinforcement Learning Strategy
Xie, Jingyi
Peng, Xiaodong
Wang, Haijiao
Niu, Wenlong
Zheng, Xiao
SENSORS, 2020, 20 (19) : 1 - 17

← 1 2 3 4 5 →