Autonomous Obstacle Avoidance and Target Tracking of UAV Based on Deep Reinforcement Learning

被引:23
|
作者
Xu, Guoqiang [1 ]
Jiang, Weilai [1 ]
Wang, Zhaolei [2 ]
Wang, Yaonan [1 ]
机构
[1] Hunan Univ, Coll Elect & Informat Engn, Changsha 410082, Hunan, Peoples R China
[2] Beijing Aerosp Automat Control Inst, Sci & Technol Aerosp Intelligent Control Lab, Beijing 100854, Peoples R China
基金
美国国家科学基金会;
关键词
Unmanned Aerial Vehicle (UAV); Autonomous obstacle avoidance; Target tracking; Deep reinforcement learning; Continuous control;
D O I
10.1007/s10846-022-01601-8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
When using deep reinforcement learning algorithm to complete Unmanned Aerial Vehicle (UAV) autonomous obstacle avoidance and target tracking tasks, there are often some problems such as slow convergence speed and low success rate. Therefore, this paper proposes a new deep reinforcement learning algorithm, namely Multiple Pools Twin Delay Deep Deterministic Policy Gradient (MPTD3) algorithm. Firstly, the state space and action space of UAV are established as continuous models, which is closer to engineering practice than discrete models. Then, multiple experience pools mechanism and gradient truncation are designed to improve the convergence of the algorithm. Furthermore, the generalization ability of the algorithm is obtained by giving UAV environmental perception ability. Experimental results verify the effectiveness of the proposed method.
引用
收藏
页数:13
相关论文
共 50 条
  • [1] Autonomous Obstacle Avoidance and Target Tracking of UAV Based on Deep Reinforcement Learning
    Guoqiang Xu
    Weilai Jiang
    Zhaolei Wang
    Yaonan Wang
    Journal of Intelligent & Robotic Systems, 2022, 104
  • [2] Autonomous Obstacle Avoidance and Target Tracking of UAV Based on Meta-Reinforcement Learning
    Jiang W.
    Wu J.
    Wang Y.
    Hunan Daxue Xuebao/Journal of Hunan University Natural Sciences, 2022, 49 (06): : 101 - 109
  • [3] Autonomous obstacle avoidance of UAV based on deep reinforcement learning
    Yang, Songyue
    Yu, Guizhen
    Meng, Zhijun
    Wang, Zhangyu
    Li, Han
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2022, 42 (04) : 3323 - 3335
  • [4] Autonomous obstacle avoidance and target tracking of UAV: Transformer for observation sequence in reinforcement learning
    Jiang, Weilai
    Cai, Tianqing
    Xu, Guoqiang
    Wang, Yaonan
    KNOWLEDGE-BASED SYSTEMS, 2024, 290
  • [5] A UAV Indoor Obstacle Avoidance System Based on Deep Reinforcement Learning
    Lo, Chun-Huang
    Lee, Chung-Nan
    2023 ASIA PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE, APSIPA ASC, 2023, : 2137 - 2143
  • [6] UAV autonomous obstacle avoidance via causal reinforcement learning
    Sun, Tao
    Gu, Jiaojiao
    Mou, Junjie
    DISPLAYS, 2025, 87
  • [7] Real-time obstacle avoidance with deep reinforcement learning * Three-Dimensional Autonomous Obstacle Avoidance for UAV
    Yang, Songyue
    Meng, Zhijun
    Chen, Xuzhi
    Xie, Ronglei
    PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON ROBOTICS, INTELLIGENT CONTROL AND ARTIFICIAL INTELLIGENCE (RICAI 2019), 2019, : 324 - 329
  • [8] Virtual Tube Visual Obstacle Avoidance for UAV Based on Deep Reinforcement Learning
    Zhao, Jing
    Pei, Zi-Nan
    Jiang, Bin
    Lu, Ning-Yun
    Zhao, Fei
    Chen, Shu-Feng
    Zidonghua Xuebao/Acta Automatica Sinica, 2024, 50 (11): : 2245 - 2258
  • [9] Robot target tracking control considering obstacle avoidance based on combination of deep reinforcement learning and PID
    Liu, Yong
    Jiang, Xiao
    Li, Xiang
    Sun, Boxi
    Qian, Sen
    Wu, Yihao
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART I-JOURNAL OF SYSTEMS AND CONTROL ENGINEERING, 2025, 239 (03)
  • [10] UAV Autonomous Tracking and Landing Based on Deep Reinforcement Learning Strategy
    Xie, Jingyi
    Peng, Xiaodong
    Wang, Haijiao
    Niu, Wenlong
    Zheng, Xiao
    SENSORS, 2020, 20 (19) : 1 - 17