UAV autonomous obstacle avoidance via causal reinforcement learning

Cited by: 1
Authors
Sun, Tao [1]
Gu, Jiaojiao [1]
Mou, Junjie [1]
Affiliations
[1] Naval Aeronaut Univ, Yantai 264001, Peoples R China
Keywords
Unmanned aerial vehicles (UAVs); Obstacle avoidance; Navigation; Causal inference; Reinforcement learning; Scale estimation
DOI
10.1016/j.displa.2025.102966
Chinese Library Classification number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
The role of unmanned aerial vehicles (UAVs) in everyday life is becoming increasingly important, and there is a growing demand for UAVs to autonomously perform obstacle avoidance and navigation tasks. Traditional UAV navigation methods typically divide the navigation problem into three stages: perception, mapping, and path planning. However, this approach significantly increases processing delays, causing UAVs to lose their agility advantage. In this paper, we propose a causal reinforcement learning-based end-to-end navigation strategy that directly learns from data, bypassing the explicit mapping and planning steps, thus enhancing responsiveness. To address the issue where using a continuous action space prevents the agent from learning effective experiences from past actions, we introduce an Actor-Critic method with a fixed horizontal plane and a discretized action space. This approach enhances the efficiency of sampling from the experience replay buffer and stabilizes the optimization process, ultimately improving the success rate of the reinforcement learning algorithm in UAV obstacle avoidance and navigation tasks. Furthermore, to overcome the limited generalization capability of end-to-end methods, we incorporate causal inference into the reinforcement learning training process. This step mitigates overfitting caused by insufficient interaction with the environment during training, thereby increasing the success rate of UAVs in performing obstacle avoidance and navigation tasks in unfamiliar environments. We validate the effectiveness of causal inference in improving the generalization capability of the reinforcement learning algorithm by using convergence steps in the training environment and navigation success rates of random targets in the testing environment as quantitative metrics. The results demonstrate that causal inference can effectively reduce overfitting of the policy network to the training environment.
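The discretized action space on a fixed horizontal plane mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not give the actual action set, network, or time step, so everything below is an assumption made for illustration: the `ACTIONS` table, the `sample_action` and `step_pose` helpers, and the 0.1 s step are hypothetical and are not taken from the paper.

```python
import math
import random

# Hypothetical discrete action set: (forward speed in m/s, yaw rate in rad/s).
# Altitude is held constant, so no action has a vertical component.
ACTIONS = [
    (1.0, -0.5),  # move forward while turning one way
    (1.0, 0.0),   # move straight ahead
    (1.0, 0.5),   # move forward while turning the other way
    (0.0, 0.0),   # hover in place
]


def softmax(logits):
    """Convert actor logits into a probability distribution over actions."""
    peak = max(logits)  # subtract the maximum for numerical stability
    exps = [math.exp(value - peak) for value in logits]
    total = sum(exps)
    return [value / total for value in exps]


def sample_action(logits, rng):
    """Sample a discrete action index from the actor's output logits."""
    probs = softmax(logits)
    index = rng.choices(range(len(ACTIONS)), weights=probs, k=1)[0]
    return index, probs


def step_pose(pose, action, dt=0.1):
    """Advance an (x, y, z, yaw) pose by one step; z is never changed."""
    x, y, z, yaw = pose
    speed, yaw_rate = action
    yaw = yaw + yaw_rate * dt
    x = x + speed * math.cos(yaw) * dt
    y = y + speed * math.sin(yaw) * dt
    return (x, y, z, yaw)


if __name__ == "__main__":
    rng = random.Random(0)
    # Stand-in logits; in a real agent these would come from the actor network.
    index, probs = sample_action([0.1, 2.0, -1.0, 0.0], rng)
    print(index, step_pose((0.0, 0.0, 2.0, 0.0), ACTIONS[index]))
```

With a finite action table like this, the actor outputs one logit per action rather than a continuous command, which is one plausible reading of how discretization could simplify learning from replayed experience.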
Pages: 10