SAC-PER: A Navigation Method Based on Deep Reinforcement Learning Under Uncertain Environments

被引:0
作者
Wang, Xinmeng [1 ]
Wang, Lisong [1 ]
She, Shifan [1 ]
Hu, Lingling [1 ]
机构
[1] Nanjing Univ Aeronaut & Astronaut, Coll Comp Sci & Technol, Nanjing 211106, Peoples R China
来源
WEB AND BIG DATA, PT II, APWEB-WAIM 2022 | 2023年 / 13422卷
关键词
Uncertain environments; Multi-sensor data; POMDP model; Deep reinforcement learning; Navigation and obstacle avoidance;
D O I
10.1007/978-3-031-25198-6_38
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In real scenarios, robots usually face dynamically changing environments, and traditional navigation methods require a predefined high-precision map, which limits the achievability of navigation in dynamic and uncertain environments. To solve this problem, this paper uses a Partially Observable Markov Decision Process (POMDP) to model the uncertain navigation planning problem and proposes a soft actor-critic with prioritized experience replay (SAC-PER) method based on multi-sensor perception to achieve efficient navigation. The method uses multi-source information fusion for environment perception and Deep Reinforcement Learning (DRL) for continuous control of navigation. The multi-source SAC-PER method can effectively avoid obstacles and enable robots to perform navigation tasks autonomously in uncertain environments without building high-precision maps. We evaluate the proposed method using Robot Operating System (ROS) and Gazebo simulator. The results demonstrate that the SAC-PER method has high efficiency and robustness in different environments, and shows good generalization ability.
引用
收藏
页码:501 / 510
页数:10
相关论文
共 12 条
  • [1] Chen J., 2018, Comput. Sci., V45, P85
  • [2] Duong T, 2020, IEEE INT CONF ROBOT, P9666, DOI [10.1109/icra40945.2020.9197412, 10.1109/ICRA40945.2020.9197412]
  • [3] Gavrilov AV, 2011, LECT NOTES COMPUT SC, V6838, P210, DOI 10.1007/978-3-642-24728-6_28
  • [4] A survey of robotic motion planning in dynamic environments
    Mohanan, M. G.
    Salgoankar, Ambuja
    [J]. ROBOTICS AND AUTONOMOUS SYSTEMS, 2018, 100 : 171 - 185
  • [5] A novel search and survey technique for unmanned aerial systems in detecting and estimating the area for wildfires
    Sarkar, Mrinmoy
    Yan, Xuyang
    Erol, Berat A.
    Raptis, Ioannis
    Homaifar, Abdollah
    [J]. ROBOTICS AND AUTONOMOUS SYSTEMS, 2021, 145
  • [6] Shade R, 2011, IEEE INT CONF ROBOT
  • [7] Tai L, 2017, IEEE INT C INT ROBOT, P31
  • [8] Probabilistic robotics
    Thrun, S
    [J]. COMMUNICATIONS OF THE ACM, 2002, 45 (03) : 52 - 57
  • [9] Mapless Navigation with Deep Reinforcement Learning based on The Convolutional Proximal Policy Optimization Network
    Toan, Nguyen Duc
    Woo, Kim Gon
    [J]. 2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP 2021), 2021, : 298 - 301
  • [10] Yan F, 2012, 2012 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS (ROBIO 2012)