SAC-PER: A Navigation Method Based on Deep Reinforcement Learning Under Uncertain Environments

Cited by: 0

Authors:

Wang, Xinmeng [1]

Wang, Lisong [1]

She, Shifan [1]

Hu, Lingling [1]

Affiliations:

[1] Nanjing University of Aeronautics and Astronautics, College of Computer Science and Technology, Nanjing 211106, People's Republic of China

Source:

WEB AND BIG DATA, PT II, APWEB-WAIM 2022 | 2023 / Vol. 13422

Keywords:

Uncertain environments; Multi-sensor data; POMDP model; Deep reinforcement learning; Navigation and obstacle avoidance;

DOI:

10.1007/978-3-031-25198-6_38

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

In real scenarios, robots usually face dynamically changing environments, whereas traditional navigation methods require a predefined high-precision map, which limits their applicability in dynamic and uncertain environments. To address this problem, this paper models uncertain navigation planning as a Partially Observable Markov Decision Process (POMDP) and proposes a soft actor-critic with prioritized experience replay (SAC-PER) method based on multi-sensor perception to achieve efficient navigation. The method uses multi-source information fusion for environment perception and Deep Reinforcement Learning (DRL) for continuous navigation control. The multi-source SAC-PER method can effectively avoid obstacles and enables robots to perform navigation tasks autonomously in uncertain environments without building high-precision maps. We evaluate the proposed method using the Robot Operating System (ROS) and the Gazebo simulator. The results demonstrate that SAC-PER is efficient and robust across different environments and generalizes well.
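The abstract does not describe how the replay mechanism is implemented. As a rough illustration of the prioritized experience replay idea that SAC-PER combines with soft actor-critic, the sketch below assumes the commonly used proportional variant, in which a transition is sampled with probability proportional to its priority raised to the power alpha. The class name, method names, and default hyperparameters are hypothetical and are not taken from the paper.

```python
import random


class PrioritizedReplayBuffer:
    """Proportional prioritized replay (illustrative sketch, hypothetical names)."""

    def __init__(self, capacity, alpha=0.6, eps=1e-6):
        self.capacity = capacity
        self.alpha = alpha      # how strongly priorities skew sampling (0 = uniform)
        self.eps = eps          # keeps every priority strictly positive
        self.data = []
        self.priorities = []
        self.pos = 0            # next slot to overwrite once the buffer is full

    def add(self, transition):
        # New transitions receive the current maximum priority,
        # which makes them likely to be replayed soon.
        p = max(self.priorities, default=1.0)
        if len(self.data) < self.capacity:
            self.data.append(transition)
            self.priorities.append(p)
        else:
            self.data[self.pos] = transition
            self.priorities[self.pos] = p
        self.pos = (self.pos + 1) % self.capacity

    def sample(self, batch_size, beta=0.4):
        # Sampling probability is proportional to priority ** alpha.
        scaled = [p ** self.alpha for p in self.priorities]
        total = sum(scaled)
        probs = [s / total for s in scaled]
        idxs = random.choices(range(len(self.data)), weights=probs, k=batch_size)
        # Importance-sampling weights offset the non-uniform sampling;
        # they are normalised by the batch maximum so they lie in (0, 1].
        n = len(self.data)
        weights = [(n * probs[i]) ** (-beta) for i in idxs]
        max_w = max(weights)
        weights = [w / max_w for w in weights]
        batch = [self.data[i] for i in idxs]
        return batch, idxs, weights

    def update_priorities(self, idxs, td_errors):
        # Assumption: priority is the absolute TD error plus a small constant.
        for i, err in zip(idxs, td_errors):
            self.priorities[i] = abs(err) + self.eps
```

In a SAC-style training loop, the critic loss for each sampled transition would typically be multiplied by the returned weight, and `update_priorities` would be called with the new TD errors after each gradient step. Whether the paper follows exactly this scheme is not stated in the abstract.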


Pages: 501-510

Page count: 10

