Simulation and Transfer of Reinforcement Learning Algorithms for Autonomous Obstacle Avoidance

Cited by: 0

Authors:

Lenk, Max [1]

Hilsendegen, Paula [2]

Mueller, Silvan Michael [2]

Rettig, Oliver [2]

Strand, Marcus [2]

Affiliations:

[1] SAP SE, Dietmar Hopp Allee 16, D-69190 Walldorf, Germany

[2] Duale Hsch Baden Wurttemberg, Dept Comp Sci, D-76133 Karlsruhe, Germany

Source:

INTELLIGENT AUTONOMOUS SYSTEMS 15, IAS-15 | 2019, Vol. 867

Keywords:

Reinforcement learning; Machine learning; Obstacle avoidance; Collision avoidance; Simulation

DOI:

10.1007/978-3-030-01370-7_32

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology]

Subject classification code:

0812

Abstract:

The explicit programming of obstacle avoidance by an autonomous robot can be a computationally expensive undertaking. The application of reinforcement learning algorithms promises a reduction of programming effort. However, these algorithms build on iterative training processes and are therefore time-consuming. To overcome this drawback, we propose to displace the training process to abstract simulation scenarios. In this study, we trained four different reinforcement learning algorithms (Q-Learning, Deep-Q-Learning, Deep Deterministic Policy Gradient and Asynchronous Advantage-Actor-Critic) in different abstract simulation scenarios and transferred the learning results to an autonomous robot. Except for the Asynchronous Advantage-Actor-Critic, we achieved good obstacle avoidance during the simulation. Without further real-world training, the policies learned by Q-Learning and Deep-Q-Learning immediately achieved obstacle avoidance when transferred to an autonomous robot.
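
As a rough illustration of what training Q-Learning in an abstract simulation scenario can look like, the following is a minimal sketch of tabular Q-Learning on a small grid with obstacle cells. It is not the authors' simulation or robot setup: the 4x4 grid, the obstacle positions, the reward values, the hyperparameters, and the names step, train and greedy_path are all hypothetical choices made for this example.

```python
import random

# Hypothetical abstract scenario (not from the paper): a 4x4 grid world.
SIZE = 4
START, GOAL = (0, 0), (3, 3)
OBSTACLES = {(1, 1), (2, 1), (1, 3)}
ACTIONS = [(-1, 0), (1, 0), (0, -1), (0, 1)]  # up, down, left, right


def step(state, action):
    """Apply one move; return (next_state, reward, done)."""
    r, c = state[0] + action[0], state[1] + action[1]
    if not (0 <= r < SIZE and 0 <= c < SIZE):
        return state, -1.0, False   # blocked by the boundary: stay in place
    nxt = (r, c)
    if nxt in OBSTACLES:
        return nxt, -10.0, True     # collision ends the episode (assumed penalty)
    if nxt == GOAL:
        return nxt, 10.0, True      # goal reached (assumed reward)
    return nxt, -1.0, False         # small cost per step


def train(episodes=2000, alpha=0.5, gamma=0.9, epsilon=0.2, seed=0):
    """Tabular Q-Learning with an epsilon-greedy behaviour policy."""
    rng = random.Random(seed)
    q = {((r, c), a): 0.0
         for r in range(SIZE) for c in range(SIZE) for a in range(4)}
    for _ in range(episodes):
        state = START
        for _ in range(50):  # cap on episode length
            if rng.random() < epsilon:
                a = rng.randrange(4)
            else:
                a = max(range(4), key=lambda i: q[(state, i)])
            nxt, reward, done = step(state, ACTIONS[a])
            best_next = 0.0 if done else max(q[(nxt, i)] for i in range(4))
            # Q-Learning update: move Q(s, a) toward r + gamma * max_a' Q(s', a')
            q[(state, a)] += alpha * (reward + gamma * best_next - q[(state, a)])
            state = nxt
            if done:
                break
    return q


def greedy_path(q, max_steps=20):
    """Follow the learned greedy policy from START without exploration."""
    state, path = START, [START]
    for _ in range(max_steps):
        a = max(range(4), key=lambda i: q[(state, i)])
        state, _, done = step(state, ACTIONS[a])
        path.append(state)
        if done:
            break
    return path


if __name__ == "__main__":
    print(greedy_path(train()))
```

The learned table is queried greedily after training, which loosely mirrors the idea of reusing a policy learned in simulation without further training; actual transfer to a robot would additionally require mapping sensor readings to states, which this sketch does not attempt.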

Pages: 401-413

Page count: 13

50 entries in total

[21] Q-Learning for Autonomous Mobile Robot Obstacle Avoidance
Ribeiro, Tiago
Goncalves, Fernando
Garcia, Ines
Lopes, Gil
Fernando Ribeiro, A.
[J]. 2019 19TH IEEE INTERNATIONAL CONFERENCE ON AUTONOMOUS ROBOT SYSTEMS AND COMPETITIONS (ICARSC 2019), 2019 : 243 - 249
[22] Optimal reinforcement learning and probabilistic-risk-based path planning and following of autonomous vehicles with obstacle avoidance
Taghavifar, Hamid
Taghavifar, Leyla
Hu, Chuan
Wei, Chongfeng
Qin, Yechen
[J]. PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART D-JOURNAL OF AUTOMOBILE ENGINEERING, 2024, 238 (06) : 1427 - 1439
[23] Deep reinforcement learning-based collision avoidance for an autonomous ship
Chun, Do-Hyun
Roh, Myung-Il
Lee, Hye-Won
Ha, Jisang
Yu, Donghun
[J]. OCEAN ENGINEERING, 2021, 234
[24] Agile DQN: adaptive deep recurrent attention reinforcement learning for autonomous UAV obstacle avoidance
Fadi AlMahamid
Katarina Grolinger
[J]. Scientific Reports, 15 (1)
[26] Deep-reinforcement learning-based route planning with obstacle avoidance for autonomous vessels
Saga, Ryosuke
Kozono, Rinto
Tsurumi, Yutaro
Nihei, Yasunori
[J]. ARTIFICIAL LIFE AND ROBOTICS, 2024, 29 (01) : 136 - 144
[27] A fuzzy controller with supervised learning assisted reinforcement learning algorithm for obstacle avoidance
Ye, C
Yung, NHC
Wang, DW
[J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2003, 33 (01) : 17 - 27
[28] A Vision-Based Bio-Inspired Reinforcement Learning Algorithms for Manipulator Obstacle Avoidance
Singh, Abhilasha
Shakeel, Mohamed
Kalaichelvi, V
Karthikeyan, R.
[J]. ELECTRONICS, 2022, 11 (21)
[29] Quadrotor Path Following and Reactive Obstacle Avoidance with Deep Reinforcement Learning
Rubi, Bartomeu
Morcego, Bernardo
Perez, Ramon
[J]. JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2021, 103 (04)
[30] The Algorithm for UAV Obstacle Avoidance and Route Planning Based on Reinforcement Learning
Liu, Jiantong
Wang, Zhengjie
Zhang, Zhide
[J]. PROCEEDINGS OF THE 11TH INTERNATIONAL CONFERENCE ON MODELLING, IDENTIFICATION AND CONTROL (ICMIC2019), 2020, 582 : 747 - 754