Simulation and Transfer of Reinforcement Learning Algorithms for Autonomous Obstacle Avoidance

被引:0
作者
Lenk, Max [1 ]
Hilsendegen, Paula [2 ]
Mueller, Silvan Michael [2 ]
Rettig, Oliver [2 ]
Strand, Marcus [2 ]
机构
[1] SAP SE, Dietmar Hopp Allee 16, D-69190 Walldorf, Germany
[2] Duale Hsch Baden Wurttemberg, Dept Comp Sci, D-76133 Karlsruhe, Germany
来源
INTELLIGENT AUTONOMOUS SYSTEMS 15, IAS-15 | 2019年 / 867卷
关键词
Reinforcement learning; Machine learning; Obstacle avoidance; Collision avoidance; Simulation;
D O I
10.1007/978-3-030-01370-7_32
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The explicit programming of obstacle avoidance by an autonomous robot can be a computationally expensive undertaking. The application of reinforcement learning algorithms promises a reduction of programming effort. However, these algorithms build on iterative training processes and therefore are time-consuming. In order to overcome this drawback we propose to displace the training process to abstract simulation scenarios. In this study we trained four different reinforcement algorithms (Q-Learning, Deep-Q-Learning, Deep Deterministic Policy Gradient and A synchronous Advantage-Actor-Critic) in different abstract simulation scenarios and transferred the learning results to an autonomous robot. Except for the Asynchronous Advantage-Actor-Critic we achieved good obstacle avoidance during the simulation. Without further real-world training the policies learned by Q-Learning and Deep-Q-Learning achieved immediately obstacle avoidance when transferred to an autonomous robot.
引用
收藏
页码:401 / 413
页数:13
相关论文
共 50 条
  • [21] Q-Learning for Autonomous Mobile Robot Obstacle Avoidance
    Ribeiro, Tiago
    Goncalves, Fernando
    Garcia, Ines
    Lopes, Gil
    Fernando Ribeiro, A.
    [J]. 2019 19TH IEEE INTERNATIONAL CONFERENCE ON AUTONOMOUS ROBOT SYSTEMS AND COMPETITIONS (ICARSC 2019), 2019, : 243 - 249
  • [22] Optimal reinforcement learning and probabilistic-risk-based path planning and following of autonomous vehicles with obstacle avoidance
    Taghavifar, Hamid
    Taghavifar, Leyla
    Hu, Chuan
    Wei, Chongfeng
    Qin, Yechen
    [J]. PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART D-JOURNAL OF AUTOMOBILE ENGINEERING, 2024, 238 (06) : 1427 - 1439
  • [23] Deep reinforcement learning-based collision avoidance for an autonomous ship
    Chun, Do-Hyun
    Roh, Myung-Il
    Lee, Hye-Won
    Ha, Jisang
    Yu, Donghun
    [J]. OCEAN ENGINEERING, 2021, 234
  • [24] Agile DQN: adaptive deep recurrent attention reinforcement learning for autonomous UAV obstacle avoidance
    Fadi AlMahamid
    Katarina Grolinger
    [J]. Scientific Reports, 15 (1)
  • [25] Deep-reinforcement learning-based route planning with obstacle avoidance for autonomous vessels
    Ryosuke Saga
    Rinto Kozono
    Yutaro Tsurumi
    Yasunori Nihei
    [J]. Artificial Life and Robotics, 2024, 29 : 136 - 144
  • [26] Deep-reinforcement learning-based route planning with obstacle avoidance for autonomous vessels
    Saga, Ryosuke
    Kozono, Rinto
    Tsurumi, Yutaro
    Nihei, Yasunori
    [J]. ARTIFICIAL LIFE AND ROBOTICS, 2024, 29 (01) : 136 - 144
  • [27] A fuzzy controller with supervised learning assisted reinforcement learning algorithm for obstacle avoidance
    Ye, C
    Yung, NHC
    Wang, DW
    [J]. IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2003, 33 (01): : 17 - 27
  • [28] A Vision-Based Bio-Inspired Reinforcement Learning Algorithms for Manipulator Obstacle Avoidance
    Singh, Abhilasha
    Shakeel, Mohamed
    Kalaichelvi, V
    Karthikeyan, R.
    [J]. ELECTRONICS, 2022, 11 (21)
  • [29] Quadrotor Path Following and Reactive Obstacle Avoidance with Deep Reinforcement Learning
    Rubi, Bartomeu
    Morcego, Bernardo
    Perez, Ramon
    [J]. JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2021, 103 (04)
  • [30] The Algorithm for UAV Obstacle Avoidance and Route Planning Based on Reinforcement Learning
    Liu, Jiantong
    Wang, Zhengjie
    Zhang, Zhide
    [J]. PROCEEDINGS OF THE 11TH INTERNATIONAL CONFERENCE ON MODELLING, IDENTIFICATION AND CONTROL (ICMIC2019), 2020, 582 : 747 - 754