Safe Reinforcement Learning of Robot Trajectories in the Presence of Moving Obstacles

被引:0
|
作者
Kiemel, Jonas [1 ]
Righetti, Ludovic [2 ]
Kroeger, Torsten [1 ]
Asfour, Tamim [1 ]
机构
[1] Karlsruhe Inst Technol KIT, Inst Anthropomat & Robot IAR, D-76131 Karlsruhe, Germany
[2] NYU, Tandon Sch Engn, Brooklyn, NY 11201 USA
来源
IEEE ROBOTICS AND AUTOMATION LETTERS | 2024年 / 9卷 / 12期
关键词
Robots; Collision avoidance; Trajectory; Safety; Stochastic processes; Reinforcement learning; Training; Real-time systems; Quadrotors; Kinematics; Motion control; reinforcement learning; robot safety; collision avoidance;
D O I
10.1109/LRA.2024.3488402
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
In this paper, we present an approach for learning collision-free robot trajectories in the presence of moving obstacles. As a first step, we train a backup policy to generate evasive movements from arbitrary initial robot states using model-free reinforcement learning. When learning policies for other tasks, the backup policy can be used to estimate the potential risk of a collision and to offer an alternative action if the estimated risk is considered too high. No matter which action is selected, our action space ensures that the kinematic limits of the robot joints are not violated. We analyze and evaluate two different methods for estimating the risk of a collision. A physics simulation performed in the background is computationally expensive but provides the best results in deterministic environments. If a data-based risk estimator is used instead, the computational effort is significantly reduced, but an additional source of error is introduced. For evaluation, we successfully learn a reaching task and a basketball task while keeping the risk of collisions low. The results demonstrate the effectiveness of our approach for deterministic and stochastic environments, including a human-robot scenario and a ball environment, where no state can be considered permanently safe. By conducting experiments with a real robot, we show that our approach can generate safe trajectories in real time.
引用
收藏
页码:11353 / 11360
页数:8
相关论文
共 50 条
  • [1] An optimal and real-time solution to parameterized mobile robot trajectories in the presence of moving obstacles
    Yang, J
    Daoui, A
    Qu, ZH
    Wang, J
    Hull, RA
    2005 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), VOLS 1-4, 2005, : 4412 - 4417
  • [2] Safe Route Determination for First Responders in the Presence of Moving Obstacles
    Wang, Zhiyong
    Zlatanova, Sisi
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2020, 21 (03) : 1044 - 1053
  • [3] Overcoming Obstacles With a Reconfigurable Robot Using Reinforcement Learning
    Yehezkel, Liran
    Berman, Sigal
    Zarrouk, David
    IEEE ACCESS, 2020, 8 : 217541 - 217553
  • [4] Optimized Trajectory Planning for Mobile Robot in the Presence of Moving Obstacles
    Ko, Chun-Hsu
    Young, Kuu-Young
    Hsieh, Yi-Hung
    2015 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS (ICM), 2015, : 70 - 75
  • [5] Optimal trajectory planning of robot manipulators in the presence of moving obstacles
    Saramago, SFP
    Junior, VS
    MECHANISM AND MACHINE THEORY, 2000, 35 (08) : 1079 - 1094
  • [6] An Intelligent System for Parking Trailer in Presence of Fixed and Moving Obstacles using Reinforcement Learning and Fuzzy Logic
    Sharafi, Morteza
    Zare, A.
    Kamyad, A. V.
    JOURNAL OF MATHEMATICS AND COMPUTER SCIENCE-JMCS, 2011, 2 (01): : 141 - 149
  • [7] Robot learning from demonstrations: Emulation learning in environments with moving obstacles
    Ghalamzan, Amir M. E.
    Ragaglia, Matteo
    ROBOTICS AND AUTONOMOUS SYSTEMS, 2018, 101 : 45 - 56
  • [8] A Reactive Algorithm for Safe Navigation of a Wheeled Mobile Robot among Moving Obstacles
    Savkin, Andrey V.
    Wang, Chao
    2012 IEEE INTERNATIONAL CONFERENCE ON CONTROL APPLICATIONS (CCA), 2012, : 1567 - 1571
  • [9] AN INTEGRATED ARCHITECTURE FOR ROBOT MOTION PLANNING AND CONTROL IN THE PRESENCE OF OBSTACLES WITH UNKNOWN TRAJECTORIES
    SPENCE, R
    HUTCHINSON, S
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1995, 25 (01): : 100 - 110
  • [10] A new analytical solution to mobile robot trajectory generation in the presence of moving obstacles
    Qu, ZH
    Wang, J
    Plaisted, CE
    IEEE TRANSACTIONS ON ROBOTICS AND AUTOMATION, 2004, 20 (06): : 978 - 993