Provably Safe Deep Reinforcement Learning for Robotic Manipulation in Human Environments

被引:15
|
作者
Thumm, Jakob [1 ]
Althoff, Matthias [1 ]
机构
[1] Tech Univ Munich, Dept Informat, D-85748 Garching, Germany
关键词
D O I
10.1109/ICRA46639.2022.9811698
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Deep reinforcement learning (RL) has shown promising results in the motion planning of manipulators. However, no method guarantees the safety of highly dynamic obstacles, such as humans, in RL-based manipulator control. This lack of formal safety assurances prevents the application of RL for manipulators in real-world human environments. Therefore, we propose a shielding mechanism that ensures ISO-verified human safety while training and deploying RL algorithms on manipulators. We utilize a fast reachability analysis of humans and manipulators to guarantee that the manipulator comes to a complete stop before a human is within its range. Our proposed method guarantees safety and significantly improves the RL performance by preventing episode-ending collisions. We demonstrate the performance of our proposed method in simulation using human motion capture data.
引用
收藏
页码:6344 / 6350
页数:7
相关论文
共 50 条
  • [21] Provably Safe Reinforcement Learning with Step-wise Violation Constraints
    Xiong, Nuoya
    Du, Yihan
    Huang, Longbo
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [22] Robotic Manipulation with Reinforcement Learning, State Representation Learning, and Imitation Learning
    Chen, Hanxiao
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 15769 - 15770
  • [23] Path Planning for the Robotic Manipulator in Dynamic Environments Based on a Deep Reinforcement Learning Method
    Jie Liu
    Hwa Jen Yap
    Anis Salwa Mohd Khairuddin
    Journal of Intelligent & Robotic Systems, 111 (1)
  • [24] Learning Mobile Manipulation through Deep Reinforcement Learning
    Wang, Cong
    Zhang, Qifeng
    Tian, Qiyan
    Li, Shuo
    Wang, Xiaohui
    Lane, David
    Petillot, Yvan
    Wang, Sen
    SENSORS, 2020, 20 (03)
  • [25] Review of Deep Reinforcement Learning for Robot Manipulation
    Hai Nguyen
    Hung Manh La
    2019 THIRD IEEE INTERNATIONAL CONFERENCE ON ROBOTIC COMPUTING (IRC 2019), 2019, : 590 - 595
  • [26] Reinforcement Learning for Robotic Safe Control with Force Sensing
    Lin, Nan
    Zhang, Linrui
    Chen, Yuxuan
    Zhu, Yujun
    Chen, Ruoxi
    Wu, Peichen
    Chen, Xiaoping
    2019 WORLD ROBOT CONFERENCE SYMPOSIUM ON ADVANCED ROBOTICS AND AUTOMATION (WRC SARA 2019), 2019, : 148 - 153
  • [27] Dynamic Cloth Manipulation with Deep Reinforcement Learning
    Jangir, Rishabh
    Alenya, Guillem
    Torras, Carme
    2020 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2020, : 4630 - 4636
  • [28] Object Manipulation in Marine Environments using Reinforcement Learning
    Nader, Ahmed
    Din, Muhayy Ud
    Irfan, Mughni
    Hussain, Irfan
    IFAC PAPERSONLINE, 2024, 58 (20): : 215 - 222
  • [29] Probabilistic Guarantees for Safe Deep Reinforcement Learning
    Bacci, Edoardo
    Parker, David
    FORMAL MODELING AND ANALYSIS OF TIMED SYSTEMS, FORMATS 2020, 2020, 12288 : 231 - 248
  • [30] Skill Reward for Safe Deep Reinforcement Learning
    Cheng, Jiangchang
    Yu, Fumin
    Zhang, Hongliang
    Dai, Yinglong
    UBIQUITOUS SECURITY, 2022, 1557 : 203 - 213