Provably Safe Deep Reinforcement Learning for Robotic Manipulation in Human Environments

被引:16
|
作者
Thumm, Jakob [1 ]
Althoff, Matthias [1 ]
机构
[1] Tech Univ Munich, Dept Informat, D-85748 Garching, Germany
关键词
D O I
10.1109/ICRA46639.2022.9811698
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Deep reinforcement learning (RL) has shown promising results in the motion planning of manipulators. However, no method guarantees the safety of highly dynamic obstacles, such as humans, in RL-based manipulator control. This lack of formal safety assurances prevents the application of RL for manipulators in real-world human environments. Therefore, we propose a shielding mechanism that ensures ISO-verified human safety while training and deploying RL algorithms on manipulators. We utilize a fast reachability analysis of humans and manipulators to guarantee that the manipulator comes to a complete stop before a human is within its range. Our proposed method guarantees safety and significantly improves the RL performance by preventing episode-ending collisions. We demonstrate the performance of our proposed method in simulation using human motion capture data.
引用
收藏
页码:6344 / 6350
页数:7
相关论文
共 50 条
  • [31] Robotic Grasping using Deep Reinforcement Learning
    Joshi, Shirin
    Kumra, Sulabh
    Sahin, Ferat
    2020 IEEE 16TH INTERNATIONAL CONFERENCE ON AUTOMATION SCIENCE AND ENGINEERING (CASE), 2020, : 1461 - 1466
  • [32] Efficient Safe Learning for Robotic Systems in Unstructured Environments
    Pohland, Sara
    Herbert, Sylvia
    Tomlin, Claire
    2019 IEEE 16TH INTERNATIONAL CONFERENCE ON MOBILE AD HOC AND SENSOR SYSTEMS WORKSHOPS (MASSW 2019), 2019, : 82 - 86
  • [33] Reinforcement learning algorithms for robotic navigation in dynamic environments
    Yen, GG
    Hickey, TW
    ISA TRANSACTIONS, 2004, 43 (02) : 217 - 230
  • [34] Reinforcement learning algorithms for robotic navigation in dynamic environments
    Yen, G
    Hickey, T
    PROCEEDING OF THE 2002 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-3, 2002, : 1444 - 1449
  • [35] Towards a Broad-Persistent Advising Approach for Deep Interactive Reinforcement Learning in Robotic Environments
    Nguyen, Hung Son
    Cruz, Francisco
    Dazeley, Richard
    SENSORS, 2023, 23 (05)
  • [36] Safe Navigation for UAV-Enabled Data Dissemination by Deep Reinforcement Learning in Unknown Environments
    Fei Huang
    Guangxia Li
    Shiwei Tian
    Jin Chen
    Guangteng Fan
    Jinghui Chang
    ChinaCommunications, 2022, 19 (01) : 202 - 217
  • [37] Natural object manipulation using anthropomorphic robotic hand through deep reinforcement learning and deep grasping probability network
    Valarezo Anazco, Edwin
    Rivera Lopez, Patricio
    Park, Nahyeon
    Oh, Jiheon
    Ryu, Gahyeon
    Al-antari, Mugahed A.
    Kim, Tae-Seong
    APPLIED INTELLIGENCE, 2021, 51 (02) : 1041 - 1055
  • [38] Safe Navigation for UAV-Enabled Data Dissemination by Deep Reinforcement Learning in Unknown Environments
    Huang, Fei
    Li, Guangxia
    Tian, Shiwei
    Chen, Jin
    Fan, Guangteng
    Chang, Jinghui
    CHINA COMMUNICATIONS, 2022, 19 (01) : 202 - 217
  • [39] Natural object manipulation using anthropomorphic robotic hand through deep reinforcement learning and deep grasping probability network
    Edwin Valarezo Añazco
    Patricio Rivera Lopez
    Nahyeon Park
    Jiheon Oh
    Gahyeon Ryu
    Mugahed A. Al-antari
    Tae-Seong Kim
    Applied Intelligence, 2021, 51 : 1041 - 1055
  • [40] Towards Safe Human-Robot Collaboration Using Deep Reinforcement Learning
    El-Shamouty, Mohamed
    Wu, Xinyang
    Yang, Shanqi
    Albus, Marcel
    Huber, Marco F.
    2020 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2020, : 4899 - 4905