Enhancing Safety in Learning from Demonstration Algorithms via Control Barrier Function Shielding

Cited by: 2
Authors
Yang, Yue [1]
Chen, Letian [1]
Zaidi, Zulfqar [1]
van Waveren, Sanne [1]
Krishna, Arjun [1]
Gombolay, Matthew [1]
Affiliations
[1] Georgia Inst Technol, Atlanta, GA 30332 USA
Source
PROCEEDINGS OF THE 2024 ACM/IEEE INTERNATIONAL CONFERENCE ON HUMAN-ROBOT INTERACTION, HRI 2024 | 2024
Funding
National Institutes of Health (USA); National Science Foundation (USA);
Keywords
Learning from Demonstration; Control Barrier Function; Safety;
DOI
10.1145/3610977.3635002
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Learning from Demonstration (LfD) is a powerful method for non-roboticist end-users to teach robots new tasks, enabling them to customize the robot's behavior. However, modern LfD techniques do not explicitly synthesize safe robot behavior, which limits the deployability of these approaches in the real world. To enforce safety in LfD without relying on experts, we propose a new framework, ShiElding with Control barrier fUnctions in inverse REinforcement learning (SECURE), which learns a customized Control Barrier Function (CBF) from end-users that prevents robots from taking unsafe actions while imposing little interference with task completion. We evaluate SECURE in three sets of experiments. First, we empirically validate that SECURE learns a high-quality CBF from demonstrations and outperforms conventional LfD methods on simulated robotic and autonomous driving tasks, with safety improvements of up to 100%. Second, we demonstrate that roboticists can leverage SECURE to outperform conventional LfD approaches on a real-world knife-cutting, meal-preparation task by 12.5% in task completion while driving the number of safety violations to zero. Finally, we demonstrate in a user study that non-roboticists can use SECURE to effectively teach the robot safe policies that avoid collisions with the person and prevent coffee from spilling.
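As a rough illustration of what "shielding" with a CBF means in the abstract, the sketch below filters a nominal action so that a barrier condition keeps holding. It is not the SECURE method: the barrier here is hand-written (distance to a circular obstacle) rather than learned from demonstrations, the dynamics are assumed to be a single integrator (x' = u), and the names `cbf_shield`, `obstacle`, `radius`, and `alpha` are hypothetical. For a single constraint, the minimum-change correction has a closed form, so no QP solver is needed.

```python
def cbf_shield(u_nom, x, obstacle, radius, alpha=1.0):
    """Return the action closest to u_nom that satisfies one CBF constraint.

    Assumptions (illustrative only, not taken from the paper):
      - single-integrator dynamics, x' = u
      - hand-written barrier h(x) = ||x - obstacle||^2 - radius^2 (safe when h >= 0)
      - CBF condition: grad_h(x) . u + alpha * h(x) >= 0
    """
    dx = [xi - oi for xi, oi in zip(x, obstacle)]
    h = sum(d * d for d in dx) - radius ** 2
    grad = [2.0 * d for d in dx]

    # How much the nominal action satisfies (>= 0) or violates (< 0) the condition.
    slack = sum(g * u for g, u in zip(grad, u_nom)) + alpha * h
    if slack >= 0.0:
        # Nominal action is already safe: the shield does not interfere.
        return list(u_nom)

    norm_sq = sum(g * g for g in grad)
    if norm_sq == 0.0:
        # Degenerate case (state at the obstacle center): no action can
        # satisfy the constraint, so this sketch falls back to stopping.
        return [0.0] * len(u_nom)

    # Project u_nom onto the boundary of the constraint half-space,
    # which is the smallest change that restores the CBF condition.
    scale = -slack / norm_sq
    return [u + scale * g for u, g in zip(u_nom, grad)]


# Usage: the robot at (1.5, 0) is commanded straight toward an obstacle at the origin.
safe_u = cbf_shield([-1.0, 0.0], [1.5, 0.0], [0.0, 0.0], radius=1.0)
```

In this toy setting the shield slows the approach toward the obstacle and leaves actions that move away from it unchanged, which mirrors the abstract's goal of preventing unsafe actions with little interference to the task.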
Pages: 820-829
Page count: 10
Related papers
50 in total
  • [31] Feasibility-guaranteed safety-critical control with high-order control barrier function method
    Zhang, Sihua
    Zhai, Di-Hua
    Xiong, Yuhan
    Xia, Yuanqing
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2024, 34 (05) : 3425 - 3441
  • [32] Dynamic guidance virtual fixture for hydraulic manipulator via learning from demonstration
    Cheng, Min
    Li, Renming
    Ding, Ruqi
    Luo, Shaqi
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART C-JOURNAL OF MECHANICAL ENGINEERING SCIENCE, 2023, 237 (04) : 952 - 962
  • [33] Robust Safe Learning and Control in an Unknown Environment: An Uncertainty-Separated Control Barrier Function Approach
    Li, Jiacheng
    Liu, Qingchen
    Jin, Wanxin
    Qin, Jiahu
    Hirche, Sandra
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (10) : 6539 - 6546
  • [34] Nonprehensile Manipulation for Rapid Object Spinning via Multisensory Learning from Demonstration
    Shin, Ku Jin
    Jeon, Soo
    SENSORS, 2024, 24 (02)
  • [35] Control barrier function based trajectory generation and tracking control for spacecraft inspection mission under multiple safety constraints
    Wang, Kun
    Meng, Tao
    Wang, Weijia
    Lei, Jiakun
    ADVANCES IN SPACE RESEARCH, 2024, 73 (03) : 2080 - 2097
  • [36] Learning Gentle Grasping From Human-Free Force Control Demonstration
    Li, Mingxuan
    Zhang, Lunwei
    Li, Tiemin
    Jiang, Yao
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2025, 10 (03) : 2391 - 2398
  • [37] Control Barrier Function Based Model Predictive Control to Safety Obstacle-Avoidance of Autonomous Manned Mobile Robots
    Gao, Qingji
    Feng, Junhu
    Zhang, Gaowei
    Cao, Wenbo
    ADVANCES IN GUIDANCE, NAVIGATION AND CONTROL, VOL 5, 2025, 1341 : 606 - 614
  • [38] Neural Gaits: Learning Bipedal Locomotion via Control Barrier Functions and Zero Dynamics Policies
    Rodriguez, Ivan Dario Jimenez
    Csomay-Shanklin, Noel
    Yue, Yisong
    Ames, Aaron D.
    LEARNING FOR DYNAMICS AND CONTROL CONFERENCE, VOL 168, 2022, 168
  • [39] ACNMP: Skill Transfer and Task Extrapolation through Learning from Demonstration and Reinforcement Learning via Representation Sharing
    Akbulut, M. Tuluhan
    Oztop, Erhan
    Seker, M. Yunus
    Xue, Honghu
    Tekden, Ahmet E.
    Ugur, Emre
    CONFERENCE ON ROBOT LEARNING, VOL 155, 2020, 155 : 1896 - 1907
  • [40] Safety-Critical Generalized Predictive Control for Speed Regulation of PMSM Drives Based on Dynamic Robust Control Barrier Function
    Cao, Zhongkun
    Mao, Jianliang
    Zhang, Chuanlin
    Cui, Chenggang
    Yang, Jun
    IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2025, 72 (02) : 1881 - 1891