Enhancing Safety in Learning from Demonstration Algorithms via Control Barrier Function Shielding

Cited by: 2

Authors:

Yang, Yue [1]

Chen, Letian [1]

Zaidi, Zulfiqar [1]

van Waveren, Sanne [1]

Krishna, Arjun [1]

Gombolay, Matthew [1]

Affiliations:

[1] Georgia Institute of Technology, Atlanta, GA 30332, USA

Source:

PROCEEDINGS OF THE 2024 ACM/IEEE INTERNATIONAL CONFERENCE ON HUMAN-ROBOT INTERACTION, HRI 2024 | 2024

Funding:

National Institutes of Health (USA); National Science Foundation (USA)

Keywords:

Learning from Demonstration; Control Barrier Function; Safety;

DOI:

10.1145/3610977.3635002

Chinese Library Classification (CLC) number:

TP18 [Artificial Intelligence Theory]

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Learning from Demonstration (LfD) is a powerful method for non-roboticist end-users to teach robots new tasks, enabling them to customize the robot behavior. However, modern LfD techniques do not explicitly synthesize safe robot behavior, which limits the deployability of these approaches in the real world. To enforce safety in LfD without relying on experts, we propose a new framework, ShiElding with Control barrier fUnctions in inverse REinforcement learning (SECURE), which learns a customized Control Barrier Function (CBF) from end-users that prevents robots from taking unsafe actions while imposing little interference with task completion. We evaluate SECURE in three sets of experiments. First, we empirically validate that SECURE learns a high-quality CBF from demonstrations and outperforms conventional LfD methods on simulated robotic and autonomous driving tasks, with improvements in safety of up to 100%. Second, we demonstrate that roboticists can leverage SECURE to outperform conventional LfD approaches on a real-world knife-cutting, meal-preparation task by 12.5% in task completion while driving the number of safety violations to zero. Finally, we demonstrate in a user study that non-roboticists can use SECURE to effectively teach the robot safe policies that avoid collisions with the person and prevent coffee from spilling.

Pages: 820 - 829

Page count: 10

50 entries in total

[31] Feasibility-guaranteed safety-critical control with high-order control barrier function method
Zhang, Sihua
Zhai, Di-Hua
Xiong, Yuhan
Xia, Yuanqing
INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2024, 34 (05) : 3425 - 3441
[32] Dynamic guidance virtual fixture for hydraulic manipulator via learning from demonstration
Cheng, Min
Li, Renming
Ding, Ruqi
Luo, Shaqi
PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART C-JOURNAL OF MECHANICAL ENGINEERING SCIENCE, 2023, 237 (04) : 952 - 962
[33] Robust Safe Learning and Control in an Unknown Environment: An Uncertainty-Separated Control Barrier Function Approach
Li, Jiacheng
Liu, Qingchen
Jin, Wanxin
Qin, Jiahu
Hirche, Sandra
IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (10) : 6539 - 6546
[34] Nonprehensile Manipulation for Rapid Object Spinning via Multisensory Learning from Demonstration
Shin, Ku Jin
Jeon, Soo
SENSORS, 2024, 24 (02)
[35] Control barrier function based trajectory generation and tracking control for spacecraft inspection mission under multiple safety constraints
Wang, Kun
Meng, Tao
Wang, Weijia
Lei, Jiakun
ADVANCES IN SPACE RESEARCH, 2024, 73 (03) : 2080 - 2097
[36] Learning Gentle Grasping From Human-Free Force Control Demonstration
Li, Mingxuan
Zhang, Lunwei
Li, Tiemin
Jiang, Yao
IEEE ROBOTICS AND AUTOMATION LETTERS, 2025, 10 (03) : 2391 - 2398
[37] Control Barrier Function Based Model Predictive Control to Safety Obstacle-Avoidance of Autonomous Manned Mobile Robots
Gao, Qingji
Feng, Junhu
Zhang, Gaowei
Cao, Wenbo
ADVANCES IN GUIDANCE, NAVIGATION AND CONTROL, VOL 5, 2025, 1341 : 606 - 614
[38] Neural Gaits: Learning Bipedal Locomotion via Control Barrier Functions and Zero Dynamics Policies
Rodriguez, Ivan Dario Jimenez
Csomay-Shanklin, Noel
Yue, Yisong
Ames, Aaron D.
LEARNING FOR DYNAMICS AND CONTROL CONFERENCE, VOL 168, 2022, 168
[39] ACNMP: Skill Transfer and Task Extrapolation through Learning from Demonstration and Reinforcement Learning via Representation Sharing
Akbulut, M. Tuluhan
Oztop, Erhan
Seker, M. Yunus
Xue, Honghu
Tekden, Ahmet E.
Ugur, Emre
CONFERENCE ON ROBOT LEARNING, VOL 155, 2020, 155 : 1896 - 1907
[40] Safety-Critical Generalized Predictive Control for Speed Regulation of PMSM Drives Based on Dynamic Robust Control Barrier Function
Cao, Zhongkun
Mao, Jianliang
Zhang, Chuanlin
Cui, Chenggang
Yang, Jun
IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2025, 72 (02) : 1881 - 1891