ConBaT: Control Barrier Transformer for Safe Robot Learning from Demonstrations

Cited by: 0

Authors:

Meng, Yue [1]

Vemprala, Sai [2]

Bonatti, Rogerio [2]

Fan, Chuchu [1]

Kapoor, Ashish [2]

Affiliations:

[1] MIT, Dept Aeronaut & Astronaut, 70 Vassar St, Cambridge, MA 02139 USA

[2] Microsoft Res, Autonomous Syst & Robot Grp, 14820 NE 36th St, Redmond, WA 98052 USA

Source:

2024 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2024) | 2024

DOI:

10.1109/ICRA57147.2024.10611109

Chinese Library Classification (CLC) code:

TP [Automation Technology, Computer Technology]

Discipline classification code:

0812

Abstract:

Large-scale self-supervised models have recently revolutionized our ability to perform a variety of tasks within the vision and language domains. However, using such models for autonomous systems is challenging because of safety requirements: besides executing correct actions, an autonomous agent must also avoid the high cost and potentially fatal critical mistakes. Traditionally, self-supervised training mainly focuses on imitating previously observed behaviors, and the training demonstrations carry no notion of which behaviors should be explicitly avoided. In this work, we propose Control Barrier Transformer (ConBaT), an approach that learns safe behaviors from demonstrations in a self-supervised fashion. ConBaT is inspired by the concept of control barrier functions in control theory and uses a causal transformer that learns to predict safe robot actions autoregressively using a critic that requires minimal safety data labeling. During deployment, we employ a lightweight online optimization to find actions that ensure future states lie within the learned safe set. We apply our approach to different simulated control tasks and show that our method results in safer control policies compared to other classical and learning-based methods such as imitation learning, reinforcement learning, and model predictive control.

Pages: 12857-12864

Page count: 8
