ConBaT: Control Barrier Transformer for Safe Robot Learning from Demonstrations

被引：0

作者：

Meng, Yue ^{[1
]}

Vemprela, Sai ^{[2
]}

Bonatti, Rogerio ^{[2
]}

Fan, Chuchu ^{[1
]}

Kapoor, Ashish ^{[2
]}

机构：

[1] MIT, Dept Aeronaut & Astronaut, 70 Vassar St, Cambridge, MA 02139 USA

[2] Microsoft Res, Autonomous Syst & Robot Grp, 14820 NE 36th St, Redmond, WA 98052 USA

来源：

2024 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2024) | 2024年

关键词：

D O I：

10.1109/ICRA57147.2024.10611109

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Large-scale self-supervised models have recently revolutionized our ability to perform a variety of tasks within the vision and language domains. However, using such models for autonomous systems is challenging because of safety requirements: besides executing correct actions, an autonomous agent must also avoid the high cost and potentially fatal critical mistakes. Traditionally, self-supervised training mainly focuses on imitating previously observed behaviors, and the training demonstrations carry no notion of which behaviors should be explicitly avoided. In this work, we propose Control Barrier Transformer (ConBaT), an approach that learns safe behaviors from demonstrations in a self-supervised fashion. ConBaT is inspired by the concept of control barrier functions in control theory and uses a causal transformer that learns to predict safe robot actions autoregressively using a critic that requires minimal safety data labeling. During deployment, we employ a lightweight online optimization to find actions that ensure future states lie within the learned safe set. We apply our approach to different simulated control tasks and show that our method results in safer control policies compared to other classical and learning-based methods such as imitation learning, reinforcement learning, and model predictive control.

引用

页码：12857 / 12864

页数：8

共 50 条

[1] Learning Robust Output Control Barrier Functions From Safe Expert Demonstrations
Lindemann, Lars
Robey, Alexander
Jiang, Lejun
Das, Satyajeet
Tu, Stephen
Matni, Nikolai
IEEE OPEN JOURNAL OF CONTROL SYSTEMS, 2024, 3 : 158 - 172
[2] Redundant robot control with learning from expert demonstrations
Ramirez, Jorge
Yu, Wen
2022 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2022, : 715 - 720
[3] Learning Control Barrier Functions from Expert Demonstrations
Robey, Alexander
Hu, Haimin
Lindemann, Lars
Zhang, Hanwen
Dimarogonas, Dimos, V
Tu, Stephen
Matni, Nikolai
2020 59TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2020, : 3717 - 3724
[4] BarrierNet: Differentiable Control Barrier Functions for Learning of Safe Robot Control
Xiao, Wei
Wang, Tsun-Hsuan
Hasani, Ramin
Chahine, Makram
Amini, Alexander
Li, Xiao
Rus, Daniela
IEEE TRANSACTIONS ON ROBOTICS, 2023, 39 (03) : 2289 - 2307
[5] Reinforcement Learning for Safe Robot Control using Control Lyapunov Barrier Functions
Du, Desong
Han, Shaohang
Qi, Naiming
Ammar, Haitham Bou
Wang, Jun
Pan, Wei
2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2023), 2023, : 9442 - 9448
[6] Robot Learning to Paint from Demonstrations
Park, Younghyo
Jeon, Seunghun
Lee, Taeyoon
2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 3053 - 3060
[7] Robot Learning from Failed Demonstrations
Grollman, Daniel H.
Billard, Aude G.
INTERNATIONAL JOURNAL OF SOCIAL ROBOTICS, 2012, 4 (04) : 331 - 342
[8] Robot Learning from Failed Demonstrations
Daniel H. Grollman
Aude G. Billard
International Journal of Social Robotics, 2012, 4 : 331 - 342
[9] Reinforcement learning from expert demonstrations with application to redundant robot control
Ramirez, Jorge
Yu, Wen
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 119
[10] Biologically Inspired Motion Modeling and Neural Control for Robot Learning From Demonstrations
Yang, Chenguang
Chen, Chuize
Wang, Ning
Ju, Zhaojie
Fu, Jian
Wang, Min
IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2019, 11 (02) : 281 - 291

← 1 2 3 4 5 →