ConBaT: Control Barrier Transformer for Safe Robot Learning from Demonstrations

被引:0
|
作者
Meng, Yue [1 ]
Vemprela, Sai [2 ]
Bonatti, Rogerio [2 ]
Fan, Chuchu [1 ]
Kapoor, Ashish [2 ]
机构
[1] MIT, Dept Aeronaut & Astronaut, 70 Vassar St, Cambridge, MA 02139 USA
[2] Microsoft Res, Autonomous Syst & Robot Grp, 14820 NE 36th St, Redmond, WA 98052 USA
来源
2024 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2024) | 2024年
关键词
D O I
10.1109/ICRA57147.2024.10611109
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Large-scale self-supervised models have recently revolutionized our ability to perform a variety of tasks within the vision and language domains. However, using such models for autonomous systems is challenging because of safety requirements: besides executing correct actions, an autonomous agent must also avoid the high cost and potentially fatal critical mistakes. Traditionally, self-supervised training mainly focuses on imitating previously observed behaviors, and the training demonstrations carry no notion of which behaviors should be explicitly avoided. In this work, we propose Control Barrier Transformer (ConBaT), an approach that learns safe behaviors from demonstrations in a self-supervised fashion. ConBaT is inspired by the concept of control barrier functions in control theory and uses a causal transformer that learns to predict safe robot actions autoregressively using a critic that requires minimal safety data labeling. During deployment, we employ a lightweight online optimization to find actions that ensure future states lie within the learned safe set. We apply our approach to different simulated control tasks and show that our method results in safer control policies compared to other classical and learning-based methods such as imitation learning, reinforcement learning, and model predictive control.
引用
收藏
页码:12857 / 12864
页数:8
相关论文
共 50 条
  • [1] Learning Robust Output Control Barrier Functions From Safe Expert Demonstrations
    Lindemann, Lars
    Robey, Alexander
    Jiang, Lejun
    Das, Satyajeet
    Tu, Stephen
    Matni, Nikolai
    IEEE OPEN JOURNAL OF CONTROL SYSTEMS, 2024, 3 : 158 - 172
  • [2] Redundant robot control with learning from expert demonstrations
    Ramirez, Jorge
    Yu, Wen
    2022 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2022, : 715 - 720
  • [3] Learning Control Barrier Functions from Expert Demonstrations
    Robey, Alexander
    Hu, Haimin
    Lindemann, Lars
    Zhang, Hanwen
    Dimarogonas, Dimos, V
    Tu, Stephen
    Matni, Nikolai
    2020 59TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2020, : 3717 - 3724
  • [4] BarrierNet: Differentiable Control Barrier Functions for Learning of Safe Robot Control
    Xiao, Wei
    Wang, Tsun-Hsuan
    Hasani, Ramin
    Chahine, Makram
    Amini, Alexander
    Li, Xiao
    Rus, Daniela
    IEEE TRANSACTIONS ON ROBOTICS, 2023, 39 (03) : 2289 - 2307
  • [5] Reinforcement Learning for Safe Robot Control using Control Lyapunov Barrier Functions
    Du, Desong
    Han, Shaohang
    Qi, Naiming
    Ammar, Haitham Bou
    Wang, Jun
    Pan, Wei
    2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2023), 2023, : 9442 - 9448
  • [6] Robot Learning to Paint from Demonstrations
    Park, Younghyo
    Jeon, Seunghun
    Lee, Taeyoon
    2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 3053 - 3060
  • [7] Robot Learning from Failed Demonstrations
    Grollman, Daniel H.
    Billard, Aude G.
    INTERNATIONAL JOURNAL OF SOCIAL ROBOTICS, 2012, 4 (04) : 331 - 342
  • [8] Robot Learning from Failed Demonstrations
    Daniel H. Grollman
    Aude G. Billard
    International Journal of Social Robotics, 2012, 4 : 331 - 342
  • [9] Reinforcement learning from expert demonstrations with application to redundant robot control
    Ramirez, Jorge
    Yu, Wen
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 119
  • [10] Biologically Inspired Motion Modeling and Neural Control for Robot Learning From Demonstrations
    Yang, Chenguang
    Chen, Chuize
    Wang, Ning
    Ju, Zhaojie
    Fu, Jian
    Wang, Min
    IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2019, 11 (02) : 281 - 291