ConBaT: Control Barrier Transformer for Safe Robot Learning from Demonstrations

被引:0
|
作者
Meng, Yue [1 ]
Vemprela, Sai [2 ]
Bonatti, Rogerio [2 ]
Fan, Chuchu [1 ]
Kapoor, Ashish [2 ]
机构
[1] MIT, Dept Aeronaut & Astronaut, 70 Vassar St, Cambridge, MA 02139 USA
[2] Microsoft Res, Autonomous Syst & Robot Grp, 14820 NE 36th St, Redmond, WA 98052 USA
来源
2024 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2024) | 2024年
关键词
D O I
10.1109/ICRA57147.2024.10611109
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Large-scale self-supervised models have recently revolutionized our ability to perform a variety of tasks within the vision and language domains. However, using such models for autonomous systems is challenging because of safety requirements: besides executing correct actions, an autonomous agent must also avoid the high cost and potentially fatal critical mistakes. Traditionally, self-supervised training mainly focuses on imitating previously observed behaviors, and the training demonstrations carry no notion of which behaviors should be explicitly avoided. In this work, we propose Control Barrier Transformer (ConBaT), an approach that learns safe behaviors from demonstrations in a self-supervised fashion. ConBaT is inspired by the concept of control barrier functions in control theory and uses a causal transformer that learns to predict safe robot actions autoregressively using a critic that requires minimal safety data labeling. During deployment, we employ a lightweight online optimization to find actions that ensure future states lie within the learned safe set. We apply our approach to different simulated control tasks and show that our method results in safer control policies compared to other classical and learning-based methods such as imitation learning, reinforcement learning, and model predictive control.
引用
收藏
页码:12857 / 12864
页数:8
相关论文
共 50 条
  • [21] Safe Robot Trajectory Control Using Probabilistic Movement Primitives and Control Barrier Functions
    Davoodi, Mohammadreza
    Iqbal, Asif
    Cloud, Joseph M.
    Beksi, William J.
    Gans, Nicholas R.
    FRONTIERS IN ROBOTICS AND AI, 2022, 9
  • [22] Learning from Demonstrations in Human-Robot Collaborative Scenarios: A Survey
    Daniel Sosa-Ceron, Arturo
    Gustavo Gonzalez-Hernandez, Hugo
    Antonio Reyes-Avendano, Jorge
    ROBOTICS, 2022, 11 (06)
  • [23] Learning how to approach industrial robot tasks from natural demonstrations
    Michieletto, Stefano
    Chessa, Nicola
    Menegatti, Emanuele
    2013 IEEE WORKSHOP ON ADVANCED ROBOTICS AND ITS SOCIAL IMPACTS (ARSO), 2013, : 255 - 260
  • [24] Learning control lyapunov functions from counterexamples and demonstrations
    Hadi Ravanbakhsh
    Sriram Sankaranarayanan
    Autonomous Robots, 2019, 43 : 275 - 307
  • [25] Learning control lyapunov functions from counterexamples and demonstrations
    Ravanbakhsh, Hadi
    Sankaranarayanan, Sriram
    AUTONOMOUS ROBOTS, 2019, 43 (02) : 275 - 307
  • [26] An Automatic Robot Skills Learning System from Robot's Real-World Demonstrations
    Li, Boyao
    Lu, Tao
    Li, Xiaocan
    Cai, Yinghao
    Wang, Shuo
    PROCEEDINGS OF THE 2019 31ST CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2019), 2019, : 5138 - 5142
  • [27] Safe reinforcement learning: A control barrier function optimization approach
    Marvi, Zahra
    Kiumarsi, Bahare
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2021, 31 (06) : 1923 - 1940
  • [28] Safe Reinforcement Learning Using Robust Control Barrier Functions
    Emam, Yousef
    Notomista, Gennaro
    Glotfelter, Paul
    Kira, Zsolt
    Egerstedt, Magnus
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2025, 10 (03): : 2886 - 2893
  • [29] Improved Learning Accuracy for Learning Stable Control from Human Demonstrations
    Jin, Shaokun
    Wang, Zhiyang
    Ou, Yongsheng
    Zhou, Yimin
    2019 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2019, : 2679 - 2685
  • [30] A Control Barrier Function Approach to Human-multi-robot Safe Interaction
    Lippi, Martina
    Marino, Alessandro
    2021 29TH MEDITERRANEAN CONFERENCE ON CONTROL AND AUTOMATION (MED), 2021, : 604 - 609