Safe and Efficient Reinforcement Learning Using Disturbance-Observer-Based Control Barrier Functions

被引:0
|
作者
Cheng, Yikun [1 ]
Zhao, Pan [1 ]
Hovakimyan, Naira [1 ]
机构
[1] Univ Illinois, Urbana, IL 61801 USA
来源
LEARNING FOR DYNAMICS AND CONTROL CONFERENCE, VOL 211 | 2023年 / 211卷
关键词
Reinforcement learning; robot safety; robust control; uncertainty estimation;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Safe reinforcement learning (RL) with assured satisfaction of hard state constraints during training has recently received a lot of attention. Safety filters, e.g., based on control barrier functions (CBFs), provide a promising way for safe RL via modifying the unsafe actions of an RL agent on the fly. Existing safety filter-based approaches typically involve learning of uncertain dynamics and quantifying the learned model error, which leads to conservative filters before a large amount of data is collected to learn a good model, thereby preventing efficient exploration. This paper presents a method for safe and efficient RL using disturbance observers (DOBs) and control barrier functions (CBFs). Unlike most existing safe RL methods that deal with hard state constraints, our method does not involve model learning, and leverages DOBs to accurately estimate the pointwise value of the uncertainty, which is then incorporated into a robust CBF condition to generate safe actions. The DOB-based CBF can be used as a safety filter with model-free RL algorithms by minimally modifying the actions of an RL agent whenever necessary to ensure safety throughout the learning process. Simulation results on a unicycle and a 2D quadrotor demonstrate that the proposed method outperforms a state-of-the-art safe RL algorithm using CBFs and Gaussian processes-based model learning, in terms of safety violation rate, and sample and computational efficiency.
引用
收藏
页数:12
相关论文
共 50 条
  • [1] Safe Reinforcement Learning Using Robust Control Barrier Functions
    Emam, Yousef
    Notomista, Gennaro
    Glotfelter, Paul
    Kira, Zsolt
    Egerstedt, Magnus
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2025, 10 (03): : 2886 - 2893
  • [2] Safe exploration in model-based reinforcement learning using control barrier functions
    Cohen, Max H.
    Belta, Calin
    AUTOMATICA, 2023, 147
  • [3] Safe Off-policy Reinforcement Learning Using Barrier Functions
    Marvi, Zahra
    Kiumarsi, Bahare
    2020 AMERICAN CONTROL CONFERENCE (ACC), 2020, : 2176 - 2181
  • [4] Disturbance-observer-based robust synchronization control of uncertain chaotic systems
    Mou Chen
    Qingxian Wu
    Changsheng Jiang
    Nonlinear Dynamics, 2012, 70 : 2421 - 2432
  • [5] Disturbance-Observer-based Robust Control for Time Delay Uncertain Systems
    Chen, Mou
    Chen, Wen-Hua
    INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS, 2010, 8 (02) : 445 - 453
  • [6] Safe and Robust Observer-Controller Synthesis Using Control Barrier Functions
    Agrawal, Devansh R.
    Panagou, Dimitra
    IEEE CONTROL SYSTEMS LETTERS, 2022, 7 : 127 - 132
  • [7] Disturbance-observer-based robust synchronization control of uncertain chaotic systems
    Chen, Mou
    Wu, Qingxian
    Jiang, Changsheng
    NONLINEAR DYNAMICS, 2012, 70 (04) : 2421 - 2432
  • [8] Disturbance-observer-based robust control for time delay uncertain systems
    Mou Chen
    Wen-Hua Chen
    International Journal of Control, Automation and Systems, 2010, 8 : 445 - 453
  • [9] Disturbance-Observer-Based Model Predictive Control of Underwater Vehicle Manipulator Systems
    Oliveira, Everton L.
    Orsino, Renato M. M.
    Donha, Decio C.
    IFAC PAPERSONLINE, 2021, 54 (16): : 348 - 355
  • [10] Sample-Efficient Safe Learning for Online Nonlinear Control with Control Barrier Functions
    Luo, Wenhao
    Sun, Wen
    Kapoor, Ashish
    ALGORITHMIC FOUNDATIONS OF ROBOTICS XV, 2023, 25 : 419 - 435