Control Barriers in Bayesian Learning of System Dynamics

被引:26
作者
Dhiman, Vikas [1 ]
Khojasteh, Mohammad Javad [2 ]
Franceschetti, Massimo [3 ]
Atanasov, Nikolay [3 ]
机构
[1] Univ Maine, Dept Elect & Comp Engn, Bangor, ME 04469 USA
[2] MIT, Lab Informat & Decis Syst, Cambridge, MA 02139 USA
[3] Univ Calif San Diego, Dept Elect & Comp Engn, La Jolla, CA 92093 USA
基金
美国国家科学基金会;
关键词
Control barrier function (CBF); Gaussian process; high relative-degree system safety; learning for dynamics and control; self-triggered safe control; QUADRATIC PROGRAMS; FRAMEWORK;
D O I
10.1109/TAC.2021.3137059
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This article focuses on learning a model of system dynamics online, while satisfying safety constraints. Our objective is to avoid offline system identification or hand-specified models and allow a system to safely and autonomously estimate and adapt its own model during operation. Given streaming observations of the system state, we use Bayesian learning to obtain a distribution over the system dynamics. Specifically, we propose a new matrix variate Gaussian process (MVGP) regression approach with an efficient covariance factorization to learn the drift and input gain terms of a nonlinear control-affine system. The MVGP distribution is then used to optimize the system behavior and ensure safety with high probability, by specifying control Lyapunov function (CLF) and control barrier function (CBF) chance constraints. We show that a safe control policy can be synthesized for systems with arbitrary relative degree and probabilistic CLF-CBF constraints by solving a second-order cone program. Finally, we extend our design to a self-triggering formulation, adaptively determining the time at which a new control input needs to be applied in order to guarantee safety.
引用
收藏
页码:214 / 229
页数:16
相关论文
共 83 条
[1]  
Adler R. J., 1981, The Geometry of Random Fields
[2]  
Akella P, 2020, IEEE DECIS CONTR P, P790, DOI 10.1109/CDC42340.2020.9303776
[3]  
Alshiekh M, 2018, AAAI CONF ARTIF INTE, P2669
[4]   Kernels for Vector-Valued Functions: A Review [J].
Alvarez, Mauricio A. ;
Rosasco, Lorenzo ;
Lawrence, Neil D. .
FOUNDATIONS AND TRENDS IN MACHINE LEARNING, 2012, 4 (03) :195-266
[5]  
Ames AD, 2019, 2019 18TH EUROPEAN CONTROL CONFERENCE (ECC), P3420, DOI [10.23919/ecc.2019.8796030, 10.23919/ECC.2019.8796030]
[6]   Control Barrier Function Based Quadratic Programs for Safety Critical Systems [J].
Ames, Aaron D. ;
Xu, Xiangru ;
Grizzle, Jessy W. ;
Tabuada, Paulo .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2017, 62 (08) :3861-3876
[7]  
[Anonymous], 2020, GUROBI OPTIMIZER REF
[8]   Learning for Autonomous Navigation Advances in Machine Learning for Rough Terrain Mobility [J].
Bagnell, James Andrew ;
Bradley, David ;
Silver, David ;
Sofman, Boris ;
Stentz, Anthony .
IEEE ROBOTICS & AUTOMATION MAGAZINE, 2010, 17 (02) :74-84
[9]  
Barbosa FS, 2020, IEEE DECIS CONTR P, P2056, DOI 10.1109/CDC42340.2020.9304160
[10]  
Bastani O, 2021, P AMER CONTR CONF, P3488, DOI 10.23919/ACC50511.2021.9483182