Safe learning-based gradient-free model predictive control based on cross-entropy method

被引:5
作者
Zheng, Lei [1 ]
Yang, Rui [2 ]
Wu, Zhixuan [2 ]
Pan, Jiesen [2 ]
Cheng, Hui [2 ]
机构
[1] Sun Yat Sen Univ, Sch Elect & Informat Technol, Guangzhou 510006, Peoples R China
[2] Sun Yat Sen Univ, Sch Comp Sci & Engn, Guangzhou 510006, Peoples R China
关键词
Model predictive control; Learning-based control; Cross-entropy method; Minimal intervention controller; TRAJECTORY GENERATION; SYSTEMS; ROBUST; ROBOTICS; BOUNDS;
D O I
10.1016/j.engappai.2022.104731
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, a safe and learning-based control framework for model predictive control (MPC) is proposed to optimize nonlinear systems with a non-differentiable objective function under uncertain environmental disturbances. The control framework integrates a learning-based MPC with an auxiliary controller in a way of minimal intervention. The learning-based MPC augments the prior nominal model with incremental Gaussian Processes to learn the uncertain disturbances. The cross-entropy method (CEM) is utilized as the sampling-based optimizer for the MPC with a non-differentiable objective function. A minimal intervention controller is devised with a control Lyapunov function and a control barrier function to guide the sampling process and endow the system with high probabilistic safety. The proposed algorithm shows a safe and adaptive control performance on a simulated quadrotor in the tasks of trajectory tracking and obstacle avoidance under uncertain wind disturbances.
引用
收藏
页数:14
相关论文
共 51 条
[1]  
Ames AD, 2019, 2019 18TH EUROPEAN CONTROL CONFERENCE (ECC), P3420, DOI [10.23919/ECC.2019.8796030, 10.23919/ecc.2019.8796030]
[2]   Control Barrier Function Based Quadratic Programs for Safety Critical Systems [J].
Ames, Aaron D. ;
Xu, Xiangru ;
Grizzle, Jessy W. ;
Tabuada, Paulo .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2017, 62 (08) :3861-3876
[3]  
Andersen M., 2013, CVXOPT: A python package for convex optimization, version 1.1.6
[4]   CasADi: a software framework for nonlinear optimization and optimal control [J].
Andersson, Joel A. E. ;
Gillis, Joris ;
Horn, Greg ;
Rawlings, James B. ;
Diehl, Moritz .
MATHEMATICAL PROGRAMMING COMPUTATION, 2019, 11 (01) :1-36
[5]  
[Anonymous], 2001, Regularization, Optimization, and Beyond
[6]  
[Anonymous], 1997, Primal-dual interior-point methods
[7]   Provably safe and robust learning-based model predictive control [J].
Aswani, Anil ;
Gonzalez, Humberto ;
Sastry, S. Shankar ;
Tomlin, Claire .
AUTOMATICA, 2013, 49 (05) :1216-1226
[8]  
Berkenkamp F, 2017, ADV NEUR IN, V30
[9]  
Berkenkamp F, 2016, IEEE DECIS CONTR P, P4661, DOI 10.1109/CDC.2016.7798979
[10]  
Bharadhwaj H., 2020, ARXIV PREPRINT ARXIV