Safe learning-based gradient-free model predictive control based on cross-entropy method

被引:3
作者
Zheng, Lei [1 ]
Yang, Rui [2 ]
Wu, Zhixuan [2 ]
Pan, Jiesen [2 ]
Cheng, Hui [2 ]
机构
[1] Sun Yat Sen Univ, Sch Elect & Informat Technol, Guangzhou 510006, Peoples R China
[2] Sun Yat Sen Univ, Sch Comp Sci & Engn, Guangzhou 510006, Peoples R China
关键词
Model predictive control; Learning-based control; Cross-entropy method; Minimal intervention controller; TRAJECTORY GENERATION; SYSTEMS; ROBUST; ROBOTICS; BOUNDS;
D O I
10.1016/j.engappai.2022.104731
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, a safe and learning-based control framework for model predictive control (MPC) is proposed to optimize nonlinear systems with a non-differentiable objective function under uncertain environmental disturbances. The control framework integrates a learning-based MPC with an auxiliary controller in a way of minimal intervention. The learning-based MPC augments the prior nominal model with incremental Gaussian Processes to learn the uncertain disturbances. The cross-entropy method (CEM) is utilized as the sampling-based optimizer for the MPC with a non-differentiable objective function. A minimal intervention controller is devised with a control Lyapunov function and a control barrier function to guide the sampling process and endow the system with high probabilistic safety. The proposed algorithm shows a safe and adaptive control performance on a simulated quadrotor in the tasks of trajectory tracking and obstacle avoidance under uncertain wind disturbances.
引用
收藏
页数:14
相关论文
共 51 条
  • [1] Ames AD, 2019, 2019 18TH EUROPEAN CONTROL CONFERENCE (ECC), P3420, DOI [10.23919/ECC.2019.8796030, 10.23919/ecc.2019.8796030]
  • [2] Control Barrier Function Based Quadratic Programs for Safety Critical Systems
    Ames, Aaron D.
    Xu, Xiangru
    Grizzle, Jessy W.
    Tabuada, Paulo
    [J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2017, 62 (08) : 3861 - 3876
  • [3] Andersen MS, 2013, CVXOPT PYTHON PACKAG
  • [4] CasADi: a software framework for nonlinear optimization and optimal control
    Andersson, Joel A. E.
    Gillis, Joris
    Horn, Greg
    Rawlings, James B.
    Diehl, Moritz
    [J]. MATHEMATICAL PROGRAMMING COMPUTATION, 2019, 11 (01) : 1 - 36
  • [5] Provably safe and robust learning-based model predictive control
    Aswani, Anil
    Gonzalez, Humberto
    Sastry, S. Shankar
    Tomlin, Claire
    [J]. AUTOMATICA, 2013, 49 (05) : 1216 - 1226
  • [6] Berkenkamp F, 2017, ADV NEUR IN, V30
  • [7] Berkenkamp F, 2016, IEEE DECIS CONTR P, P4661, DOI 10.1109/CDC.2016.7798979
  • [8] Bharadhwaj H., 2020, ARXIV PREPRINT ARXIV
  • [9] A Globally Stabilizing Path Following Controller for Rotorcraft With Wind Disturbance Rejection
    Cabecinhas, David
    Cunha, Rita
    Silvestre, Carlos
    [J]. IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2015, 23 (02) : 708 - 714
  • [10] Gaussian Process Model Predictive Control of an Unmanned Quadrotor
    Cao, Gang
    Lai, Edmund M. -K.
    Alam, Fakhrul
    [J]. JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2017, 88 (01) : 147 - 162