Safe learning-based gradient-free model predictive control based on cross-entropy method

被引:3
作者
Zheng, Lei [1 ]
Yang, Rui [2 ]
Wu, Zhixuan [2 ]
Pan, Jiesen [2 ]
Cheng, Hui [2 ]
机构
[1] Sun Yat Sen Univ, Sch Elect & Informat Technol, Guangzhou 510006, Peoples R China
[2] Sun Yat Sen Univ, Sch Comp Sci & Engn, Guangzhou 510006, Peoples R China
关键词
Model predictive control; Learning-based control; Cross-entropy method; Minimal intervention controller; TRAJECTORY GENERATION; SYSTEMS; ROBUST; ROBOTICS; BOUNDS;
D O I
10.1016/j.engappai.2022.104731
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, a safe and learning-based control framework for model predictive control (MPC) is proposed to optimize nonlinear systems with a non-differentiable objective function under uncertain environmental disturbances. The control framework integrates a learning-based MPC with an auxiliary controller in a way of minimal intervention. The learning-based MPC augments the prior nominal model with incremental Gaussian Processes to learn the uncertain disturbances. The cross-entropy method (CEM) is utilized as the sampling-based optimizer for the MPC with a non-differentiable objective function. A minimal intervention controller is devised with a control Lyapunov function and a control barrier function to guide the sampling process and endow the system with high probabilistic safety. The proposed algorithm shows a safe and adaptive control performance on a simulated quadrotor in the tasks of trajectory tracking and obstacle avoidance under uncertain wind disturbances.
引用
收藏
页数:14
相关论文
共 51 条
  • [11] Chua K, 2018, 32 C NEURAL INFORM P
  • [12] Reactive Trajectory Generation for Multiple Vehicles in Unknown Environments With Wind Disturbances
    Cole, Kenan
    Wickenheiser, Adam M.
    [J]. IEEE TRANSACTIONS ON ROBOTICS, 2018, 34 (05) : 1333 - 1348
  • [13] A tutorial on the cross-entropy method
    De Boer, PT
    Kroese, DP
    Mannor, S
    Rubinstein, RY
    [J]. ANNALS OF OPERATIONS RESEARCH, 2005, 134 (01) : 19 - 67
  • [14] Computing Large Convex Regions of Obstacle-Free Space Through Semidefinite Programming
    Deits, Robin
    Tedrake, Russ
    [J]. ALGORITHMIC FOUNDATIONS OF ROBOTICS XI, 2015, 107 : 109 - 124
  • [15] Desaraju Vishnu R., 2016, Robot Learning and Planning (RLP 2016), P29
  • [16] Finn Chelsea, 2017, 2017 IEEE International Conference on Robotics and Automation (ICRA), P2786, DOI 10.1109/ICRA.2017.7989324
  • [17] Adaptive model predictive control for a class of constrained linear systems based on the comparison model
    Fukushima, Hiroaki
    Kim, Tae-Hyoung
    Sugie, Toshiharu
    [J]. AUTOMATICA, 2007, 43 (02) : 301 - 308
  • [18] Grizzle J. W., 2002, NONLINEAR SYSTEMS, V3
  • [19] Real-Time Trajectory Generation for Quadrocopters
    Hehn, Markus
    D'Andrea, Raffaello
    [J]. IEEE TRANSACTIONS ON ROBOTICS, 2015, 31 (04) : 877 - 892
  • [20] Learning-Based Model Predictive Control: Toward Safe Learning in Control
    Hewing, Lukas
    Wabersich, Kim P.
    Menner, Marcel
    Zeilinger, Melanie N.
    [J]. ANNUAL REVIEW OF CONTROL, ROBOTICS, AND AUTONOMOUS SYSTEMS, VOL 3, 2020, 2020, 3 : 269 - 296