Safe learning-based gradient-free model predictive control based on cross-entropy method

Cited by: 3

Authors:

Zheng, Lei [1]

Yang, Rui [2]

Wu, Zhixuan [2]

Pan, Jiesen [2]

Cheng, Hui [2]

Affiliations:

[1] Sun Yat Sen Univ, Sch Elect & Informat Technol, Guangzhou 510006, Peoples R China

[2] Sun Yat Sen Univ, Sch Comp Sci & Engn, Guangzhou 510006, Peoples R China

Source:

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE | 2022 / Vol. 110

Keywords:

Model predictive control; Learning-based control; Cross-entropy method; Minimal intervention controller; TRAJECTORY GENERATION; SYSTEMS; ROBUST; ROBOTICS; BOUNDS;

DOI:

10.1016/j.engappai.2022.104731

Chinese Library Classification:

TP [Automation Technology, Computer Technology];

Discipline classification code:

0812;

Abstract:

This paper proposes a safe, learning-based model predictive control (MPC) framework for optimizing nonlinear systems with a non-differentiable objective function under uncertain environmental disturbances. The framework integrates a learning-based MPC with an auxiliary controller in a minimal-intervention manner. The learning-based MPC augments the prior nominal model with incremental Gaussian Processes to learn the uncertain disturbances. The cross-entropy method (CEM) serves as the sampling-based optimizer for the MPC with a non-differentiable objective function. A minimal intervention controller, built from a control Lyapunov function and a control barrier function, guides the sampling process and provides safety with high probability. The proposed algorithm demonstrates safe and adaptive control performance on a simulated quadrotor in trajectory tracking and obstacle avoidance tasks under uncertain wind disturbances.
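As a rough illustration of how CEM can act as a gradient-free optimizer over a control sequence, the sketch below plans for a toy one-dimensional double integrator with a non-differentiable cost. It is not the authors' implementation: the function names (`cem_plan`, `rollout_cost`), the dynamics, the cost terms, and all parameter values are assumptions made for this example, and the Gaussian Process model and the minimal intervention controller described in the abstract are omitted.

```python
import numpy as np

def cem_plan(cost_fn, horizon, n_samples=200, n_elite=20, n_iters=10,
             u_max=1.0, seed=0):
    """Cross-entropy method over a control sequence (illustrative sketch)."""
    rng = np.random.default_rng(seed)
    mean = np.zeros(horizon)          # initial sampling mean per time step
    std = np.full(horizon, u_max)     # initial sampling spread per time step
    for _ in range(n_iters):
        # Sample candidate control sequences and clip them to the input bounds.
        samples = rng.normal(mean, std, size=(n_samples, horizon))
        samples = np.clip(samples, -u_max, u_max)
        # Evaluate each candidate; only cost values are needed, no gradients.
        costs = np.array([cost_fn(u) for u in samples])
        # Refit the sampling distribution to the lowest-cost (elite) samples.
        elite = samples[np.argsort(costs)[:n_elite]]
        mean = elite.mean(axis=0)
        std = elite.std(axis=0) + 1e-6
    return mean

def rollout_cost(u_seq, x0=0.0, v0=0.0, target=1.0, dt=0.1):
    """Hypothetical toy cost: double integrator, L1 tracking, hard penalty."""
    x, v = x0, v0
    cost = 0.0
    for u in u_seq:
        v += u * dt
        x += v * dt
        cost += abs(x - target)            # non-differentiable at x == target
        cost += 10.0 if x > 1.5 else 0.0   # discontinuous "obstacle" penalty
    return cost

plan = cem_plan(rollout_cost, horizon=15)
```

In a receding-horizon loop, only the first element of `plan` would be applied before replanning from the new state.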

References

Pages: 14

51 entries in total

