Safe learning-based gradient-free model predictive control based on cross-entropy method

被引：5

作者：

Zheng, Lei ^{[1
]}

Yang, Rui ^{[2
]}

Wu, Zhixuan ^{[2
]}

Pan, Jiesen ^{[2
]}

Cheng, Hui ^{[2
]}

机构：

[1] Sun Yat Sen Univ, Sch Elect & Informat Technol, Guangzhou 510006, Peoples R China

[2] Sun Yat Sen Univ, Sch Comp Sci & Engn, Guangzhou 510006, Peoples R China

来源：

ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE | 2022年 / 110卷

关键词：

Model predictive control; Learning-based control; Cross-entropy method; Minimal intervention controller; TRAJECTORY GENERATION; SYSTEMS; ROBUST; ROBOTICS; BOUNDS;

D O I：

10.1016/j.engappai.2022.104731

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In this paper, a safe and learning-based control framework for model predictive control (MPC) is proposed to optimize nonlinear systems with a non-differentiable objective function under uncertain environmental disturbances. The control framework integrates a learning-based MPC with an auxiliary controller in a way of minimal intervention. The learning-based MPC augments the prior nominal model with incremental Gaussian Processes to learn the uncertain disturbances. The cross-entropy method (CEM) is utilized as the sampling-based optimizer for the MPC with a non-differentiable objective function. A minimal intervention controller is devised with a control Lyapunov function and a control barrier function to guide the sampling process and endow the system with high probabilistic safety. The proposed algorithm shows a safe and adaptive control performance on a simulated quadrotor in the tasks of trajectory tracking and obstacle avoidance under uncertain wind disturbances.

引用

页数：14

共 51 条

[1]

Ames AD, 2019, 2019 18TH EUROPEAN CONTROL CONFERENCE (ECC), P3420, DOI [10.23919/ECC.2019.8796030, 10.23919/ecc.2019.8796030]

[2] Control Barrier Function Based Quadratic Programs for Safety Critical Systems [J].

Ames, Aaron D. ;

Xu, Xiangru ;

Grizzle, Jessy W. ;

Tabuada, Paulo .

IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2017, 62 (08) :3861-3876

[3]

Andersen M., 2013, CVXOPT: A python package for convex optimization, version 1.1.6

[4] CasADi: a software framework for nonlinear optimization and optimal control [J].

Andersson, Joel A. E. ;

Gillis, Joris ;

Horn, Greg ;

Rawlings, James B. ;

Diehl, Moritz .

MATHEMATICAL PROGRAMMING COMPUTATION, 2019, 11 (01) :1-36

[5]

[Anonymous], 2001, Regularization, Optimization, and Beyond

[6]

[Anonymous], 1997, Primal-dual interior-point methods

[7] Provably safe and robust learning-based model predictive control [J].

Aswani, Anil ;

Gonzalez, Humberto ;

Sastry, S. Shankar ;

Tomlin, Claire .

AUTOMATICA, 2013, 49 (05) :1216-1226

[8]

Berkenkamp F, 2017, ADV NEUR IN, V30

[9]

Berkenkamp F, 2016, IEEE DECIS CONTR P, P4661, DOI 10.1109/CDC.2016.7798979

[10]

Bharadhwaj H., 2020, ARXIV PREPRINT ARXIV

← 1 2 3 4 5 6 →