Convex Neural Network-Based Cost Modifications for Learning Model Predictive Control

被引：9

作者：

Seel, Katrine ^{[1
,2
]}

Kordabad, Arash Bahari ^{[1
]}

Gros, Sebastien ^{[1
]}

Gravdahl, Jan Tommy ^{[1
]}

机构：

[1] Norwegian Univ Sci & Technol, Dept Engn Cybernet, N-7032 Trondheim, Norway

[2] SINTEF Digital Math & Cybernet, N-0373 Oslo, Norway

来源：

IEEE OPEN JOURNAL OF CONTROL SYSTEMS | 2022年 / 1卷

关键词：

Costs; Stability analysis; Cost function; Standards; Predictive models; Predictive control; Numerical stability; Dissipativity; economic nonlinear model predictive control; neural networks; reinforcement learning; MPC; STABILITY;

D O I：

10.1109/OJCSYS.2022.3221063

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Developing model predictive control (MPC) schemes can be challenging for systems where an accurate model is not available, or too costly to develop. With the increasing availability of data and tools to treat them, learning-based MPC has of late attracted wide attention. It has recently been shown that adapting not only the MPC model, but also its cost function is conducive to achieving optimal closed-loop performance when an accurate model cannot be provided. In the learning context, this modification can be performed via parametrizing the MPC cost and adjusting the parameters via, e.g., reinforcement learning (RL). In this framework, simple cost parametrizations can be effective, but the underlying theory suggests that rich parametrizations in principle can be useful. In this paper, we propose such a cost parametrization using a class of neural networks (NNs) that preserves convexity. This choice avoids creating difficulties when solving the MPC problem via sensitivity-based solvers. In addition, this choice of cost parametrization ensures nominal stability of the resulting MPC scheme. Moreover, we detail how this choice can be applied to economic MPC problems where the cost function is generic and therefore does not necessarily fulfill any specific property.

引用

页码：366 / 379

页数：14

共 35 条

[1]

Amos Brandon, 2017, P MACHINE LEARNING R, V70

[2] On Average Performance and Stability of Economic Model Predictive Control [J].