Convex Neural Network-Based Cost Modifications for Learning Model Predictive Control

被引:9
作者
Seel, Katrine [1 ,2 ]
Kordabad, Arash Bahari [1 ]
Gros, Sebastien [1 ]
Gravdahl, Jan Tommy [1 ]
机构
[1] Norwegian Univ Sci & Technol, Dept Engn Cybernet, N-7032 Trondheim, Norway
[2] SINTEF Digital Math & Cybernet, N-0373 Oslo, Norway
来源
IEEE OPEN JOURNAL OF CONTROL SYSTEMS | 2022年 / 1卷
关键词
Costs; Stability analysis; Cost function; Standards; Predictive models; Predictive control; Numerical stability; Dissipativity; economic nonlinear model predictive control; neural networks; reinforcement learning; MPC; STABILITY;
D O I
10.1109/OJCSYS.2022.3221063
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Developing model predictive control (MPC) schemes can be challenging for systems where an accurate model is not available, or too costly to develop. With the increasing availability of data and tools to treat them, learning-based MPC has of late attracted wide attention. It has recently been shown that adapting not only the MPC model, but also its cost function is conducive to achieving optimal closed-loop performance when an accurate model cannot be provided. In the learning context, this modification can be performed via parametrizing the MPC cost and adjusting the parameters via, e.g., reinforcement learning (RL). In this framework, simple cost parametrizations can be effective, but the underlying theory suggests that rich parametrizations in principle can be useful. In this paper, we propose such a cost parametrization using a class of neural networks (NNs) that preserves convexity. This choice avoids creating difficulties when solving the MPC problem via sensitivity-based solvers. In addition, this choice of cost parametrization ensures nominal stability of the resulting MPC scheme. Moreover, we detail how this choice can be applied to economic MPC problems where the cost function is generic and therefore does not necessarily fulfill any specific property.
引用
收藏
页码:366 / 379
页数:14
相关论文
共 35 条
[1]  
Amos Brandon, 2017, P MACHINE LEARNING R, V70
[2]   On Average Performance and Stability of Economic Model Predictive Control [J].
Angeli, David ;
Amrit, Rishi ;
Rawlings, James B. .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2012, 57 (07) :1615-1626
[3]  
Bertsekas D. P., 2017, Dynamic programming and optimal control, VI
[4]  
Bnning F., 2021, P 3 C LEARNING DYNAM, P251
[5]  
Boyd S., 2004, Convex optimization
[6]   MPC-based Reinforcement Learning for a Simplified Freight Mission of Autonomous Surface Vehicles [J].
Cai, Wenqi ;
Kordabad, Arash B. ;
Esfahani, Hossein N. ;
Lekkas, Anastasios M. ;
Gros, Sebastien .
2021 60TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2021, :2990-2995
[7]   Log-Sum-Exp Neural Networks and Posynomial Models for Convex and Log-Log-Convex Data [J].
Calafiore, Giuseppe C. ;
Gaubert, Stephane ;
Possieri, Corrado .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (03) :827-838
[8]   Adaptation strategies for real-time optimization [J].
Chachuat, B. ;
Srinivasan, B. ;
Bonvin, D. .
COMPUTERS & CHEMICAL ENGINEERING, 2009, 33 (10) :1557-1567
[9]  
Chen Y., 2018, P 31 INT C MACH LEAR
[10]  
Chollet F., 2015, Keras