Differentiable predictive control: Deep learning alternative to explicit model predictive control for unknown nonlinear systems

被引:29
作者
Drgona, Jan [1 ]
Kis, Karol [2 ]
Tuor, Aaron [1 ]
Vrabie, Draguna [1 ]
Klauco, Martin [2 ]
机构
[1] Pacific Northwest Natl Lab, Richland, WA 99354 USA
[2] Slovak Univ Technol Bratislava, Bratislava, Slovakia
关键词
Differentiable predictive control; Model predictive control; Neural state space models; Data-driven differentiable optimization; Deep learning; COMPLEXITY REDUCTION; STABILITY; MPC;
D O I
10.1016/j.jprocont.2022.06.001
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We present differentiable predictive control (DPC) as a deep learning-based alternative to the explicit model predictive control (MPC) for unknown nonlinear systems. In the DPC framework, a neural state space model is learned from time-series measurements of the system dynamics. The neural control policy is then optimized via stochastic gradient descent approach by differentiating the MPC loss function through the closed-loop system dynamics model. The proposed DPC method learns model based control policies with state and input constraints, while supporting time-varying references and constraints. In embedded implementation using a Raspberry-Pi platform, we experimentally demonstrate that it is possible to train constrained control policies purely based on the measurements of the unknown nonlinear system. We compare the control performance of the DPC method against explicit MPC and report efficiency gains in online computational demands, memory requirements, policy complexity, and construction time. In particular, we show that our method scales linearly compared to exponential scalability of the explicit MPC solved via multiparametric programming. (C) 2022 Battelle Memorial Institute and The Author(s). Published by Elsevier Ltd.
引用
收藏
页码:80 / 92
页数:13
相关论文
共 96 条
[1]  
Amos B., 2018, CORR ABS181013400 AR
[2]  
Amos B., 2016, CORR ABS160907152 AR
[3]   CasADi: a software framework for nonlinear optimization and optimal control [J].
Andersson, Joel A. E. ;
Gillis, Joris ;
Horn, Greg ;
Rawlings, James B. ;
Diehl, Moritz .
MATHEMATICAL PROGRAMMING COMPUTATION, 2019, 11 (01) :1-36
[4]  
Belbute-Peres FD, 2018, ADV NEUR IN, V31
[5]   The explicit linear quadratic regulator for constrained systems [J].
Bemporad, A ;
Morari, M ;
Dua, V ;
Pistikopoulos, EN .
AUTOMATICA, 2002, 38 (01) :3-20
[6]  
Bemporad A, 1999, LECT NOTES CONTR INF, V245, P207
[7]  
Bieker K., 2019, CORR ABS190510094 AR
[8]  
Bodla N., IEEE I CONF COMP VIS, p5561 5569
[9]  
Borrelli F., 2017, P 40 IEEE C DECISION
[10]  
Broad A., 2018, CORR ABS180801184 AR