Robust Learning Model-Predictive Control for Linear Systems Performing Iterative Tasks

被引:21
作者
Rosolia, Ugo [1 ,2 ]
Zhang, Xiaojing [1 ]
Borrelli, Francesco [1 ]
机构
[1] Univ Calif Berkeley, Dept Mech Engn, Berkeley, CA 94701 USA
[2] CALTECH, Pasadena, CA 91125 USA
关键词
Iterative learning control; predictive control; robust control; MPC; SAFE;
D O I
10.1109/TAC.2021.3083559
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this article, a robust learning model-predictive controller (LMPC) for uncertain systems performing iterative tasks is presented. At each iteration of the control task, the closed-loop state, input, and cost are stored and used in the controller design. This article first illustrates how to construct robust control invariant sets and safe control policies exploiting historical data. Then, we propose an iterative LMPC design procedure, where data generated by a robust controller at iteration j are used to design a robust LMPC at the next iteration j + 1. We show that this procedure allows us to iteratively enlarge the domain of the control policy, and it guarantees recursive constraints satisfaction, input-to-state stability, and performance bounds for the certainty equivalent closed-loop system. The use of different feedback policies along the horizon is the key element of the proposed design. The effectiveness of the proposed control scheme is illustrated on a linear system subject to bounded additive disturbances.
引用
收藏
页码:856 / 869
页数:14
相关论文
共 56 条
[1]  
[Anonymous], 2001, P 40 IEEE C DECISION
[2]  
[Anonymous], 2005, IFAC P VOLUMES
[3]   Provably safe and robust learning-based model predictive control [J].
Aswani, Anil ;
Gonzalez, Humberto ;
Sastry, S. Shankar ;
Tomlin, Claire .
AUTOMATICA, 2013, 49 (05) :1216-1226
[4]   General interpolation in MPC and its advantages [J].
Bacic, M ;
Cannon, M ;
Lee, YI ;
Kouvaritakis, B .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2003, 48 (06) :1092-1096
[5]  
Bemporad A, 1999, LECT NOTES CONTR INF, V245, P207
[6]   Min-max control of constrained uncertain discrete-time linear systems [J].
Bemporad, A ;
Borrelli, F ;
Morari, M .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2003, 48 (09) :1600-1606
[7]  
Berkenkamp F, 2016, IEEE INT CONF ROBOT, P491, DOI 10.1109/ICRA.2016.7487170
[8]  
Bertsekas D. P., 1996, NEURO DYNAMIC PROGRA, V5
[9]   Feature-Based Aggregation and Deep Reinforcement Learning: A Survey and Some New Implementations [J].
Bertsekas, Dimitri P. .
IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2019, 6 (01) :1-31
[10]  
Bradtke SJ, 1996, MACH LEARN, V22, P33, DOI 10.1007/BF00114723