Data-Driven MPC for Nonlinear Systems with Reinforcement Learning

被引：0

作者：

Li, Yiran ^{[1
]}

Wang, Qian ^{[1
]}

Sun, Zhongqi ^{[1
,2
]}

Xia, Yuanqing ^{[1
]}

机构：

[1] Beijing Inst Technol, Sch Automat, Beijing 100081, Peoples R China

[2] Beijing Inst Technol, Yangtze Delta Reg Acad, Jiaxing 314019, Peoples R China

来源：

2022 41ST CHINESE CONTROL CONFERENCE (CCC) | 2022年

关键词：

Model predictive control (MPC); reinforcement learning (RL); data-driven method; nonlinear systems;

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Inspired by Willems and the co-authors' idea that continuously excited system trajectories can be used to represent the input-output behavior of discrete-time linear time-invariant (DT LTI) systems. We extend this idea to nonlinear systems. In this paper, we propose a data-driven model predictive control (MPC) scheme with reinforcement learning (RL) for unknown nonlinear systems. We utilize the input-output data of the system to form Hankel matrices to represent the system model implicitly. The accuracy of the prediction is improved by updating the data online. Another core idea of this scheme is to combine the standard MPC with RL to approximate the terminal cost function by TD-learning to ensure the closed-loop stability of the system. Simulation experiments on the cart-damper-spring system were used to demonstrate the feasibility of the proposed algorithm.

引用

页码：2404 / 2409

页数：6

共 20 条

[1] Alsalti M., 2021, P IEEE C DEC CONTR
[2] Berberich J., 2021, ARXIV210508567
[3] Data-Driven Model Predictive Control With Stability and Robustness Guarantees
Berberich, Julian
Koehler, Johannes
Mueller, Matthias A.
Allgoewer, Frank
[J]. IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2021, 66 (04) : 1702 - 1717
[4] A trajectory-based framework for data-driven system analysis and control
Berberich, Julian
Allgower, Frank
[J]. 2020 EUROPEAN CONTROL CONFERENCE (ECC 2020), 2020, : 1365 - 1370
[5] Coulson J, 2019, 2019 18TH EUROPEAN CONTROL CONFERENCE (ECC), P307, DOI [10.23919/ECC.2019.8795639, 10.23919/ecc.2019.8795639]
[6] [崔丽群 Cui Liqun], 2019, [计算机工程, Computer Engineering], V45, P108
[7] Finlayson B. A., 2013, METHOD WEIGHTED RESI
[8] GALLIERI M, 2019, ARXIV191106556
[9] Hoeller D., 2020, C ROBOT LEARNING, P990
[10] Hua YM, 2015, PROCEEDINGS OF 2015 INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND INTERNET OF THINGS, P1, DOI 10.1109/ICAIOT.2015.7111524

← 1 2 →