Robust Learning-Based Predictive Control for Discrete-Time Nonlinear Systems With Unknown Dynamics and State Constraints
被引:33
|
作者:
Zhang, Xinglong
论文数: 0引用数: 0
h-index: 0
机构:
Natl Univ Def Technol, Coll Intelligence Sci & Technol, Changsha 410073, Peoples R ChinaNatl Univ Def Technol, Coll Intelligence Sci & Technol, Changsha 410073, Peoples R China
Zhang, Xinglong
[1
]
Liu, Jiahang
论文数: 0引用数: 0
h-index: 0
机构:
Natl Univ Def Technol, Coll Intelligence Sci & Technol, Changsha 410073, Peoples R ChinaNatl Univ Def Technol, Coll Intelligence Sci & Technol, Changsha 410073, Peoples R China
Liu, Jiahang
[1
]
Xu, Xin
论文数: 0引用数: 0
h-index: 0
机构:
Natl Univ Def Technol, Coll Intelligence Sci & Technol, Changsha 410073, Peoples R ChinaNatl Univ Def Technol, Coll Intelligence Sci & Technol, Changsha 410073, Peoples R China
Xu, Xin
[1
]
Yu, Shuyou
论文数: 0引用数: 0
h-index: 0
机构:
Jilin Univ, State Key Lab Automot Simulat & Control, Changchun 130025, Peoples R China
Jilin Univ, Dept Control Sci & Engn, Changchun 130025, Peoples R ChinaNatl Univ Def Technol, Coll Intelligence Sci & Technol, Changsha 410073, Peoples R China
Yu, Shuyou
[2
,3
]
Chen, Hong
论文数: 0引用数: 0
h-index: 0
机构:
Tongji Univ, Coll Elect & Informat Engn, Shanghai 201804, Peoples R ChinaNatl Univ Def Technol, Coll Intelligence Sci & Technol, Changsha 410073, Peoples R China
Chen, Hong
[4
]
机构:
[1] Natl Univ Def Technol, Coll Intelligence Sci & Technol, Changsha 410073, Peoples R China
[2] Jilin Univ, State Key Lab Automot Simulat & Control, Changchun 130025, Peoples R China
[3] Jilin Univ, Dept Control Sci & Engn, Changchun 130025, Peoples R China
[4] Tongji Univ, Coll Elect & Informat Engn, Shanghai 201804, Peoples R China
来源:
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS
|
2022年
/
52卷
/
12期
基金:
中国博士后科学基金;
中国国家自然科学基金;
关键词:
Robustness;
Predictive control;
Aerospace electronics;
Computational modeling;
Predictive models;
Optimal control;
Heuristic algorithms;
Model predictive control (MPC);
nonlinear systems;
reinforcement learning (RL);
robustness;
state constraints;
TRACKING CONTROL;
LINEAR-SYSTEMS;
STABILITY;
D O I:
10.1109/TSMC.2022.3146284
中图分类号:
TP [自动化技术、计算机技术];
学科分类号:
0812 ;
摘要:
Robust model predictive control (MPC) is a well-known control technique for model-based control with constraints and uncertainties. In classic robust tube-based MPC approaches, an open-loop control sequence is computed via periodically solving an online nominal MPC problem, which requires prior model information and frequent access to onboard computational resources. In this article, we propose an efficient robust MPC solution based on receding horizon reinforcement learning, called r-LPC, for unknown nonlinear systems with state constraints and disturbances. The proposed r-LPC utilizes a Koopman operator-based prediction model obtained offline from precollected input-output datasets. Unlike classic tube-based MPC, in each prediction time interval of r-LPC, we use an actor-critic structure to learn a near-optimal feedback control policy rather than a control sequence. The resulting closed-loop control policy can be learned offline and deployed online or learned online in an asynchronous way. In the latter case, online learning can be activated whenever necessary; for instance, the safety constraint is violated with the deployed policy. The closed-loop recursive feasibility, robustness, and asymptotic stability are proven under function approximation errors of the actor-critic networks. Simulation and experimental results on two nonlinear systems with unknown dynamics and disturbances have demonstrated that our approach has better or comparable performance when compared with tube-based MPC and linear quadratic regulator, and outperforms a recently developed actor-critic learning approach.
机构:
Cent South Univ, Sch Automat, Changsha 410083, Hunan, Peoples R ChinaCent South Univ, Sch Automat, Changsha 410083, Hunan, Peoples R China
Liu, Fang
Peng, Hui
论文数: 0引用数: 0
h-index: 0
机构:
Cent South Univ, Sch Automat, Changsha 410083, Hunan, Peoples R China
Xiangjiang Lab, Changsha 410205, Peoples R ChinaCent South Univ, Sch Automat, Changsha 410083, Hunan, Peoples R China
机构:
Tongji Univ, Dept Control Sci & Engn, Shanghai 201804, Peoples R ChinaTongji Univ, Dept Control Sci & Engn, Shanghai 201804, Peoples R China
Bao, Hanqiu
Kang, Qi
论文数: 0引用数: 0
h-index: 0
机构:
Tongji Univ, Dept Control Sci & Engn, Shanghai 201804, Peoples R ChinaTongji Univ, Dept Control Sci & Engn, Shanghai 201804, Peoples R China
Kang, Qi
Shi, Xudong
论文数: 0引用数: 0
h-index: 0
机构:
Shanghai Inst Technol, Sch Elect & Elect Engn, Shanghai 201418, Peoples R ChinaTongji Univ, Dept Control Sci & Engn, Shanghai 201804, Peoples R China
Shi, Xudong
Xiao, Lingfei
论文数: 0引用数: 0
h-index: 0
机构:
Nanjing Univ Aeronaut & Astronaut, Coll Energy & Power Engn, Nanjing 210016, Peoples R ChinaTongji Univ, Dept Control Sci & Engn, Shanghai 201804, Peoples R China
Xiao, Lingfei
An, Jing
论文数: 0引用数: 0
h-index: 0
机构:
Shanghai Inst Technol, Sch Elect & Elect Engn, Shanghai 201418, Peoples R ChinaTongji Univ, Dept Control Sci & Engn, Shanghai 201804, Peoples R China
机构:
Univ Fed Rio de Janeiro, COPPE, Chem Engn Program, BR-21941972 Rio De Janeiro, BrazilUniv Fed Rio de Janeiro, COPPE, Chem Engn Program, BR-21941972 Rio De Janeiro, Brazil
Goncalves, Guilherme A. A.
Guay, Martin
论文数: 0引用数: 0
h-index: 0
机构:
Queens Univ, Dept Chem Engn, Kingston, ON K7L 3N6, CanadaUniv Fed Rio de Janeiro, COPPE, Chem Engn Program, BR-21941972 Rio De Janeiro, Brazil
机构:
City Univ Hong Kong, Dept Biomed Engn, Hong Kong, Peoples R ChinaGuangdong Univ Technol, Sch Automat, Guangdong Key Lab IoT Informat Technol, Guangzhou 510006, Peoples R China
Jiang, Yi
Xie, Kan
论文数: 0引用数: 0
h-index: 0
机构:
Guangdong Univ Technol, Ctr Intelligent Batch Mfg Based IoT Technol 111, Sch Automat, Guangzhou 510006, Peoples R ChinaGuangdong Univ Technol, Sch Automat, Guangdong Key Lab IoT Informat Technol, Guangzhou 510006, Peoples R China
机构:
Harbin Inst Technol, Ctr Control Theory & Guidance Technol, Harbin 150001, Peoples R ChinaHarbin Inst Technol, Ctr Control Theory & Guidance Technol, Harbin 150001, Peoples R China
Wang, Xiubo
Duan, Guangren
论文数: 0引用数: 0
h-index: 0
机构:
Harbin Inst Technol, Ctr Control Theory & Guidance Technol, Harbin 150001, Peoples R China
Southern Univ Sci & Technol, Ctr Control Sci & Technol, Shenzhen 518055, Peoples R ChinaHarbin Inst Technol, Ctr Control Theory & Guidance Technol, Harbin 150001, Peoples R China