Direct learning of improved control policies from historical plant data

被引:3
作者
Alhazmi, Khalid [1 ]
Sarathy, S. Mani [1 ]
机构
[1] King Abdullah Univ Sci & Technol KAUST, Phys Sci & Engn Div PSE, Thuwal 239556900, Saudi Arabia
关键词
Model predictive control; Reinforcement learning; Deep learning; Process control; MODEL-PREDICTIVE CONTROL; SYSTEMS; CHALLENGES;
D O I
10.1016/j.compchemeng.2024.108662
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The continuous optimization of the operational performance of chemical plants is of fundamental importance. This research proposes a method that utilizes policy-constrained offline reinforcement learning to learn improved control policies from abundant historical plant data available in industrial settings. As a case study, historical data is generated from a nonlinear chemical system controlled by an economic model predictive controller (EMPC). However, the method's principles are broadly applicable. Theoretically, it is demonstrated that the learning-based controller inherits stability guarantees from the baseline EMPC. Experimentally, we validate that our method enhances the optimality of the baseline controller while preserving stability, improving the baseline policy by 1% to 20%. The results of this study offer a promising direction for the general improvement of advanced control systems, both data-informed and stability-guaranteed.
引用
收藏
页数:11
相关论文
empty
未找到相关数据