Direct learning of improved control policies from historical plant data

被引：3

作者：

Alhazmi, Khalid ^{[1
]}

Sarathy, S. Mani ^{[1
]}

机构：

[1] King Abdullah Univ Sci & Technol KAUST, Phys Sci & Engn Div PSE, Thuwal 239556900, Saudi Arabia

来源：

COMPUTERS & CHEMICAL ENGINEERING | 2024年 / 185卷

关键词：

Model predictive control; Reinforcement learning; Deep learning; Process control; MODEL-PREDICTIVE CONTROL; SYSTEMS; CHALLENGES;

D O I：

10.1016/j.compchemeng.2024.108662

中图分类号：

TP39 [计算机的应用];

学科分类号：

081203 ; 0835 ;

摘要：

The continuous optimization of the operational performance of chemical plants is of fundamental importance. This research proposes a method that utilizes policy-constrained offline reinforcement learning to learn improved control policies from abundant historical plant data available in industrial settings. As a case study, historical data is generated from a nonlinear chemical system controlled by an economic model predictive controller (EMPC). However, the method's principles are broadly applicable. Theoretically, it is demonstrated that the learning-based controller inherits stability guarantees from the baseline EMPC. Experimentally, we validate that our method enhances the optimality of the baseline controller while preserving stability, improving the baseline policy by 1% to 20%. The results of this study offer a promising direction for the general improvement of advanced control systems, both data-informed and stability-guaranteed.

引用

页数：11