A practically implementable reinforcement learning control approach by leveraging offset-free model predictive control

Cited by: 6
Authors
Hassanpour, Hesam [1]
Mhaskar, Prashant [1]
Corbett, Brandon [2]
Affiliations
[1] McMaster Univ, Dept Chem Engn, Hamilton, ON L8S 4L7, Canada
[2] Sartorius, Oakville, ON, Canada
Funding
Natural Sciences and Engineering Research Council of Canada
Keywords
Reinforcement learning; Machine learning; Offset-free model predictive control; Process control; CONTROL SYSTEM; MPC;
DOI
10.1016/j.compchemeng.2023.108511
Chinese Library Classification number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
This work addresses the problem of designing an offset-free, practically implementable reinforcement learning (RL) controller for nonlinear processes. RL-based controllers can update the control policy using data observed online from controller-process interactions. This alleviates the need for the regular model maintenance step that is essential in advanced control techniques such as model predictive control (MPC). However, an RL agent requires random exploration to find optimal state-action regions and eventually reach the optimal policy. Such exploration is not implementable in practice because of safety concerns and economic objectives. To address this issue, a pre-training strategy is proposed to provide a secure platform for online implementation of RL controllers. To this end, an offset-free MPC (representative of industrial MPC) optimization problem is leveraged to train the RL agent offline. Once it achieves performance similar to that of the offset-free MPC, the RL agent is used for online control, interacting with the actual process. The efficacy of the proposed approach in handling nonlinearity and changes in plant operating conditions (due to unmeasured disturbances) is demonstrated through simulations of a chemical reactor example for a pH neutralization process. The results show that the proposed RL controller can significantly improve on the oscillatory closed-loop responses that the offset-free MPC produces under plant-model mismatch and unmeasured disturbances.
Pages: 10