A hierarchical constrained reinforcement learning for optimization of bitumen recovery rate in a primary separation vessel

被引:18
作者
Shafi, Hareem [1 ]
Velswamy, Kirubakaran [1 ]
Ibrahim, Fadi [1 ]
Huang, Biao [1 ]
机构
[1] Univ Alberta, Dept Chem & Mat Engn, Edmonton, AB, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
Primary separation vessel; Oil sands; Machine learning; Reinforcement learning; Process control;
D O I
10.1016/j.compchemeng.2020.106939
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
This work proposes a two-level hierarchical constrained control structure for reinforcement learning (RL) with application in a Primary Separation Vessel (PSV). The lower level is concerned with servo tracking and regulation of the interface level against variances in ore quality by manipulating middlings flow rate. At the higher level, with the objective to optimize bitumen recovery rate, a supervisory interface level setpoint control is implemented. To prevent sanding, tailings density regulation using tailings withdrawal flow rate is proposed. For each case, an asynchronous advantage actor-critic (A3C) based agent is chosen to interact with a high-fidelity PSV model to learn the near optimal control strategy through episodic interactions. Each of the three control loops is sequentially learnt. In the interface level control loop, a behavioral cloning based two-phase learning scheme to promote stable state space exploration is proposed. The proposed hierarchical structure successfully demonstrates improved bitumen recovery rate by manipulating the interface level while preventing sanding. (c) 2020 Elsevier Ltd. All rights reserved.
引用
收藏
页数:16
相关论文
共 34 条
  • [1] [Anonymous], 1969, ONE DIMENSIONAL 2 PH, DOI DOI 10.1002/AIC.690160603
  • [2] [Anonymous], 2014, 31 INT C MACH LEARN
  • [3] CAPP, 2016, STAT HDB CAN UPSTR P, P233, DOI [10.4067/S0718-95162017005000034, DOI 10.4067/S0718-95162017005000034]
  • [4] Cleveland C.J., 2014, HDB ENERGY, VII, DOI [10.1016/B978-0-12-397219-4.00009-6, DOI 10.1016/B978-0-12-397219-4.00009-6]
  • [5] SETTLING VELOCITIES OF PARTICULATE SYSTEMS .2. SETTLING VELOCITIES OF SUSPENSIONS OF SPHERICAL-PARTICLES
    CONCHA, F
    ALMENDRA, ER
    [J]. INTERNATIONAL JOURNAL OF MINERAL PROCESSING, 1979, 6 (01) : 31 - 41
  • [6] Cui YD, 2018, IEEE INT CON AUTO SC, P304, DOI 10.1109/COASE.2018.8560593
  • [7] Fortunato M., 2017, Noisy networks for exploration
  • [8] An approximate dynamic programming method for the optimal control of Alkai-Surfactant-Polymer flooding
    Ge, Yulei
    Li, Shurong
    Chan, Peng
    [J]. JOURNAL OF PROCESS CONTROL, 2018, 64 : 15 - 26
  • [9] Gilbert W.A., 2004, THESIS
  • [10] Government of Canada, 2018, WHAT AR OIL SANDS