Data-Driven Flotation Industrial Process Operational Optimal Control Based on Reinforcement Learning

被引：129

作者：

Jiang, Yi ^{[1
,2
,3
]}

Fan, Jialu ^{[1
,2
]}

Chai, Tianyou ^{[1
,2
]}

Li, Jinna ^{[1
,2
,4
]}

Lewis, Frank L. ^{[1
,2
,3
]}

机构：

[1] Northeastern Univ, State Key Lab Synthet Automat Proc Ind, Shenyang 110819, Liaoning, Peoples R China

[2] Northeastern Univ, Int Joint Res Lab Integrated Automat, Shenyang 110819, Liaoning, Peoples R China

[3] Univ Texas Arlington, UTA Res Inst, Arlington, TX 76118 USA

[4] Shenyang Univ Chem Technol, Sch Informat Engn, Shenyang 110142, Liaoning, Peoples R China

来源：

IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS | 2018年 / 14卷 / 05期

基金：

美国国家科学基金会;

关键词：

Flotation process; interleaved learning; model free; operational optimal control (OOC); reinforcement learning (RL); MODEL-PREDICTIVE CONTROL; ADAPTIVE OPTIMAL-CONTROL; OUTPUT-FEEDBACK CONTROL; SETPOINTS COMPENSATION; TIME-SYSTEMS; DESIGN;

D O I：

10.1109/TII.2017.2761852

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper studies the operational optimal control problem for the industrial flotation process, a key component in the mineral processing concentrator line. A new model-free data-driven method is developed here for real-time solution of this problem. A novel formulation is given for the optimal selection of the process control inputs that guarantees optimal tracking of the operational indices while maintaining the inputs within specified bounds. Proper tracking of prescribed operational indices, namely concentrate grade and tail grade, is essential in the proper economic operation of the flotation process. The difficulty in establishing an accurate mathematic model is overcome, and optimal controls are learned online in real time, using a novel form of reinforcement learning we call interleaved learning for online computation of the operational optimal control solution. Simulation experiments are provided to verify the effectiveness of the proposed interleaved learning method and to show that it performs significantly better than standard policy iteration and value iteration.

引用

页码：1974 / 1989

页数：16

共 49 条

[1] Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach [J].

Abu-Khalaf, M ;

Lewis, FL .

AUTOMATICA, 2005, 41 (05) :779-791

[2] Discrete-time nonlinear HJB solution using approximate dynamic programming: Convergence proof [J].

Al-Tamimi, Asma ;

Lewis, Frank L. ;

Abu-Khalaf, Murad .

IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2008, 38 (04) :943-949

[3] Optimizing process economics online using model predictive control [J].

Amrit, Rishi ;

Rawlings, James B. ;

Biegler, Lorenz T. .

COMPUTERS & CHEMICAL ENGINEERING, 2013, 58 :334-343

[4]

[Anonymous], 1999, Neural network control of robot manipulators and nonlinear systems

[5]

[Anonymous], 2001, Neural Networks: A Comprehensive Foundation

[6] An intelligent switching control for a mixed separation thickener process [J].

Chai, Tianyou ;

Jia, Yao ;

Li, Haibo ;

Wang, Hong .

CONTROL ENGINEERING PRACTICE, 2016, 57 :61-71

[7] Optimal operational control for complex industrial processes [J].

Chai, Tianyou ;

Qin, S. Joe ;

Wang, Hong .

ANNUAL REVIEWS IN CONTROL, 2014, 38 (01) :81-92

[8] Integrated Network-Based Model Predictive Control for Setpoints Compensation in Industrial Processes [J].

Chai, Tianyou ;

Zhao, Lin ;

Qiu, Jianbin ;

Liu, Fangzhou ;

Fan, Jialu .

IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2013, 9 (01) :417-426

[9] Hybrid intelligent control for optimal operation of shaft furnace roasting process [J].

Chai, Tianyou ;

Ding, Jinliang ;

Wu, Fenghua .

CONTROL ENGINEERING PRACTICE, 2011, 19 (03) :264-275

[10] Data-Driven Optimization Control for Safety Operation of Hematite Grinding Process [J].

Dai, Wei ;

Chai, Tianyou ;

Yang, Simon X. .

IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2015, 62 (05) :2930-2941

← 1 2 3 4 5 →