Machine learning modeling and predictive control of nonlinear processes using noisy data

被引:47
作者
Wu, Zhe [1 ]
Rincon, David [1 ]
Luo, Junwei [1 ]
Christofides, Panagiotis D. [1 ,2 ]
机构
[1] Univ Calif Los Angeles, Dept Chem & Biomol Engn, Los Angeles, CA 90095 USA
[2] Univ Calif Los Angeles, Dept Elect & Comp Engn, Los Angeles, CA USA
基金
美国国家科学基金会;
关键词
long short-term memory; machine learning; model predictive control; noisy data; nonlinear systems; RECURRENT NEURAL-NETWORKS; PRINCIPAL COMPONENT ANALYSIS; SYSTEM-IDENTIFICATION; ALGORITHM; ARX; MPC;
D O I
10.1002/aic.17164
中图分类号
TQ [化学工业];
学科分类号
0817 ;
摘要
This work focuses on machine learning modeling and predictive control of nonlinear processes using noisy data. We use long short-term memory (LSTM) networks with training data from sensor measurements corrupted by two types of noise: Gaussian and non-Gaussian noise, to train the process model that will be used in a model predictive controller (MPC). We first discuss the LSTM training with noisy data following a Gaussian distribution, and demonstrate that the standard LSTM network is capable of capturing the underlying process dynamic behavior by reducing the impact of noise. Subsequently, given that the standard LSTM performs poorly on a noisy data set from industrial operation (i.e., non-Gaussian noisy data), we propose an LSTM network using Monte Carlo dropout method to reduce the overfitting to noisy data. Furthermore, an LSTM network using co-teaching training method is proposed to further improve its approximation performance when noise-free data from a process model capturing the nominal process state evolution is available. A chemical process example is used throughout the manuscript to illustrate the application of the proposed modeling approaches and demonstrate their open- and closed-loop performance under a Lyapunov-based model predictive controller with state measurements corrupted by industrial noise.
引用
收藏
页数:15
相关论文
共 56 条
[1]   Nonlinear model predictive control for distributed parameter systems using data driven artificial neural network models [J].
Aggelogiannaki, Eleni ;
Sarimveis, Haralambos .
COMPUTERS & CHEMICAL ENGINEERING, 2008, 32 (06) :1225-1237
[2]   Nonlinear system identification for predictive control using continuous time recurrent neural networks and automatic differentiation [J].
Al Seyab, R. K. ;
Cao, Y. .
JOURNAL OF PROCESS CONTROL, 2008, 18 (06) :568-581
[3]   Artificial Intelligence techniques applied as estimator in chemical process systems - A literature survey [J].
Ali, Jarinah Mohd ;
Hussain, M. A. ;
Tade, Moses O. ;
Zhang, Jie .
EXPERT SYSTEMS WITH APPLICATIONS, 2015, 42 (14) :5915-5931
[4]   Data-driven model predictive quality control of batch processes [J].
Aumi, Siam ;
Corbett, Brandon ;
Clarke-Pringle, Tracy ;
Mhaskar, Prashant .
AICHE JOURNAL, 2013, 59 (08) :2852-2861
[5]  
Bishop Christopher M, 2006, PATTERN RECOGN, V128, P1, DOI [10.1117/1.2819119, DOI 10.1117/1]
[6]   Optimization and control of a thin film growth process: A hybrid first principles/artificial neural network based multiscale modelling approach [J].
Chaffart, Donovan ;
Ricardez-Sandoval, Luis A. .
COMPUTERS & CHEMICAL ENGINEERING, 2018, 119 :465-479
[7]   DYNAMIC RECURRENT NEURAL-NETWORK FOR SYSTEM-IDENTIFICATION AND CONTROL [J].
DELGADO, A ;
KAMBHAMPATI, C ;
WARWICK, K .
IEE PROCEEDINGS-CONTROL THEORY AND APPLICATIONS, 1995, 142 (04) :307-314
[8]   Identification of ARX and ARARX Models in the Presence of Input and Output Noises [J].
Diversi, Roberto ;
Guidorzi, Roberto ;
Soverini, Umberto .
EUROPEAN JOURNAL OF CONTROL, 2010, 16 (03) :242-255
[9]   An encoder-decoder LSTM-based EMPC framework applied to a building HVAC system [J].
Ellis, Matthew J. ;
Chinde, Venkatesh .
CHEMICAL ENGINEERING RESEARCH & DESIGN, 2020, 160 :508-520
[10]   APPROXIMATION OF DYNAMICAL-SYSTEMS BY CONTINUOUS-TIME RECURRENT NEURAL NETWORKS [J].
FUNAHASHI, K ;
NAKAMURA, Y .
NEURAL NETWORKS, 1993, 6 (06) :801-806