Semi-supervised mixture of latent factor analysis models with application to online key variable estimation

被引:23
作者
Shao, Weiming [1 ]
Ge, Zhiqiang [1 ]
Song, Zhihuan [1 ]
机构
[1] Zhejiang Univ, State Key Lab Ind Control Technol, Coll Control Sci & Engn, Hangzhou 310027, Zhejiang, Peoples R China
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
Virtual sensor; Semi-supervised learning; Mixture of latent factor analysis models; Akaike information criterion; Expectation-maximization; COMPONENT REGRESSION-MODEL; ADAPTIVE SOFT SENSOR; QUALITY PREDICTION; CHEMICAL-PROCESSES; ANALYTICS;
D O I
10.1016/j.conengprac.2018.11.008
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Data-driven virtual sensors have extensive applications in industrial processes for online estimating those important but difficult-to-measure variables. In the virtual sensor application, labeled samples could be very infrequent due to technical or economical limitations, thereby virtual sensors developed upon insufficient labeled samples may not be well trained, which leads to poor estimation performance. In addition, industrial processes are inherently stochastic and the vast majority of them present nonlinear and non-Gaussian characteristics. To cope with these issues, this paper proposes a semi-supervised mixture of latent factor analysis models (S(2)MLFA). In the S(2)MLFA, the insufficiency of labeled samples is remedied by exploiting both labeled and unlabeled data sets, while the nonlinear and non-Gaussian characteristics are handled by the mixture model structure. Moreover, the process uncertainties are modeled by the probabilistic model formulation. An efficient expectation-maximization-based learning algorithm is developed for training the S(2)MLFA, and a modified Akaike information criterion is presented for model selection. The S(2)MLFA is investigated by a numerical example and two real-world industrial processes, through which the effectiveness and feasibility of the proposed schemes are verified.
引用
收藏
页码:32 / 47
页数:16
相关论文
共 36 条
[1]   NEW LOOK AT STATISTICAL-MODEL IDENTIFICATION [J].
AKAIKE, H .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1974, AC19 (06) :716-723
[2]  
[Anonymous], 1908, BIOMETRIKA, V6, P1
[3]  
Bishop C. M., 2006, PATTERN RECOGNITION, DOI DOI 10.1117/1.2819119
[4]   Multimodel inference - understanding AIC and BIC in model selection [J].
Burnham, KP ;
Anderson, DR .
SOCIOLOGICAL METHODS & RESEARCH, 2004, 33 (02) :261-304
[5]   Monitoring Nonlinear and Non-Gaussian Processes Using Gaussian Mixture Model-Based Weighted Kernel Independent Component Analysis [J].
Cai, Lianfang ;
Tian, Xuemin ;
Chen, Sheng .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2017, 28 (01) :122-135
[6]   MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM [J].
DEMPSTER, AP ;
LAIRD, NM ;
RUBIN, DB .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01) :1-38
[7]   Process Data Analytics via Probabilistic Latent Variable Models: A Tutorial Review [J].
Ge, Zhiqiang .
INDUSTRIAL & ENGINEERING CHEMISTRY RESEARCH, 2018, 57 (38) :12646-12661
[8]   Data Mining and Analytics in the Process Industry: The Role of Machine Learning [J].
Ge, Zhiqiang ;
Song, Zhihuan ;
Deng, Steven X. ;
Huang, Biao .
IEEE ACCESS, 2017, 5 :20590-20616
[9]   Supervised Latent Factor Analysis for Process Data Regression Modeling and Soft Sensor Application [J].
Ge, Zhiqiang .
IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2016, 24 (03) :1004-1011
[10]   Mixture Bayesian Regularization of PCR Model and Soft Sensing Application [J].
Ge, Zhiqiang .
IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2015, 62 (07) :4336-4343