Semisupervised Bayesian Gaussian Mixture Models for Non-Gaussian Soft Sensor

被引:36
作者
Shao, Weiming [1 ]
Ge, Zhiqiang [2 ,3 ]
Song, Zhihuan [2 ,3 ]
机构
[1] China Univ Petr, Coll New Energy, Qingdao 266580, Peoples R China
[2] Zhejiang Univ, State Key Lab Ind Control Technol, Hangzhou 310007, Peoples R China
[3] Zhejiang Univ, Coll Control Sci & Engn, Hangzhou 310007, Peoples R China
关键词
Bayes methods; Gaussian mixture model; Sensors; Numerical models; Germanium; Biological system modeling; Bayesian Gaussian mixture models (GMMs); non-Gaussian industrial processes; semisupervised learning; soft sensor; weighted variational inference (WVI); COMPONENT REGRESSION-MODEL; QUALITY PREDICTION; CHEMICAL-PROCESSES; INFERENCE;
D O I
10.1109/TCYB.2019.2947622
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Soft sensors have been widely accepted for online estimating key quality-related variables in industrial processes. The Gaussian mixture models (GMM) is one of the most popular soft sensing methods for the non-Gaussian industrial processes. However, in industrial applications, the quantity of samples with known labels is usually quite limited because of the technical limitations or economical reasons. Traditional GMM-based soft sensor models solely depending on labeled samples may easily suffer from singular covariances, overfitting, and difficulties in model selection, which results in the performance deterioration. To tackle these issues, we propose a semisupervised Bayesian GMM (S(2)BGMM). In the S(2)BGMM, we first propose a semisupervised fully Bayesian model, which enables learning from both the labeled and unlabeled datasets for remedying the deficiency of infrequent labeled samples. Subsequently, a general framework of weighted variational inference is developed to train the S(2)BGMM, such that the rate of learning from unlabeled samples can be controlled by penalizing the unlabeled dataset. Case studies are carried out to evaluate the performance of the S(2)BGMM through a numerical example and two real-world industrial processes, which demonstrate the effectiveness and reliability of the proposed approach.
引用
收藏
页码:3455 / 3468
页数:14
相关论文
共 41 条
[1]   NEW LOOK AT STATISTICAL-MODEL IDENTIFICATION [J].
AKAIKE, H .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1974, AC19 (06) :716-723
[2]  
Barber D, 2012, Bayesian reasoning and machine learning
[3]  
Bishop C.M., 2006, Pattern Recognition and Machine Learning, DOI DOI 10.1007/978-0-387-45528-0
[4]   Variational Inference for Dirichlet Process Mixtures [J].
Blei, David M. ;
Jordan, Michael I. .
BAYESIAN ANALYSIS, 2006, 1 (01) :121-143
[5]   Multimodel inference - understanding AIC and BIC in model selection [J].
Burnham, KP ;
Anderson, DR .
SOCIOLOGICAL METHODS & RESEARCH, 2004, 33 (02) :261-304
[6]   Monitoring Nonlinear and Non-Gaussian Processes Using Gaussian Mixture Model-Based Weighted Kernel Independent Component Analysis [J].
Cai, Lianfang ;
Tian, Xuemin ;
Chen, Sheng .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2017, 28 (01) :122-135
[7]   Melt index prediction with a mixture of Gaussian process regression with embedded clustering and variable selections [J].
Chan, Lester Lik Teck ;
Chen, Junghui .
JOURNAL OF APPLIED POLYMER SCIENCE, 2017, 134 (40)
[8]   Adaptive Gaussian Mixture Model-Based Relevant Sample Selection for JITL Soft Sensor Development [J].
Fan, Miao ;
Ge, Zhiqiang ;
Song, Zhihuan .
INDUSTRIAL & ENGINEERING CHEMISTRY RESEARCH, 2014, 53 (51) :19979-19986
[9]   Process Data Analytics via Probabilistic Latent Variable Models: A Tutorial Review [J].
Ge, Zhiqiang .
INDUSTRIAL & ENGINEERING CHEMISTRY RESEARCH, 2018, 57 (38) :12646-12661
[10]   Mixture Semisupervised Principal Component Regression Model and Soft Sensor Application [J].
Ge, Zhiqiang ;
Huang, Biao ;
Song, Zhihuan .
AICHE JOURNAL, 2014, 60 (02) :533-545