Framework for regression-based missing data imputation methods in on-line MSPC

被引:49
作者
Arteaga, F
Ferrer, A
机构
[1] Univ Politecn Valencia, Dept Estadist & IO Aplicadas & Calidad, Valencia 46022, Spain
[2] Univ Catolica Valencia, Fac Estudios Empresa, Valencia 46008, Spain
关键词
principal component analysis (PCA); missing data; multivariate statistical process control (MSPC);
D O I
10.1002/cem.946
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Missing data are a critical issue in on-line multivariate statistical process control (MSPC). Among the different scores estimation methods for future multivariate incomplete observations from an existing principal component analysis (PCA) model, the most statistical efficient ones are those that estimate the scores for the new incomplete observation as the prediction from a regression model. We have called them regression-based methods. Several approximations have been proposed in the literature to overcome the singularity or ill-conditioning problems that some of the mentioned methods can suffer due to missing data. This is particularly acute in on-line batch process monitoring. In order to ease the comparison of the statistical performance of these methods and to improve the understanding of their relationships, in this paper we propose a framework that allows to write these regression-based methods by an unique expression, function of a key matrix. From this framework a statistical performance index (PRESV) is introduced as a way to compare the statistical efficiency of the different framework members and to predict the impact of specific missing data combinations on scores estimation without requiring real data. The results are illustrated by application to several continuous and batch industrial data sets. Copyright (c) 2005 John Wiley & Sons, Ltd.
引用
收藏
页码:439 / 447
页数:9
相关论文
共 15 条
[1]   PLS regression methods [J].
Höskuldsson, Agnar .
Journal of Chemometrics, 1988, 2 (03) :211-228
[2]  
[Anonymous], 1989, MULTIVARIATE CALIBRA
[3]   Dealing with missing data in MSPC: several methods, different interpretations, some examples [J].
Arteaga, F ;
Ferrer, A .
JOURNAL OF CHEMOMETRICS, 2002, 16 (8-10) :408-418
[4]   SIMPLS - AN ALTERNATIVE APPROACH TO PARTIAL LEAST-SQUARES REGRESSION [J].
DEJONG, S .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 1993, 18 (03) :251-263
[5]   Model predictive monitoring for batch processes [J].
García-Muñoz, S ;
Kourti, T ;
MacGregor, JF .
INDUSTRIAL & ENGINEERING CHEMISTRY RESEARCH, 2004, 43 (18) :5929-5941
[6]  
Jackson JE., 1991, A user guide to Principal Components, DOI 10.1002/0471725331
[7]  
Little R.J., 1987, Statistical Analysis With Missing Data
[8]  
Nelson P.R.C., 2002, THESIS MCMASTER U HA
[9]   Missing data methods in PCA and PLS: Score calculations with incomplete observations [J].
Nelson, PRC ;
Taylor, PA ;
MacGregor, JF .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 1996, 35 (01) :45-65
[10]   MULTIVARIATE SPC CHARTS FOR MONITORING BATCH PROCESSES [J].
NOMIKOS, P ;
MACGREGOR, JF .
TECHNOMETRICS, 1995, 37 (01) :41-59