Analysis and comparison of an improved unreconstructed variance criterion to other criteria for estimating the dimension of PCA model

被引:3
作者
Mnassri, Baligh [1 ]
El Adel, El Mostafa [1 ]
Ouladsine, Mustapha [1 ]
机构
[1] Aix Marseille Univ, CNRS, ENSAM, Univ Toulon,LSIS UMR 7296, F-13397 Marseille, France
关键词
Principal component analysis; Data driven process modelling; Selection criteria; PRINCIPAL COMPONENTS; FAULT-DETECTION; PART I; NUMBER; SELECTION; RECONSTRUCTION; ERROR; IDENTIFICATION; COVARIANCE; DIAGNOSIS;
D O I
10.1016/j.jprocont.2016.06.004
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper provides a new criterion to select the significant components of an empirical process model using the principal component analysis approach. The proposed criterion is an improved unreconstructed variance (IUV) applied to a changing of process data representation. Four other criteria are studied to perform fundamental analyses and comparisons to each other. They are well known in the literature as the minimum description length (MDL), the imbedded error (IE), the equality of the eigenvalue (EOE) and the variance of reconstruction error (VRE). The selection of the significant components is usually constrained by three main difficulties such as the noise included in data, the presence of independent and quasi independent process variables and the size of training samples. This paper presents two fundamental proofs that clarify the limitations of both criteria which are IE and VRE. The consistency of the MDL and EOE criteria improves by increasing the number of training observations. The purpose of the IUV criterion is to enhance the VRE in order to remedy the encountered limitations. The proposed criterion shows a promising consistency as well as a highly robustness versus the mentioned difficulties. Its potential and the limitations of the other criteria are illustrated using two numerical examples and the CSTR process. (C) 2016 Elsevier Ltd. All rights reserved.
引用
收藏
页码:207 / 223
页数:17
相关论文
共 39 条
[1]  
Akaike H., 1992, 2 INT S INF THEOR, P610, DOI [10.1007/978-1-4612-1694-0, 10.1007/978-1-4612-0919-538, 10.1007/978-1-4612-0919-5_38, 10.1007/978-0-387-98135-2, DOI 10.1007/978-1-4612-0919-538]
[2]  
Anderson T.W., 2003, INTRO MULTIVARIATE S
[3]   ASYMPTOTIC THEORY FOR PRINCIPAL COMPONENT ANALYSIS [J].
ANDERSON, TW .
ANNALS OF MATHEMATICAL STATISTICS, 1963, 34 (01) :122-&
[4]  
BARTLETT MS, 1954, J ROY STAT SOC B, V16, P296
[5]   PCA STABILITY AND CHOICE OF DIMENSIONALITY [J].
BESSE, P .
STATISTICS & PROBABILITY LETTERS, 1992, 13 (05) :405-410
[6]   SCREE TEST FOR NUMBER OF FACTORS [J].
CATTELL, RB .
MULTIVARIATE BEHAVIORAL RESEARCH, 1966, 1 (02) :245-276
[7]   A unified geometric approach to process and sensor fault identification and reconstruction: The unidimensional fault case [J].
Dunia, R ;
Qin, SJ .
COMPUTERS & CHEMICAL ENGINEERING, 1998, 22 (7-8) :927-943
[8]   CROSS-VALIDATORY CHOICE OF THE NUMBER OF COMPONENTS FROM A PRINCIPAL COMPONENT ANALYSIS [J].
EASTMENT, HT ;
KRZANOWSKI, WJ .
TECHNOMETRICS, 1982, 24 (01) :73-77
[9]   A unified statistical framework for monitoring multivariate systems with unknown source and error signals [J].
Feital, Thiago ;
Kruger, Uwe ;
Xie, Lei ;
Schubert, Udo ;
Lima, Enrique Luis ;
Pinto, Jose Carlos .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2010, 104 (02) :223-232
[10]   SELECTION OF COMPONENTS IN PRINCIPAL COMPONENT ANALYSIS - A COMPARISON OF METHODS [J].
FERRE, L .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 1995, 19 (06) :669-682