Bootstrap based confidence limits in principal component analysis - A case study

被引:68
作者
Babamoradi, Hamid [1 ]
van den Berg, Frans [1 ]
Rinnan, Asmund [1 ]
机构
[1] Univ Copenhagen, Fac Sci, Dept Food Sci Qual & Technol, DK-1958 Frederiksberg C, Denmark
关键词
Bootstrap; PCA; Confidence limits; BCa; Uncertainty; MEASUREMENT ERRORS; STOPPING RULES; REGRESSION; UNCERTAINTY; NUMBER; DIMENSIONALITY; PREDICTIONS; VALIDATION; STABILITY; INTERVALS;
D O I
10.1016/j.chemolab.2012.10.007
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Principal component analysis (PCA) is widely used as a tool in (exploratory) data investigations for many different research areas such as analytical chemistry, food- and pharmaceutical-research, and multivariate statistical process control. Despite its popularity, not many results have been reported thus far on how to calculate reliable confidence interval limits in PCA estimates. And, like all other data analysis tasks, results of PCA are not complete without reasonable expectations for the parameter uncertainties, especially in the case of predictive model objectives. In this paper we will present a case study on how to calculate confidence limits based on bootstrap re-sampling. Two NIR datasets are used to build bootstrap confidence limits. The first dataset shows the effect of outliers on bootstrap confidence limits, while the second shows the bootstrap confidence limits when the data has a bimodal distribution. The different steps and choices which have to be made for the algorithm to perform correctly will be presented. The bootstrap based confidence limits will be compared with the corresponding asymptotic confidence limits. We will thereby conclude that the confidence limits based on the bootstrap method give more meaningful answers and are to be preferred over its asymptotic counterparts. (C) 2012 Elsevier B.V. All rights reserved.
引用
收藏
页码:97 / 105
页数:9
相关论文
共 49 条
[1]  
[Anonymous], 1988, Statistics: A Journal of Theoretical Applied Statistics, DOI DOI 10.1080/02331888808802095
[2]   BOOTSTRAP TESTS AND CONFIDENCE-REGIONS FOR FUNCTIONS OF A COVARIANCE-MATRIX [J].
BERAN, R ;
SRIVASTAVA, MS .
ANNALS OF STATISTICS, 1985, 13 (01) :95-115
[3]   PCA STABILITY AND CHOICE OF DIMENSIONALITY [J].
BESSE, P .
STATISTICS & PROBABILITY LETTERS, 1992, 13 (05) :405-410
[4]   Cross-validation of component models: A critical look at current methods [J].
Bro, R. ;
Kjeldahl, K. ;
Smilde, A. K. ;
Kiers, H. A. L. .
ANALYTICAL AND BIOANALYTICAL CHEMISTRY, 2008, 390 (05) :1241-1251
[5]  
CATTELL RB, 1983, CC/SOC BEHAV SCI, P16
[7]   Evaluation of PCA methods with improved fault isolation capabilities on a paper machine simulator [J].
Cheng, Hui ;
Nikus, Mats ;
Jamsa-Jounela, Sirkka-Liisa .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2008, 92 (02) :186-199
[8]  
Chernick M. R., 2008, Bootstrap methods, a guide for practitioners and researchers
[9]  
Conlin AK, 2000, J CHEMOMETR, V14, P725, DOI 10.1002/1099-128X(200009/12)14:5/6<725::AID-CEM611>3.0.CO
[10]  
2-8