A simple rule for the selection of principal components

被引:49
作者
Karlis, D
Saporta, G
Spinakis, A
机构
[1] Athens Univ Econ & Business, Dept Stat, Athens 10434, Greece
[2] Conservat Natl Arts & Mitiers, Paris, France
[3] QUANTOS SARL, Paris, France
关键词
kaiser criterion; number of components; scree plot; simulation study; nonnormality; heptathlon data;
D O I
10.1081/STA-120018556
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
A vast literature has been devoted to the assessment of the proper number of eigenvalues that have to be retained in Principal Components Analysis. Most of the publications are, based on either distributional assumptions for the underlying populations or on empirical evident. In addition, techniques that are based on bootstrap or cross-validatory techniques have been proposed despite the computational effort implied. In this paper a simple technique based on a control chart approach is proposed for selecting the number of principal components to retain for the analysis. This approach accounts for the sampling variability which can lead to the selection of components that are not in fact statistically significant. The method is compared with other methods and is found to be superior regardless of the underlying distributional properties of the population as well as the existing structure. An illustrative example is provided.
引用
收藏
页码:643 / 666
页数:24
相关论文
共 22 条