PCAtest: testing the statistical significance of Principal Component Analysis in R

被引:87
作者
Camargo, Arley [1 ]
机构
[1] Univ Republ, Ctr Univ Reg Noreste, Rivera, Uruguay
关键词
Principal component analysis; Statistical significance; Permutation; R function; PCAtest; PERMUTATION TESTS; STOPPING RULES; AXES;
D O I
10.7717/peerj.12967
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Principal Component Analysis (PCA) is one of the most broadly used statistical methods for the ordination and dimensionality-reduction of multivariate datasets across many scientific disciplines. Trivial PCs can be estimated from data sets without any correlational structure among the original variables, and traditional criteria for selecting non-trivial PC axes are difficult to implement, partially subjective or based on ad hoc thresholds. PCAtest is an R package that implements permutation-based statistical tests to evaluate the overall significance of a PCA, the significance of each PC axis, and of contributions of each observed variable to the significant axes. Based on simulation and empirical results, I encourage R users to routinely apply PCAtest to test the significance of their PCA before proceeding with the direct interpretation of PC axes and/or the utilization of PC scores in subsequent evolutionary and ecological analyses.
引用
收藏
页数:14
相关论文
共 26 条
[1]  
[Anonymous], 2017, Randomization, Bootstrap and Monte Carlo Methods in Biology
[2]   Be careful with your principal components [J].
Bjorklund, Mats .
EVOLUTION, 2019, 73 (10) :2151-2158
[3]   Asymptotic properties of correlation-based principal component analysis? [J].
Choi, Jungjun ;
Yang, Xiye .
JOURNAL OF ECONOMETRICS, 2022, 229 (01) :1-18
[4]   The role of permutation tests in exploratory multivariate data analysis [J].
Dijksterhuis, GB ;
Heiser, WJ .
FOOD QUALITY AND PREFERENCE, 1995, 6 (04) :263-270
[5]   PERMUTATION METHODS FOR FACTOR ANALYSIS AND PCA [J].
Dobriban, Edgar .
ANNALS OF STATISTICS, 2020, 48 (05) :2824-2847
[6]   1977 RIETZ LECTURE - BOOTSTRAP METHODS - ANOTHER LOOK AT THE JACKKNIFE [J].
EFRON, B .
ANNALS OF STATISTICS, 1979, 7 (01) :1-26
[7]  
Efron Bradley, 1986, Statistical Science, P54, DOI DOI 10.1214/SS/1177013815
[8]   PROPOSAL FOR HANDLING MISSING DATA [J].
GLEASON, TC ;
STAELIN, R .
PSYCHOMETRIKA, 1975, 40 (02) :229-252
[9]   Analysis of a complex of statistical variables into principal components [J].
Hotelling, H .
JOURNAL OF EDUCATIONAL PSYCHOLOGY, 1933, 24 :417-441
[10]   STOPPING RULES IN PRINCIPAL COMPONENTS-ANALYSIS - A COMPARISON OF HEURISTIC AND STATISTICAL APPROACHES [J].
JACKSON, DA .
ECOLOGY, 1993, 74 (08) :2204-2214