Teaching Principal Components Using Correlations

被引:8
作者
Westfall, Peter H. [1 ]
Arias, Andrea L. [2 ,3 ]
Fulton, Lawrence V. [4 ]
机构
[1] Texas Tech Univ, Area Informat Syst & Quantitat Sci, Lubbock, TX 79409 USA
[2] Pontificia Univ Catolica Valparaiso, Sch Ind Engn, Valparaiso, Chile
[3] Texas Tech Univ, Dept Ind Engn, Lubbock, TX 79409 USA
[4] Texas Tech Univ, Area Hlth Org Management, Lubbock, TX 79409 USA
关键词
Factor analysis; heat map; optimality; rotation; variance explained; SIMILARITY;
D O I
10.1080/00273171.2017.1340824
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Introducing principal components (PCs) to students is difficult. First, the matrix algebra and mathematical maximization lemmas are daunting, especially for students in the social and behavioral sciences. Second, the standard motivation involving variance maximization subject to unit length constraint does not directly connect to the variance explained interpretation. Third, the unit length and uncorrelatedness constraints of the standard motivation do not allow re-scaling or oblique rotations, which are common in practice. Instead, we propose to motivate the subject in terms of optimizing (weighted) average proportions of variance explained in the original variables; this approach may be more intuitive, and hence easier to understand because it links directly to the familiar R-squared statistic. It also removes the need for unit length and uncorrelatedness constraints, provides a direct interpretation of variance explained, and provides a direct answer to the question of whether to use covariance-based or correlation-based PCs. Furthermore, the presentation can be made without matrix algebra or optimization proofs. Modern tools from data science, including heat maps and text mining, provide further help in the interpretation and application of PCs; examples are given. Together, these techniques may be used to revise currently used methods for teaching and learning PCs in the behavioral sciences.
引用
收藏
页码:648 / 660
页数:13
相关论文
共 38 条
[1]  
[Anonymous], 2003, User's Guide to Principal Components
[2]   Factor Analysis via Components Analysis [J].
Bentler, Peter M. ;
de Leeuw, Jan .
PSYCHOMETRIKA, 2011, 76 (03) :461-470
[3]   LOADINGS AND CORRELATIONS IN THE INTERPRETATION OF PRINCIPAL COMPONENTS [J].
CADIMA, J ;
JOLLIFFE, IT .
JOURNAL OF APPLIED STATISTICS, 1995, 22 (02) :203-214
[4]   Sequential behavior prediction based on hybrid similarity and cross-user activity transfer [J].
Dai, Peng ;
Ho, Shen-Shyang ;
Rudzicz, Frank .
KNOWLEDGE-BASED SYSTEMS, 2015, 77 :29-39
[5]   Social network analysis of a gamified e-learning course: Small-world phenomenon and network metrics as predictors of academic performance [J].
de-Marcos, Luis ;
Garcia-Lopez, Eva ;
Garcia-Cabot, Antonio ;
Medina-Merodio, Jose-Amelio ;
Dominguez, Adrian ;
Javier Martinez-Herraiz, Jose ;
Diez-Folledo, Teresa .
COMPUTERS IN HUMAN BEHAVIOR, 2016, 60 :312-321
[6]   Principal Component Analysis of Smoothed Tetrachoric Correlation Matrices as a Measure of Dimensionality [J].
Debelak, Rudolf ;
Tran, Ulrich S. .
EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT, 2013, 73 (01) :63-77
[7]  
Duda R. O., 2001, PATTERN CLASSIFICATI
[8]   Design, validation, and reliability determination a citing conformity instrument at three levels: normative, informational, and identification [J].
Ebrahimy, Saeideh ;
Osareh, Farideh .
SCIENTOMETRICS, 2014, 99 (02) :581-597
[9]   Computer-aided diagnosis of human brain tumor through MRI: A survey and a new algorithm [J].
El-Dahshan, El-Sayed A. ;
Mohsen, Heba M. ;
Revett, Kenneth ;
Salem, Abdel-Badeeh M. .
EXPERT SYSTEMS WITH APPLICATIONS, 2014, 41 (11) :5526-5545
[10]  
Everitt B, 2011, USE R, P1, DOI 10.1007/978-1-4419-9650-3