Factor Retention Using Machine Learning With Ordinal Data

被引:17
|
作者
Goretzko, David [1 ]
Buehner, Markus [1 ]
机构
[1] Ludwig Maximilians Univ Munchen, Munich, Germany
关键词
exploratory factor analysis; number of factors; machine learning; factor retention; factorial validity; ordinal data; EXPLORATORY FACTOR-ANALYSIS; HORNS PARALLEL ANALYSIS; NUMBER; COMPONENTS; RETAIN; RULES;
D O I
10.1177/01466216221089345
中图分类号
O1 [数学]; C [社会科学总论];
学科分类号
03 ; 0303 ; 0701 ; 070101 ;
摘要
Determining the number of factors in exploratory factor analysis is probably the most crucial decision when conducting the analysis as it clearly influences the meaningfulness of the results (i.e., factorial validity). A new method called the Factor Forest that combines data simulation and machine learning has been developed recently. This method based on simulated data reached very high accuracy for multivariate normal data, but it has not yet been tested with ordinal data. Hence, in this simulation study, we evaluated the Factor Forest with ordinal data based on different numbers of categories (2-6 categories) and compared it to common factor retention criteria. It showed higher overall accuracy for all types of ordinal data than all common factor retention criteria that were used for comparison (Parallel Analysis, Comparison Data, the Empirical Kaiser Criterion and the Kaiser Guttman Rule). The results indicate that the Factor Forest is applicable to ordinal data with at least five categories (typical scale in questionnaire research) in the majority of conditions and to binary or ordinal data based on items with less categories when the sample size is large.
引用
收藏
页码:406 / 421
页数:16
相关论文
共 50 条
  • [31] Evaluating Temporal Correlations in Time Series Using Permutation Entropy, Ordinal Probabilities and Machine Learning
    Boaretto, Bruno R. R.
    Budzinski, Roberto C.
    Rossi, Kalel L.
    Prado, Thiago L.
    Lopes, Sergio R.
    Masoller, Cristina
    ENTROPY, 2021, 23 (08)
  • [32] Factor Retention in EFA: Strategies for Health Behavior Researchers
    Stellefson, Michael L.
    Hanik, Bruce W.
    Chaney, Beth H.
    Chaney, J. Don
    AMERICAN JOURNAL OF HEALTH BEHAVIOR, 2009, 33 (05): : 587 - 599
  • [33] An SPSS R-Menu for Ordinal Factor Analysis
    Basto, Mario
    Pereira, Jose Manuel
    JOURNAL OF STATISTICAL SOFTWARE, 2012, 46 (04): : 1 - 29
  • [34] Concrete aging factor prediction using machine learning
    Taffese, Woubishet Zewdu
    Wally, Gustavo Bosel
    Magalhaes, Fabio Costa
    Espinosa-Leal, Leonardo
    MATERIALS TODAY COMMUNICATIONS, 2024, 40
  • [35] Factor retention in ordered categorical variables: Benefits and costs of polychoric correlations in eigenvalue-based testing
    Brandenburg, Nils
    BEHAVIOR RESEARCH METHODS, 2024, 56 (07) : 7241 - 7260
  • [36] Fitting Large Factor Analysis Models With Ordinal Data
    DiStefano, Christine
    McDaniel, Heather L.
    Zhang, Liyun
    Shi, Dexin
    Jiang, Zhehan
    EDUCATIONAL AND PSYCHOLOGICAL MEASUREMENT, 2019, 79 (03) : 417 - 436
  • [37] Prediction of Stress-Dependent Soil Water Retention Using Machine Learning
    Mojtahedi, Seyed Farid Fazel
    Akbarpour, Ali
    Darzi, Ali Golaghaei
    Sadeghi, Hamed
    van Genuchten, Martinus Theodorus
    GEOTECHNICAL AND GEOLOGICAL ENGINEERING, 2024, 42 (05) : 3939 - 3966
  • [38] Data mining and machine learning in retail business: developing efficiencies for better customer retention
    Kumar, M. Rajesh
    Venkatesh, J.
    Rahman, A. M. J. Md Zubair
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2021,
  • [39] Photothermal Radiometry Data Analysis by Using Machine Learning
    Xiao, Perry
    Chen, Daqing
    SENSORS, 2024, 24 (10)
  • [40] Intrusion Detection Using Data Fusion and Machine Learning
    Hechmi, Jridi Mohamed
    Khlaifi, Hacen
    Bouatay, Amine
    Zrelli, Amira
    Ezzedine, Tahar
    2018 26TH INTERNATIONAL CONFERENCE ON SOFTWARE, TELECOMMUNICATIONS AND COMPUTER NETWORKS (SOFTCOM), 2018, : 235 - 240