Factor Retention Using Machine Learning With Ordinal Data

被引:17
|
作者
Goretzko, David [1 ]
Buehner, Markus [1 ]
机构
[1] Ludwig Maximilians Univ Munchen, Munich, Germany
关键词
exploratory factor analysis; number of factors; machine learning; factor retention; factorial validity; ordinal data; EXPLORATORY FACTOR-ANALYSIS; HORNS PARALLEL ANALYSIS; NUMBER; COMPONENTS; RETAIN; RULES;
D O I
10.1177/01466216221089345
中图分类号
O1 [数学]; C [社会科学总论];
学科分类号
03 ; 0303 ; 0701 ; 070101 ;
摘要
Determining the number of factors in exploratory factor analysis is probably the most crucial decision when conducting the analysis as it clearly influences the meaningfulness of the results (i.e., factorial validity). A new method called the Factor Forest that combines data simulation and machine learning has been developed recently. This method based on simulated data reached very high accuracy for multivariate normal data, but it has not yet been tested with ordinal data. Hence, in this simulation study, we evaluated the Factor Forest with ordinal data based on different numbers of categories (2-6 categories) and compared it to common factor retention criteria. It showed higher overall accuracy for all types of ordinal data than all common factor retention criteria that were used for comparison (Parallel Analysis, Comparison Data, the Empirical Kaiser Criterion and the Kaiser Guttman Rule). The results indicate that the Factor Forest is applicable to ordinal data with at least five categories (typical scale in questionnaire research) in the majority of conditions and to binary or ordinal data based on items with less categories when the sample size is large.
引用
收藏
页码:406 / 421
页数:16
相关论文
共 50 条
  • [41] Supervised machine learning using encrypted training data
    Gonzalez-Serrano, Francisco-Javier
    Amor-Martin, Adrian
    Casamayon-Anton, Jorge
    INTERNATIONAL JOURNAL OF INFORMATION SECURITY, 2018, 17 (04) : 365 - 377
  • [42] ANALYSIS OF LArTPC DATA USING MACHINE LEARNING METHODS
    Falko, A.
    Gogota, O.
    Yermolenko, R.
    Kadenko, I.
    JOURNAL OF PHYSICAL STUDIES, 2024, 28 (01):
  • [43] Trends in web data extraction using machine learning
    Patnaik, Sudhir Kumar
    Babu, C. Narendra
    WEB INTELLIGENCE, 2021, 19 (03) : 169 - 190
  • [44] Big Data Platform Configuration Using Machine Learning
    Yeh, Chao-Chun
    Lu, Han-Lin
    Zhou, Jiazheng
    Chang, Sheng-An
    Lin, Xuan-Yi
    Sun, Yi-Chiao
    Huang, Shih-Kun
    JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2020, 36 (03) : 469 - 493
  • [45] Macroeconomic Predictions Using Payments Data and Machine Learning
    Chapman, James T. E.
    Desai, Ajit
    FORECASTING, 2023, 5 (04): : 652 - 683
  • [46] Analysis of Network log data using Machine Learning
    Allagi, Shridhar
    Rachh, Rashmi
    2019 IEEE 5TH INTERNATIONAL CONFERENCE FOR CONVERGENCE IN TECHNOLOGY (I2CT), 2019,
  • [47] Supervised machine learning using encrypted training data
    Francisco-Javier González-Serrano
    Adrián Amor-Martín
    Jorge Casamayón-Antón
    International Journal of Information Security, 2018, 17 : 365 - 377
  • [48] Minimum Data Base Determination using Machine Learning
    Ferrnando Kuri-Morales, Angel
    INTERNATIONAL JOURNAL OF WEB SERVICES RESEARCH, 2016, 13 (04) : 1 - 18
  • [49] A framework for data regression of heat transfer data using machine learning
    Loyola-Fuentes, Jose
    Nazemzadeh, Nima
    Diaz-Bejarano, Emilio
    Mancin, Simone
    Coletti, Francesco
    APPLIED THERMAL ENGINEERING, 2024, 248
  • [50] Sensor data classification using machine learning algorithm
    Rose, Lina
    Mary, X. Anitha
    JOURNAL OF STATISTICS & MANAGEMENT SYSTEMS, 2020, 23 (02) : 363 - 371