Automated learning of mixtures of factor analysis models with missing information

Cited by: 0

Authors:

Wan-Lun Wang

Tsung-I Lin

Affiliations:

[1] Feng Chia University, Department of Statistics, Graduate Institute of Statistics and Actuarial Science

[2] National Chung Hsing University, Institute of Statistics

[3] China Medical University, Department of Public Health

Source:

TEST | 2020 / Vol. 29

Keywords:

Automated learning; Factor analysis; Maximum likelihood estimation; Missing values; Model selection; One-stage algorithm; 62H12; 62H25; 62H30

DOI:

Not available

Abstract:

The mixture of factor analyzers (MFA) model has emerged as a useful tool to perform dimensionality reduction and model-based clustering for heterogeneous data. In seeking the most appropriate number of factors (q) of an MFA model with the number of components (g) fixed a priori, a two-stage procedure is commonly implemented by first carrying out parameter estimation over a set of prespecified numbers of factors, and then selecting the best q according to certain penalized likelihood criteria. When the dimensionality of data grows higher, such a procedure can be computationally prohibitive. To overcome this obstacle, we develop an automated learning scheme, called the automated MFA (AMFA) algorithm, to effectively merge parameter estimation and selection of q into a one-stage algorithm. The proposed AMFA procedure, which allows for much lower computational cost, is also extended to accommodate missing values. Moreover, we explicitly derive the score vector and the empirical information matrix for calculating standard errors associated with the estimated parameters. The potential and applicability of the proposed method are demonstrated through a number of real datasets with genuine and synthetic missing values.

Pages: 1098-1124

Page count: 26

50 items in total

[31] Mixtures of common t-factor analyzers for modeling high-dimensional data with missing values
Wang, Wan-Lun
[J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2015, 83 : 223 - 235
[32] Mixtures of QSAR models: Learning application domains of pKa predictors
Dorgo, Gyula
Peter Hamadi, Omar
Varga, Tamas
Abonyi, Janos
[J]. JOURNAL OF CHEMOMETRICS, 2020, 34 (04)
[33] Metrics for evaluating the performance of machine learning based automated valuation models
Steurer, Miriam
Hill, Robert J.
Pfeifer, Norbert
[J]. JOURNAL OF PROPERTY RESEARCH, 2021, 38 (02) : 99 - 129
[34] Performance of Estimators for Confirmatory Factor Analysis of Ordinal Variables with Missing Data
Lei, Pui-Wa
Shiverdecker, Levi K.
[J]. STRUCTURAL EQUATION MODELING-A MULTIDISCIPLINARY JOURNAL, 2020, 27 (04) : 584 - 601
[35] A Comparison of Missing-Data Imputation Techniques in Exploratory Factor Analysis
Xiao, Canhua
Bruner, Deborah W.
Dai, Tian
Guo, Ying
Hanlon, Alexandra
[J]. JOURNAL OF NURSING MEASUREMENT, 2019, 27 (02) : 313 - 334
[36] STATISTICAL ANALYSIS OF FACTOR MODELS OF HIGH DIMENSION
Bai, Jushan
Li, Kunpeng
[J]. ANNALS OF STATISTICS, 2012, 40 (01) : 436 - 465
[37] Bayesian inference for graphical factor analysis models
Giudici, P
Stanghellini, E
[J]. PSYCHOMETRIKA, 2001, 66 (04) : 577 - 591
[38] Bayesian inference for graphical factor analysis models
Paolo Giudici
Elena Stanghellini
[J]. Psychometrika, 2001, 66 : 577 - 591
[39] Learning causal structure from mixed data with missing values using Gaussian copula models
Ruifei Cui
Perry Groot
Tom Heskes
[J]. Statistics and Computing, 2019, 29 : 311 - 333
[40] Learning causal structure from mixed data with missing values using Gaussian copula models
Cui, Ruifei
Groot, Perry
Heskes, Tom
[J]. STATISTICS AND COMPUTING, 2019, 29 (02) : 311 - 333