Exploring Class Enumeration in Bayesian Growth Mixture Modeling Based on Conditional Medians

被引:5
作者
Kim, Seohyun [1 ]
Tong, Xin [1 ]
Ke, Zijun [2 ]
机构
[1] Univ Virginia, Dept Psychol, Gilmer Hall, Charlottesville, VA 22903 USA
[2] Sun Yat Sen Univ, Dept Psychol, Guangzhou, Peoples R China
基金
美国国家科学基金会;
关键词
robust methods; growth mixture modeling; conditional medians; bayesian model comparison; outliers; LONGITUDINAL DATA; CROSS-VALIDATION; LATENT CLASSES; CURVE MODELS; REGRESSION; INFERENCE; SELECTION; NUMBER;
D O I
10.3389/feduc.2021.624149
中图分类号
G40 [教育学];
学科分类号
040101 ; 120403 ;
摘要
Growth mixture modeling is a popular analytic tool for longitudinal data analysis. It detects latent groups based on the shapes of growth trajectories. Traditional growth mixture modeling assumes that outcome variables are normally distributed within each class. When data violate this normality assumption, however, it is well documented that the traditional growth mixture modeling mislead researchers in determining the number of latent classes as well as in estimating parameters. To address nonnormal data in growth mixture modeling, robust methods based on various nonnormal distributions have been developed. As a new robust approach, growth mixture modeling based on conditional medians has been proposed. In this article, we present the results of two simulation studies that evaluate the performance of the median-based growth mixture modeling in identifying the correct number of latent classes when data follow the normality assumption or have outliers. We also compared the performance of the median-based growth mixture modeling to the performance of traditional growth mixture modeling as well as robust growth mixture modeling based on t distributions. For identifying the number of latent classes in growth mixture modeling, the following three Bayesian model comparison criteria were considered: deviance information criterion, Watanabe-Akaike information criterion, and leave-one-out cross validation. For the median-based growth mixture modeling and t-based growth mixture modeling, our results showed that they maintained quite high model selection accuracy across all conditions in this study (ranged from 87 to 100%). In the traditional growth mixture modeling, however, the model selection accuracy was greatly influenced by the proportion of outliers. When sample size was 500 and the proportion of outliers was 0.05, the correct model was preferred in about 90% of the replications, but the percentage dropped to about 40% as the proportion of outliers increased to 0.15.
引用
收藏
页数:11
相关论文
共 42 条
[1]   Observations on the use of growth mixture models in psychological research [J].
Bauer, Daniel J. .
MULTIVARIATE BEHAVIORAL RESEARCH, 2007, 42 (04) :757-786
[2]   Distributional assumptions of growth mixture models: Implications for overextraction of latent trajectory classes [J].
Bauer, DJ ;
Curran, PJ .
PSYCHOLOGICAL METHODS, 2003, 8 (03) :338-363
[3]  
Bollen KA, 2006, WILEY SER PROBAB ST, P1
[4]   Univariate and multivariate skewness and kurtosis for measuring nonnormality: Prevalence, influence and estimation [J].
Cain, Meghan K. ;
Zhang, Zhiyong ;
Yuan, Ke-Hai .
BEHAVIOR RESEARCH METHODS, 2017, 49 (05) :1716-1735
[5]  
Celeux G, 2006, BAYESIAN ANAL, V1, P651, DOI 10.1214/06-BA122
[6]   Implementing continuous non-normal skewed distributions in latent growth mixture modeling: An assessment of specification errors and class enumeration [J].
Depaoli, Sarah ;
Winter, Sonja D. ;
Lai, Keke ;
Guerra-Pena, Kiero .
MULTIVARIATE BEHAVIORAL RESEARCH, 2019, 54 (06) :795-821
[7]  
Gabry Jonah, 2024, CRAN
[8]  
Gelman A., 2013, Bayesian data analysis, Vthird, DOI DOI 10.1201/B16018
[9]   Quantile regression for longitudinal data using the asymmetric Laplace distribution [J].
Geraci, Marco ;
Bottai, Matteo .
BIOSTATISTICS, 2007, 8 (01) :140-154
[10]   Class enumeration false positive in skew-t family of continuous growth mixture models [J].
Guerra-Pena, Kiero ;
Emilio Garcia-Batista, Zoilo ;
Depaoli, Sarah ;
Eduardo Garrido, Luis .
PLOS ONE, 2020, 15 (04)