Categorical latent variable modeling utilizing fuzzy clustering generalized structured component analysis as an alternative to latent class analysis

被引:0
作者
Ryoo J.H. [1 ]
Park S. [2 ]
Kim S. [3 ]
机构
[1] Department of Pediatrics and Preventive Medicine, Keck School of Medicine, University of Southern California, Biostatistics Core, The Saban Research Institute, Children’s Hospital Los Angeles, 300B Smith Research Tower, 4650 Sunset Blvd., #160, Los Angeles
[2] The University of Iowa, Iowa City, IA
[3] The University of North Carolina at Greensboro, Greensboro, NC
关键词
Fuzzy clustering; Generalized structured component analysis; Latent class analysis; Optimal scaling;
D O I
10.1007/s41237-019-00084-6
中图分类号
学科分类号
摘要
Latent class analysis is becoming popular in many areas of education, psychology, social and behavioral sciences, public health, and medicine. However, it often suffers from identification issues due to the large number of parameters involved when using maximum likelihood (ML) estimation. Increasing the sample size, reducing sparseness, and strengthening the relationship between the observed variables and the latent variables all improve the information and thus reduce the identification issues, but the identification issue still affects the validity of parameter estimates in ML estimation and the definition of identification is not sufficient to guarantee the existence of an ML solution. In this paper, generalized structured component analysis (GSCA), which is a component-based approach that utilizes optimal scaling and fuzzy clustering, is applied to avoid these identification issues and develop more stable solutions for the heterogeneity of a population based on a set of categorical responses. Testing our proposed new approach, component-based (CB) latent class analysis (LCA), on real world substance use data from Add Health produced not only the same features as those yielded by conventional ML LCA but also stable estimation without identification issues. Comparing the results obtained from ML LCA using Mplus and poLCA in R, with those from our proposed CB LCA using GSCA in R revealed a similar number of latent classes and posterior probabilities and only minor discrepancies in individual latent class classifications when the posterior probabilities of membership are not distinct. © 2019, The Behaviormetric Society.
引用
收藏
页码:291 / 306
页数:15
相关论文
共 49 条
  • [1] Becker J.M., Rai A., Ringle C.M., Volckner F., Discovering unobserved heterogeneity in structural equation models to avert validity threats, MIS Q, 37, 3, pp. 665-694, (2013)
  • [2] Bezdek J.C., Numerical taxonomy with fuzzy sets, J Math Biol, 1, pp. 57-71, (1974)
  • [3] Bezdek J.C., Pattern recognition with fuzzy objective function algorithms, (1981)
  • [4] Collins L., Lanza S., Latent class and latent transition analysis: with applications in the social, behavioral, and health sciences, (2010)
  • [5] Dziak J.J., Lanza S.T., Tan X., Effect size, statistical power, and sample size requirements for the bootstrap likelihood ratio test in latent class analysis, Struct Equ Model, 21, 4, pp. 534-552, (2014)
  • [6] Efron B., Bootstrap methods: another look at the jackknife, Ann Stat, 7, pp. 1-26, (1979)
  • [7] Efron B., The jackknife, the bootstrap and other resampling plans, (1982)
  • [8] Esposito Vinzi V., Trinchera L., Squillacciotti S., Tenenhaus M., REBUS–PLS: a response-based procedure for detecting unit segments in PLS path modeling, Appl Stoch Models Bus Industry, 24, pp. 439-458, (2008)
  • [9] Goodman L.A., The analysis of systems of qualitative variables when some of the variables are unobservable. Part I—a modified latent structure approach, Am J Sociol, 79, pp. 1179-1259, (1974)
  • [10] Goodman L.A., Exploratory latent structure analysis using both identifiable and unidentifiable models, Biometrika, 61, pp. 215-231, (1974)