The impact of covariance misspecification in multivariate Gaussian mixtures on estimation and inference: an application to longitudinal modeling

被引:15
作者
Heggeseth, Brianna C. [1 ]
Jewell, Nicholas P. [1 ,2 ]
机构
[1] Univ Calif Berkeley, Dept Stat, Berkeley, CA 94720 USA
[2] Univ Calif Berkeley, Div Biostat, Berkeley, CA 94720 USA
基金
美国国家科学基金会;
关键词
covariance; model misspecification; mixture models; Kullback-Leibler divergence; MAXIMUM-LIKELIHOOD-ESTIMATION; IDENTIFIABILITY; TRAJECTORIES; CONSISTENCY; VARIABLES; BIAS;
D O I
10.1002/sim.5729
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Multivariate Gaussian mixtures are a class of models that provide a flexible parametric approach for the representation of heterogeneous multivariate outcomes. When the outcome is a vector of repeated measurements taken on the same subject, there is often inherent dependence between observations. However, a common covariance assumption is conditional independencethat is, given the mixture component label, the outcomes for subjects are independent. In this paper, we study, through asymptotic bias calculations and simulation, the impact of covariance misspecification in multivariate Gaussian mixtures. Although maximum likelihood estimators of regression and mixing probability parameters are not consistent under misspecification, they have little asymptotic bias when mixture components are well separated or if the assumed correlation is close to the truth even when the covariance is misspecified. We also present a robust standard error estimator and show thatit outperforms conventional estimators in simulations and can indicate that the model is misspecified. Body mass index data from a national longitudinal study are used to demonstrate the effects of misspecification on potential inferences made in practice. Copyright (c) 2013 John Wiley & Sons, Ltd.
引用
收藏
页码:2790 / 2803
页数:14
相关论文
共 47 条
  • [1] [Anonymous], 2013, Finite Mixture Distributions
  • [2] [Anonymous], WILEY SERIES PROBABI
  • [3] [Anonymous], 2008, CAUSALITY PSYCHOPATH
  • [4] [Anonymous], 2005, FEP WORKING PAPERS
  • [5] [Anonymous], INT STAT REV
  • [6] [Anonymous], WILEY SERIES PROBABI
  • [7] MODEL-BASED GAUSSIAN AND NON-GAUSSIAN CLUSTERING
    BANFIELD, JD
    RAFTERY, AE
    [J]. BIOMETRICS, 1993, 49 (03) : 803 - 821
  • [8] Maximum Likelihood Estimation of the Multivariate Normal Mixture Model
    Boldea, Otilia
    Magnus, Jan R.
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2009, 104 (488) : 1539 - 1549
  • [9] Dasgupta S., 1999, Proceedings of the 40th Annual Symposium on Foundations of Computer Science, FOCS'99, page, V40, P634
  • [10] ESTIMATING COMPONENTS OF A MIXTURE OF NORMAL DISTRIBUTIONS
    DAY, NE
    [J]. BIOMETRIKA, 1969, 56 (03) : 463 - &