Semiparametric mixture regression with unspecified error distributions

被引:0
作者
Yanyuan Ma
Shaoli Wang
Lin Xu
Weixin Yao
机构
[1] The Pennsylvania State University,Department of Statistics
[2] Shanghai University of Finance and Economics,School of Statistics and Management
[3] Zhejiang University of Finance and Economics,School of Data Sciences
[4] University of California,Department of Statistics
[5] Riverside,undefined
来源
TEST | 2021年 / 30卷
关键词
EM algorithm; Mixture of regressions; Semiparametric mixture models; 62G20; 62G07;
D O I
暂无
中图分类号
学科分类号
摘要
In fitting a mixture of linear regression models, normal assumption is traditionally used to model the error and then regression parameters are estimated by the maximum likelihood estimators (MLE). This procedure is not valid if the normal assumption is violated. By extending the semiparametric regression estimator proposed by Hunter and Young (J Nonparametr Stat 24:19–38, 2012a) which requires the component error densities to be the same (including homogeneous variance), we propose semiparametric mixture of linear regression models with unspecified component error distributions to reduce the modeling bias. We establish a more general identifiability result under weaker conditions than existing results, construct a class of new estimators, and establish their asymptotic properties. These asymptotic results also apply to many existing semiparametric mixture regression estimators whose asymptotic properties have remained unknown due to the inherent difficulties in obtaining them. Using simulation studies, we demonstrate the superiority of the proposed estimators over the MLE when the normal error assumption is violated and the comparability when the error is normal. Analysis of a newly collected Equine Infectious Anemia Virus data in 2017 is employed to illustrate the usefulness of the new estimator.
引用
收藏
页码:429 / 444
页数:15
相关论文
共 81 条
[1]  
Balabdaoui F(2017)Revisiting the Hodges–Lehmann estimator in a location mixture model: Is asymptotic normality good enough? Electron J Stat 11 4563-4595
[2]  
Balabdaoui F(2018)Inference for a two-component mixture of symmetric distributions under log-concavity Bernoulli 24 1053-1071
[3]  
Doss CR(2017)Statistical guarantees for the em algorithm: from population to sample-based analysis Ann Stat 45 77-120
[4]  
Balakrishnan S(2009)An EM-like algorithm for semi- and non-parametric estimation in multivariate mixtures J Comput Graph Stat 18 505-526
[5]  
Wainwright MJ(2006)Semiparametric estimation of a two-component mixture model Ann Stat 34 1204-1232
[6]  
Yu B(2007)An EM algorithm for a semiparametric mixture model Comput Stat Data Anal 51 5429-5443
[7]  
Benaglia T(2017)Semiparametric topographical mixture models with symmetric errors Bernoulli 23 825-862
[8]  
Chauveau D(2013)Estimation of finite mixtures with symmetric components Stat Comput 23 233-249
[9]  
Hunter D(2012)Inference on the order of a normal mixture J Am Stat Assoc 107 1096-1105
[10]  
Bordes L(1977)Maximum likelihood from incomplete data via the EM algorithm J R Stat Soc Ser B 39 1-38