Semiparametric mixture regression with unspecified error distributions

被引:6
作者
Ma, Yanyuan [1 ]
Wang, Shaoli [2 ]
Xu, Lin [3 ]
Yao, Weixin [4 ]
机构
[1] Penn State Univ, Dept Stat, State Coll, PA USA
[2] Shanghai Univ Finance & Econ, Sch Stat & Management, Shanghai, Peoples R China
[3] Zhejiang Univ Finance & Econ, Sch Data Sci, Hangzhou, Peoples R China
[4] Univ Calif Riverside, Dept Stat, Riverside, CA 92521 USA
关键词
EM algorithm; Mixture of regressions; Semiparametric mixture models; 2-COMPONENT MIXTURE; MAXIMUM-LIKELIHOOD; EM ALGORITHM; INFERENCE; ESTIMATOR; MODELS; ORDER;
D O I
10.1007/s11749-020-00725-z
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
In fitting a mixture of linear regression models, normal assumption is traditionally used to model the error and then regression parameters are estimated by the maximum likelihood estimators (MLE). This procedure is not valid if the normal assumption is violated. By extending the semiparametric regression estimator proposed by Hunter and Young (J Nonparametr Stat 24:19-38, 2012a) which requires the component error densities to be the same (including homogeneous variance), we propose semiparametric mixture of linear regression models with unspecified component error distributions to reduce the modeling bias. We establish a more general identifiability result under weaker conditions than existing results, construct a class of new estimators, and establish their asymptotic properties. These asymptotic results also apply to many existing semiparametric mixture regression estimators whose asymptotic properties have remained unknown due to the inherent difficulties in obtaining them. Using simulation studies, we demonstrate the superiority of the proposed estimators over the MLE when the normal error assumption is violated and the comparability when the error is normal. Analysis of a newly collected Equine Infectious Anemia Virus data in 2017 is employed to illustrate the usefulness of the new estimator.
引用
收藏
页码:429 / 444
页数:16
相关论文
共 39 条
  • [1] Revisiting the Hodges-Lehmann estimator in a location mixture model: Is asymptotic normality good enough?
    Balabdaoui, Fadoua
    [J]. ELECTRONIC JOURNAL OF STATISTICS, 2017, 11 (02): : 4563 - 4595
  • [2] Inference for a two-component mixture of symmetric distributions under log-concavity
    Balabdaoui, Fadoua
    Doss, Charles R.
    [J]. BERNOULLI, 2018, 24 (02) : 1053 - 1071
  • [3] STATISTICAL GUARANTEES FOR THE EM ALGORITHM: FROM POPULATION TO SAMPLE-BASED ANALYSIS
    Balakrishnan, Sivaraman
    Wainwrightt, Martin J.
    Yu, Bin
    [J]. ANNALS OF STATISTICS, 2017, 45 (01) : 77 - 120
  • [4] An EM-Like Algorithm for Semi- and Nonparametric Estimation in Multivariate Mixtures
    Benaglia, Tatiana
    Chauveau, Didier
    Hunter, David R.
    [J]. JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2009, 18 (02) : 505 - 526
  • [5] A stochastic EM algorithm for a semiparametric mixture model
    Bordes, Laurent
    Chauveau, Didier
    Vandekerkhove, Pierre
    [J]. COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2007, 51 (11) : 5429 - 5443
  • [6] Semiparametric estimation of a two-component mixture model
    Bordes, Laurent
    Mottelet, Stephane
    Vandekerkhove, Pierre
    [J]. ANNALS OF STATISTICS, 2006, 34 (03) : 1204 - 1232
  • [7] Semiparametric topographical mixture models with symmetric errors
    Butucea, C.
    Tzoumpe, R. Ngueyep
    Vandekerkhove, P.
    [J]. BERNOULLI, 2017, 23 (02) : 825 - 862
  • [8] Estimation of finite mixtures with symmetric components
    Chee, Chew-Seng
    Wang, Yong
    [J]. STATISTICS AND COMPUTING, 2013, 23 (02) : 233 - 249
  • [9] Inference on the Order of a Normal Mixture
    Chen, Jiahua
    Li, Pengfei
    Fu, Yuejiao
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2012, 107 (499) : 1096 - 1105
  • [10] MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM
    DEMPSTER, AP
    LAIRD, NM
    RUBIN, DB
    [J]. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01): : 1 - 38