Multivariate Contaminated Normal Censored Regression Model: Properties and Maximum Likelihood Inference

被引:4
作者
Wang, Wan-Lun [1 ,2 ,3 ,4 ]
机构
[1] Natl Cheng Kung Univ, Dept Stat, Tainan, Taiwan
[2] Natl Cheng Kung Univ, Inst Data Sci, Tainan, Taiwan
[3] Natl Cheng Kung Univ, Dept Stat, Tainan 701, Taiwan
[4] Natl Cheng Kung Univ, Inst Data Sci, Tainan 701, Taiwan
关键词
Censored data; ECM algorithm; Expected information matrix; Mild outliers; Truncated multivariate contaminated normal distribution; BETA-CAROTENE; MIXTURES; OUTLIERS;
D O I
10.1080/10618600.2023.2184375
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
The Multivariate Contaminated Normal (MCN) distribution which contains two extra parameters with respect to parameters of the multivariate normal distribution, one for controlling the proportion of mild outliers and the other for specifying the degree of contamination, has been widely applied in robust statistics in the case of elliptically heavy-tailed empirical distributions. This article extends the MCN model to data with possibly censored values due to limits of quantification, referred to as the MCN with censoring (MCN-C) model, and further establishes the censored multivariate linear regression model where the random errors have the MCN distribution, named as the MCN censored regression (MCN-CR) model. Two computationally feasible Expectation Conditional Maximization (ECM) algorithms are developed for maximum likelihood estimation of MCN-C and MCN-CR models. An information-based method is used to approximate the standard errors of location parameters and regression coefficients. The capability and effectiveness of the MCN-C and MCN-CR models are illustrated via two real-data examples. A simulation study is conducted to investigate the superiority of the proposed models in terms of fit, accuracy of parameter estimation and censored data recovery as compared with classical approaches. for this article are available online.
引用
收藏
页码:1671 / 1684
页数:14
相关论文
共 42 条
[1]   MIXTURE-MODELS, OUTLIERS, AND THE EM ALGORITHM [J].
AITKIN, M ;
WILSON, GT .
TECHNOMETRICS, 1980, 22 (03) :325-331
[2]  
Akaike H., 1998, International Symposium on Information Theory, Budapest, Proceedings, P199, DOI DOI 10.1007/978-1-4612-1694-015
[3]  
[Anonymous], 2015, Robust cluster analysis and variable selection
[4]  
[Anonymous], 2003, VDEQ TECHN B
[5]   The multivariate leptokurtic-normal distribution and its application in model-based clustering [J].
Bagnato, Luca ;
Punzo, Antonio ;
Zoia, Maria G. .
CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 2017, 45 (01) :95-119
[6]   Conversion of β-carotene to retinal pigment [J].
Biesalski, Hans K. ;
Chichili, Gurunadh R. ;
Frank, Juergen ;
Von Lintig, Johannes ;
Nohr, Donatus .
VITAMIN A, 2007, 75 :117-+
[8]  
COHEN AC, 1959, B INT STATIST INST, V1, P217
[9]   Linear censored regression models with scale mixtures of normal distributions [J].
Garay, Aldo M. ;
Lachos, Victor H. ;
Bolfarine, Heleno ;
Cabral, Celso R. B. .
STATISTICAL PAPERS, 2017, 58 (01) :247-278
[10]   Nonlinear censored regression models with heavy-tailed distributions [J].
Garay, Aldo M. ;
Lachos, Victor H. ;
Lin, Tsung-I .
STATISTICS AND ITS INTERFACE, 2016, 9 (03) :281-293