Finite mixture modeling of censored data using the multivariate Student-t distribution

被引:24
作者
Lachos, Victor H. [1 ]
Lopez Moreno, Edgar J. [1 ]
Chen, Kun [2 ]
Barbosa Cabral, Celso Romulo [3 ]
机构
[1] Univ Estadual Campinas, Dept Estat, Campinas, SP, Brazil
[2] Univ Connecticut, Dept Stat, Mansfield, CT USA
[3] Univ Fed Amazonas, Dept Estat, Manaus, Amazonas, Brazil
基金
巴西圣保罗研究基金会; 美国国家卫生研究院; 美国国家科学基金会;
关键词
Censored data; Detection limit; EM-type algorithms; Finite mixture models; Multivariate Student-t; MIXED-EFFECTS MODELS; INFLUENCE DIAGNOSTICS; MAXIMUM-LIKELIHOOD; INCOMPLETE DATA; INFERENCE;
D O I
10.1016/j.jmva.2017.05.005
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Finite mixture models have been widely used for the modeling and analysis of data from a heterogeneous population. Moreover, data of this kind can be subject to some upper and/or lower detection limits because of the restriction of experimental apparatus. Another complication arises when measures of each population depart significantly from normality, for instance, in the presence of heavy tails or atypical observations. For such data structures, we propose a robust model for censored data based on finite mixtures of multivariate Student-t distributions. This approach allows us to model data with great flexibility, accommodating multimodality, heavy tails and also skewness depending on the structure of the mixture components. We develop an analytically simple, yet efficient, EM-type algorithm for conducting maximum likelihood estimation of the parameters. The algorithm has closed-form expressions at the E-step that rely on formulas for the mean and variance of the multivariate truncated Student-t distributions. Further, a general information-based method for approximating the asymptotic covariance matrix of the estimators is also presented. Results obtained from the analysis of both simulated and real datasets are reported to demonstrate the effectiveness of the proposed methodology. The proposed algorithm and methods are implemented in the new R package CensMixReg. (C) 2017 Elsevier Inc. All rights reserved.
引用
收藏
页码:151 / 167
页数:17
相关论文
共 44 条
[1]   NEW LOOK AT STATISTICAL-MODEL IDENTIFICATION [J].
AKAIKE, H .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1974, AC19 (06) :716-723
[2]  
[Anonymous], 2008, EM ALGORITHM EXTENSI
[3]  
[Anonymous], 1986, Statist. Sci.
[4]   Student-t censored regression model: properties and inference [J].
Arellano-Valle, Reinaldo B. ;
Castro, Luis M. ;
Gonzalez-Farias, Graciela ;
Munoz-Gajardo, Karla A. .
STATISTICAL METHODS AND APPLICATIONS, 2012, 21 (04) :453-473
[5]   ON SOME CHARACTERIZATIONS OF THE T-DISTRIBUTION [J].
ARELLANOVALLE, RB ;
BOLFARINE, H .
STATISTICS & PROBABILITY LETTERS, 1995, 25 (01) :79-85
[6]   Robust likelihood methods based on the skew-t and related distributions [J].
Azzalini, Adelchi ;
Genton, Marc G. .
INTERNATIONAL STATISTICAL REVIEW, 2008, 76 (01) :106-129
[7]   Mixture of linear mixed models using multivariate t distribution [J].
Bai, Xiuqin ;
Chen, Kun ;
Yao, Weixin .
JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2016, 86 (04) :771-787
[8]   ON RATES OF CONVERGENCE OF EFFICIENT DETECTION CRITERIA IN SIGNAL-PROCESSING WITH WHITE NOISE [J].
BAI, ZD ;
KRISHNAIAH, PR ;
ZHAO, LC .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1989, 35 (02) :380-388
[9]   Multivariate mixture modeling using skew-normal independent distributions [J].
Barbosa Cabral, Celso Romulo ;
Lachos, Victor Hugo ;
Prates, Marcos O. .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2012, 56 (01) :126-142
[10]  
Basford KE, 1997, COMPUTATION STAT, V12, P1