High-breakdown estimation of multivariate mean and covariance with missing observations

Cited by: 21
Authors
Cheng, TC
Victoria-Feser, MP
Affiliations
[1] Univ Geneva, Fac Psychol & Educ, CH-1211 Geneva 4, Switzerland
[2] Natl Chengchi Univ, Dept Stat, Taipei, Taiwan
DOI
10.1348/000711002760554615
Chinese Library Classification number
O1 [Mathematics];
Discipline classification code
0701 ; 070101 ;
Abstract
We consider the problem of outliers in incomplete multivariate data when the aim is to estimate a measure of mean and covariance, as is the case, for example, in factor analysis. The ER algorithm of Little and Smith, which combines the EM algorithm for missing data with a robust estimation step based on an M-estimator, could be used in such a situation. However, the ER algorithm as originally proposed can fail to be robust in some cases, especially in high dimensions. We propose two alternatives that avoid this problem. One is to combine a small modification of the ER algorithm with a so-called high-breakdown estimator as the starting point for the iterative procedure; the other is to base the estimation step of the ER algorithm on a high-breakdown estimator. Among the high-breakdown estimators that are built to keep their robustness properties even when the number of variables is relatively large, we consider the minimum covariance determinant estimator and the t-biweight S-estimator. Simulated and real data are used to compare and illustrate the different procedures.
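To make the EM component mentioned in the abstract concrete, the following is a minimal sketch of plain (non-robust) Gaussian EM for the mean and covariance when some entries are missing. It is not the ER algorithm and not either of the high-breakdown procedures proposed in the paper. The function name `em_mean_cov`, the NaN coding of missing values, and the available-case starting values are all assumptions made for illustration. In the ER setting described above, the unweighted update at the end of each iteration would be replaced by a robust estimation step, which is not implemented here.

```python
import numpy as np


def em_mean_cov(X, n_iter=100, tol=1e-8):
    """Plain Gaussian EM for mean and covariance; missing entries are NaN.

    Illustrative sketch only: no robust weighting, no high-breakdown start.
    """
    X = np.asarray(X, dtype=float)
    n, p = X.shape
    miss = np.isnan(X)
    # Assumed starting values: available-case means and variances.
    mu = np.nanmean(X, axis=0)
    sigma = np.diag(np.nanvar(X, axis=0))
    for _ in range(n_iter):
        X_hat = np.where(miss, 0.0, X)
        C = np.zeros((p, p))  # sum of conditional covariances of missing parts
        for i in range(n):
            m = miss[i]
            if not m.any():
                continue  # complete row, nothing to impute
            o = ~m
            if not o.any():
                # Fully missing row: the conditional law is the marginal one.
                X_hat[i] = mu
                C += sigma
                continue
            # E-step: regress the missing coordinates on the observed ones.
            B = np.linalg.solve(sigma[np.ix_(o, o)], sigma[np.ix_(o, m)]).T
            X_hat[i, m] = mu[m] + B @ (X[i, o] - mu[o])
            C[np.ix_(m, m)] += sigma[np.ix_(m, m)] - B @ sigma[np.ix_(o, m)]
        # M-step: unweighted maximum-likelihood update (divisor n).
        mu_new = X_hat.mean(axis=0)
        R = X_hat - mu_new
        sigma_new = (R.T @ R + C) / n
        change = max(np.abs(mu_new - mu).max(), np.abs(sigma_new - sigma).max())
        mu, sigma = mu_new, sigma_new
        if change < tol:
            break
    return mu, sigma
```

On complete data this reduces to the sample mean and the maximum-likelihood covariance, so a single gross outlier can move both estimates arbitrarily far. That sensitivity is what motivates adding a robust step and a high-breakdown starting point.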
Pages: 317-335
Number of pages: 19