A Deterministic Algorithm for Robust Location and Scatter

Cited by: 88
Authors
Hubert, Mia [1]
Rousseeuw, Peter J. [1]
Verdonck, Tim [1]
Affiliation
[1] Katholieke Univ Leuven, Dept Math, Louvain, Belgium
Keywords
Affine equivariance; Covariance; Multivariate; Outliers; Robustness; Principal component analysis; Outlier detection; Regression; Estimators; Matrix
DOI
10.1080/10618600.2012.672100
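Chinese Library Classification (CLC) number
O21 [Probability theory and mathematical statistics]; C8 [Statistics]
Discipline classification codes
020208; 070103; 0714
Abstract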
Most algorithms for highly robust estimators of multivariate location and scatter start by drawing a large number of random subsets. For instance, the FASTMCD algorithm of Rousseeuw and Van Driessen starts in this way, and then takes so-called concentration steps to obtain a more accurate approximation to the MCD. The FASTMCD algorithm is affine equivariant but not permutation invariant. In this article, we present a deterministic algorithm, denoted as DetMCD, which does not use random subsets and is even faster. It computes a small number of deterministic initial estimators, followed by concentration steps. DetMCD is permutation invariant and very close to affine equivariant. We compare it to FASTMCD and to the OGK estimator of Maronna and Zamar. We also illustrate it on real and simulated datasets, with applications involving principal component analysis, classification, and time series analysis. Supplemental material (Matlab code of the DetMCD algorithm and the datasets) is available online.
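The concentration steps mentioned in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' Matlab code. It assumes the standard form of a concentration step: given a current location and scatter estimate, keep the h observations with the smallest Mahalanobis distances and recompute the mean and covariance from them. The function names `c_step` and `concentrate`, and the starting point used in the demo (coordinatewise median with squared MADs on the diagonal), are assumptions chosen only for illustration and are not claimed to be among the initial estimators of DetMCD.

```python
import numpy as np


def c_step(X, mean, cov, h):
    """One concentration step: keep the h points closest to (mean, cov)."""
    diff = X - mean
    # Squared Mahalanobis distances; solve() avoids forming an explicit inverse.
    d2 = np.einsum("ij,ji->i", diff, np.linalg.solve(cov, diff.T))
    # A stable sort keeps the selection deterministic when distances tie.
    idx = np.sort(np.argsort(d2, kind="stable")[:h])
    subset = X[idx]
    new_mean = subset.mean(axis=0)
    new_cov = np.atleast_2d(np.cov(subset, rowvar=False, bias=True))
    return new_mean, new_cov, idx


def concentrate(X, mean, cov, h, max_iter=100):
    """Repeat concentration steps until the selected subset stops changing."""
    prev_idx = None
    for _ in range(max_iter):
        mean, cov, idx = c_step(X, mean, cov, h)
        if prev_idx is not None and np.array_equal(idx, prev_idx):
            break
        prev_idx = idx
    return mean, cov, idx


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    clean = rng.normal(size=(80, 2))
    outliers = rng.normal(loc=10.0, scale=0.5, size=(20, 2))
    X = np.vstack([clean, outliers])

    # Illustrative deterministic start (an assumption, not from the paper):
    # coordinatewise median and a diagonal matrix of squared, rescaled MADs.
    med = np.median(X, axis=0)
    mad = 1.4826 * np.median(np.abs(X - med), axis=0)
    mean, cov, idx = concentrate(X, med, np.diag(mad**2), h=75)
    print("robust mean:", mean)
    print("largest selected row index:", idx.max())
```

Because no random subsets are drawn, rerunning the sketch on the same data gives the same subset. The sketch does not handle a singular covariance matrix, which a full implementation would need to address.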
Pages: 618-637
Number of pages: 20