Finding an unknown number of multivariate outliers

被引:112
作者
Riani, Marco [2 ]
Atkinson, Anthony C. [1 ]
Cerioli, Andrea [2 ]
机构
[1] Univ London London Sch Econ & Polit Sci, Dept Stat, London WC2A 2AE, England
[2] Univ Parma, I-43100 Parma, Italy
关键词
Forward search; Graphics; Logistic plots; Mahalanobis distance; Minimum covariance determinant; Order statistics; Power comparisons; Simultaneous inference; Truncated distributions; Very robust methods; MULTIPLE OUTLIERS; COVARIANCE; IDENTIFICATION; ASYMPTOTICS; MATRIX; ESTIMATORS; ALGORITHM; POINTS;
D O I
10.1111/j.1467-9868.2008.00692.x
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We use the forward search to provide robust Mahalanobis distances to detect the presence of outliers in a sample of multivariate normal data. Theoretical results on order statistics and on estimation in truncated samples provide the distribution of our test statistic. We also introduce several new robust distances with associated distributional results. Comparisons of our procedure with tests using other robust Mahalanobis distances show the good size and high power of our procedure. We also provide a unification of results on correction factors for estimation from truncated samples.
引用
收藏
页码:447 / 466
页数:20
相关论文
共 33 条
[1]  
[Anonymous], 1993, Continuous Univariate Distributions, DOI DOI 10.1016/0167-9473(96)90015-8
[2]   Exploratory tools for clustering multivariate data [J].
Atkinson, A. C. ;
Riani, M. .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2007, 52 (01) :272-285
[3]  
Atkinson A.C., 2004, SPR S STAT
[4]   Distribution tbeory and simulations for tests of outliers in regression [J].
Atkinson, Anthony C. ;
Riani, Marco .
JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2006, 15 (02) :460-476
[5]  
Becker C, 1999, J AM STAT ASSOC, V94, P947
[6]   CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING [J].
BENJAMINI, Y ;
HOCHBERG, Y .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) :289-300
[7]   ASYMPTOTICS FOR THE MINIMUM COVARIANCE DETERMINANT ESTIMATOR [J].
BUTLER, RW ;
DAVIES, PL ;
JHUN, M .
ANNALS OF STATISTICS, 1993, 21 (03) :1385-1400
[8]  
Casella G., 2002, STAT INFERENCE
[9]   An adaptive trimmed likelihood algorithm for identification of multivariate outliers [J].
Clarke, Brenton R. ;
Schubert, Daniel D. .
AUSTRALIAN & NEW ZEALAND JOURNAL OF STATISTICS, 2006, 48 (03) :353-371
[10]  
COOK RD, 1990, J AM STAT ASSOC, V85, P640, DOI 10.2307/2289996