Outlier detection in the multiple cluster setting using the minimum covariance determinant estimator

被引:157
作者
Hardin, J
Rocke, DM
机构
[1] Pomona Coll, Dept Math, Claremont, CA 91711 USA
[2] Univ Calif Davis, Ctr Image Proc & Integrated Computing, Davis, CA 95616 USA
关键词
minimum covariance determinant; robust clustering; outlier detection;
D O I
10.1016/S0167-9473(02)00280-3
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Mahalanobis-type distances in which the shape matrix is derived from a consistent high-breakdown robust multivariate location and scale estimator can be used to find outlying points. Hardin and Rocke (http://www.cipic.ucdavis.edu/similar todmrocke/preprints.html) developed a new method for identifying outliers in a one-cluster setting using an F distribution. We extend the method to the multiple cluster case which gives a robust clustering method in conjunction with an outlier identification method. We provide results of the F distribution method for multiple clusters which have different sizes and shapes. (C) 2002 Elsevier B.V. All rights reserved.
引用
收藏
页码:625 / 638
页数:14
相关论文
共 20 条
[1]  
[Anonymous], 1987, ROBUST REGRESSION OU
[2]   FAST VERY ROBUST METHODS FOR THE DETECTION OF MULTIPLE OUTLIERS [J].
ATKINSON, AC .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1994, 89 (428) :1329-1339
[3]  
BARNETT B, 1994, OUTLIERS STAT DATA
[4]   Influence function and efficiency of the minimum covariance determinant scatter matrix estimator [J].
Croux, C ;
Haesbroeck, G .
JOURNAL OF MULTIVARIATE ANALYSIS, 1999, 71 (02) :161-190
[5]  
Everitt B., 1993, CLUSTER ANAL
[6]   MCLUST: Software for model-based cluster analysis [J].
Fraley, C ;
Raftery, AE .
JOURNAL OF CLASSIFICATION, 1999, 16 (02) :297-306
[7]   ROBUST ESTIMATES, RESIDUALS, AND OUTLIER DETECTION WITH MULTIRESPONSE DATA [J].
GNANADESIKAN, R ;
KETTENRING, JR .
BIOMETRICS, 1972, 28 (01) :81-+
[8]  
HADI AS, 1992, J ROY STAT SOC B MET, V54, P761
[9]  
HARDIN J, 2002, DISTRIBUTION ROBUST
[10]  
Hawkins D.M, 1980, IDENTIFICATION OUTLI, V11, DOI [10.1007/978-94-015-3994-4, DOI 10.1007/978-94-015-3994-4]