Unveiling outliers with robust covariance matrix estimation: a shrinkage approach

被引:0
作者
Vijayalakshmi, S. [1 ,2 ]
Sebastian, Nicy [1 ,2 ]
Sajesh, T. A. [1 ,2 ]
机构
[1] St Thomas Coll Autonomous, Dept Stat, Trichur, Kerala, India
[2] Univ Calicut, Trichur, Kerala, India
关键词
Robust estimation; Outliers; Shrinkage method; Gnanadesikan-Kettenring estimator; MULTIVARIATE LOCATION; DISPERSION;
D O I
10.1007/s00362-025-01694-x
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Outlier detection is a fundamental task in statistics, crucial for identifying anomalous observations that deviate significantly from the majority of the data. This article proposes a robust shrinkage covariance matrix based on the Gnanadesikan-Kettenring estimator and develops the shrinkage Gnanadesikan-Kettenring (SGK) method, a robust outlier detection technique for high dimensional datasets. The effectiveness of the SGK method for outlier detection is comprehensively evaluated using diverse metrics and scenarios through Monte Carlo simulation. The empirical study employs success rate and false detection rate to compare its performance with existing methods across various data characteristics, including correlated and uncorrelated data generated from multivariate normal, multivariate t, and multivariate exponential distributions. Results indicate that the SGK method consistently achieves a high success rate in identifying contaminated observations with a significantly lower false detection rate compared to other methods. Despite being non-affine equivariant, the SGK method maintains similar performance under affinely transformed data and exhibits exceptional resistance to contamination across datasets of varying sizes and dimensions. Real-world applicability is confirmed through performance evaluations on benchmark datasets, where the SGK method successfully identifies known outliers.
引用
收藏
页数:23
相关论文
共 34 条
[1]   FAST VERY ROBUST METHODS FOR THE DETECTION OF MULTIPLE OUTLIERS [J].
ATKINSON, AC .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1994, 89 (428) :1329-1339
[2]  
Barnett V., 1994, Outliers in statistical data, V3
[3]  
Becker C, 1999, J AM STAT ASSOC, V94, P947
[4]  
Brownlee K.A., 1965, A Wiley publication in applied statistics
[5]   Multivariate outlier detection based on a robust Mahalanobis distance with shrinkage estimators [J].
Cabana, Elisa ;
Lillo, Rosa E. ;
Laniado, Henry .
STATISTICAL PAPERS, 2021, 62 (04) :1583-1609
[6]  
Campbell N. A., 1980, Applied Statistics, V29, P231, DOI 10.2307/2346896
[7]  
Campbell N.A., 1989, BUSHFIRE MAPPING USI
[8]  
Daudin J.-J., 1988, J THEORETICAL APPL S, V19, P241, DOI DOI 10.1080/02331888808802095
[9]   Size matters: Optimal calibration of shrinkage estimators for portfolio selection [J].
DeMiguel, Victor ;
Martin-Utrera, Alberto ;
Nogales, Francisco J. .
JOURNAL OF BANKING & FINANCE, 2013, 37 (08) :3018-3034
[10]   ROBUST ESTIMATION OF DISPERSION MATRICES AND PRINCIPAL COMPONENTS [J].
DEVLIN, SJ ;
GNANADESIKAN, R ;
KETTENRING, JR .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1981, 76 (374) :354-362