Large Dimensional Analysis of Robust M-Estimators of Covariance With Outliers

被引:14
作者
Morales-Jimenez, David [1 ]
Couillet, Romain [2 ]
McKay, Matthew R. [1 ]
机构
[1] Hong Kong Univ Sci & Technol, Dept Elect & Comp Engn, Kowloon, Hong Kong, Peoples R China
[2] Univ Paris 11, Cent Supelec, CNRS, Lab Signaux & Syst,L2S,UMR8506, F-91192 Gif Sur Yvette, France
关键词
M-estimation; outliers; random matrix theory; robust statistics; EIGENVALUES; LOCATION;
D O I
10.1109/TSP.2015.2460225
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
A large dimensional characterization of robust M-estimators of covariance (or scatter) is provided under the assumption that the dataset comprises independent (essentially Gaussian) legitimate samples as well as arbitrary deterministic samples, referred to as outliers. Building upon recent random matrix advances in the area of robust statistics, we specifically show that the so-called Maronna M-estimator of scatter asymptotically behaves similar to well-known random matrices when the population and sample sizes grow together to infinity. The introduction of outliers leads the robust estimator to behave asymptotically as the weighted sum of the sample outer products, with a constant weight for all legitimate samples and different weights for the outliers. A fine analysis of this structure reveals importantly that the propensity of the M-estimator to attenuate (or enhance) the impact of outliers is mostly dictated by the alignment of the outliers with the inverse population covariance matrix of the legitimate samples. Thus, robust M-estimators can bring substantial benefits over more simplistic estimators such as the per-sample normalized version of the sample covariance matrix, which is not capable of differentiating the outlying samples. The analysis shows that, within the class of Maronna's estimators of scatter, Huber estimator is more favorable (in a sense to be defined) for rejecting outliers than classical alternatives such as Tyler's scale invariant estimator, often preferred in the literature. In fact, the analysis reveals that estimators similar to Tyler's run the risk of enhancing (instead of mitigating) some outliers.
引用
收藏
页码:5784 / 5797
页数:14
相关论文
共 31 条
  • [1] Bai Z., 2009, Spectral Analysis of Large Dimensional Random Matrices, VSecond Edition
  • [2] Bai ZD, 1998, ANN PROBAB, V26, P316
  • [3] PERFORMANCE ANALYSIS OF SOME EIGEN-BASED HYPOTHESIS TESTS FOR COLLABORATIVE SENSING
    Bianchi, Pascal
    Najim, Jamal
    Maida, Mylene
    Debbah, Merouane
    [J]. 2009 IEEE/SP 15TH WORKSHOP ON STATISTICAL SIGNAL PROCESSING, VOLS 1 AND 2, 2009, : 5 - +
  • [4] Regularized estimation of large covariance matrices
    Bickel, Peter J.
    Levina, Elizaveta
    [J]. ANNALS OF STATISTICS, 2008, 36 (01) : 199 - 227
  • [5] Robust Shrinkage Estimation of High-Dimensional Covariance Matrices
    Chen, Yilun
    Wiesel, Ami
    Hero, Alfred O., III
    [J]. IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2011, 59 (09) : 4097 - 4107
  • [6] On the Convergence of Maronna's M-Estimators of Scatter
    Chitour, Yacine
    Couillet, Romain
    Pascal, Frederic
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2015, 22 (06) : 709 - 712
  • [7] Couillet R., 2014, ARXIV14100817
  • [8] Robust spiked random matrices and a robust G-MUSIC estimator
    Couillet, Romain
    [J]. JOURNAL OF MULTIVARIATE ANALYSIS, 2015, 140 : 139 - 161
  • [9] The random matrix regime of Maronna's M-estimator with elliptically distributed samples
    Couillet, Romain
    Pascal, Frederic
    Silverstein, Jack W.
    [J]. JOURNAL OF MULTIVARIATE ANALYSIS, 2015, 139 : 56 - 78
  • [10] Robust Estimates of Covariance Matrices in the Large Dimensional Regime
    Couillet, Romain
    Pascal, Frederic
    Silverstein, Jack W.
    [J]. IEEE TRANSACTIONS ON INFORMATION THEORY, 2014, 60 (11) : 7269 - 7278