Robust principal component analysis via ES-algorithm

被引:0
作者
Yaeji Lim
Yeonjoo Park
Hee-Seok Oh
机构
[1] Seoul National University,Department of Statistics
[2] University of Illinois at Urbana-Champaign,Department of Statistics
来源
Journal of the Korean Statistical Society | 2014年 / 43卷
关键词
primary 62H25; secondary 62F35; ES-algorithm; Principal component analysis; Pseudo data; Robustness;
D O I
暂无
中图分类号
学科分类号
摘要
In this paper, a new method for robust principal component analysis (PCA) is proposed. PCA is a widely used tool for dimension reduction without substantial loss of information. However, the classical PCA is vulnerable to outliers due to its dependence on the empirical covariance matrix. To avoid such weakness, several alternative approaches based on robust scatter matrix were suggested. A popular choice is ROBPCA that combines projection pursuit ideas with robust covariance estimation via variance maximization criterion. Our approach is based on the fact that PCA can be formulated as a regression-type optimization problem, which is the main difference from the previous approaches. The proposed robust PCA is derived by substituting square loss function with a robust penalty function, Huber loss function. A practical algorithm is proposed in order to implement an optimization computation, and furthermore, convergence properties of the algorithm are investigated. Results from a simulation study and a real data example demonstrate the promising empirical properties of the proposed method.
引用
收藏
页码:149 / 159
页数:10
相关论文
共 36 条
[1]  
Campbell N A(1980)Procedures in multivariate analysis I: robust covariance estimation Applied Statistics 29 231-237
[2]  
Croux C(2007)Algorithms for projection-pursuit robust principal component analysis Chemometrics and Intelligent Laboratory Systems 87 218-225
[3]  
Filzmoser P(2000)Principal component analysis based on robust estimators of the covariance or correlation matrix: influence functions and efficiencies Biometrika 87 603-618
[4]  
Oliveira M(2005)High breakdown estimators for principle components: the projection-pursuit approach revisited Journal of Multivariate Analysis 95 206-226
[5]  
Croux C(1981)Robust estimation of dispersion matrices and principal components Journal of the American Statistical Association 76 354-362
[6]  
Haesbroeck G(2005)A comparison of three procedures for robust PCA in high dimensions Austrian Journal of Statistics 34 117-126
[7]  
Croux C(1973)Robust regression: asymptotics, conjectures and Monte Carlo Annals of Statistics 1 799-821
[8]  
Ruiz-Gazen A(2008)High-breakdown robust multivariate methods Statistical Science 23 92-119
[9]  
Devlin S J(2005)ROBPCA: a new approach to robust principal components analysis Technometrics 47 64-79
[10]  
Gnanadesikan R(1979)Between-groups comparison of principal components Journal of the American Statistical Association 74 703-707