A comparison of robust principal component analysis in high dimensions

被引:0
作者
Liu, Yun [1 ]
Shu, Lianjie [1 ]
Li, Yanting [2 ]
Tian, Guoliang [3 ]
机构
[1] Univ Macau, Fac Business Adm, Macau, Peoples R China
[2] Shanghai Jiao Tong Univ, Ind Engn & Management, Shanghai, Peoples R China
[3] Southern Univ Sci & Technol, Dept Stat & Data Sci, Shenzhen, Guangdong, Peoples R China
基金
中国国家自然科学基金;
关键词
Cellwise contamination model; High-dimensional data; Principal component analysis; Robust methods; PCA; COVARIANCE; PROJECTION; MATRICES; CELLWISE;
D O I
10.1080/03610918.2025.2461583
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
It is rare for high-dimensional data to be clean in reality. There is a great need for robust principal component analysis (PCA) in high dimensional data analysis. However, most of the existing robust PCA methods are constructed based on the rowwise assumption. This assumption is too restrictive as it assumes that all the components are completely contaminated in an observation vector. It is more flexible and realistic to make the cellwise assumption in practice that only some components rather than all the components are contaminated in the observation vector. The objective of this paper is to extent the robust PCA method to the case with cellwise contaminated data and to compare with the existing rowwise methods. The proposed method can make use of information in clean elements of contaminated data, compared to the latter. The simulation results favor the proposed method.
引用
收藏
页数:17
相关论文
共 40 条
[1]  
Albert Jim, 2018, CRAN, DOI 10.32614/CRAN.package.LearnBayes
[2]   PROPAGATION OF OUTLIERS IN MULTIVARIATE DATA [J].
Alqallaf, Fatemah ;
Van Aelst, Stefan ;
Yohai, Victor J. ;
Zamar, Ruben H. .
ANNALS OF STATISTICS, 2009, 37 (01) :311-331
[3]   The Gaussian rank correlation estimator: robustness properties [J].
Boudt, Kris ;
Cornelissen, Jonathan ;
Croux, Christophe .
STATISTICS AND COMPUTING, 2012, 22 (02) :471-483
[4]   Fast computation of robust subspace estimators [J].
Cevallos-Valdiviezo, Holger ;
Van Aelst, Stefan .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2019, 134 :171-185
[5]   Spatially weighted PCA for monitoring video image data with application to additive manufacturing [J].
Colosimo, Bianca M. ;
Grasso, Marco .
JOURNAL OF QUALITY TECHNOLOGY, 2018, 50 (04) :391-417
[6]  
Croux C., 1996, COMPSTAT. Proceedings in Computational Statistics. 12th Symposium, P211
[7]   Algorithms for Projection - Pursuit robust principal component analysis [J].
Croux, C. ;
Filzmoser, P. ;
Oliveira, M. R. .
CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2007, 87 (02) :218-225
[8]   Principal component analysis based on robust estimators of the covariance or correlation matrix: Influence functions and efficiencies [J].
Croux, C ;
Haesbroeck, G .
BIOMETRIKA, 2000, 87 (03) :603-618
[10]   Overview of PCA-Based Statistical Process-Monitoring Methods for Time-Dependent, High-Dimensional Data [J].
de Ketelaere, Bart ;
Hubert, Mia ;
Schmitt, Eric .
JOURNAL OF QUALITY TECHNOLOGY, 2015, 47 (04) :318-335