SPARSE PRINCIPAL COMPONENT ANALYSIS AND ITERATIVE THRESHOLDING

被引:192
作者
Ma, Zongming [1 ]
机构
[1] Univ Penn, Wharton Sch, Dept Stat, Philadelphia, PA 19104 USA
基金
美国国家科学基金会;
关键词
Dimension reduction; high-dimensional statistics; principal component analysis; principal subspace; sparsity; spiked covariance model; thresholding; CONSISTENCY; ASYMPTOTICS;
D O I
10.1214/13-AOS1097
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Principal component analysis (PCA) is a classical dimension reduction method which projects data onto the principal subspace spanned by the leading eigenvectors of the covariance matrix. However, it behaves poorly when the number of features p is comparable to, or even much larger than, the sample size n. In this paper, we propose a new iterative thresholding approach for estimating principal subspaces in the setting where the leading eigenvectors are sparse. Under a spiked covariance model, we find that the new approach recovers the principal subspace and leading eigenvectors consistently, and even optimally, in a range of high-dimensional sparse settings. Simulated examples also demonstrate its competitive performance.
引用
收藏
页码:772 / 801
页数:30
相关论文
共 50 条
[21]   Exactly Uncorrelated Sparse Principal Component Analysis [J].
Kwon, Oh-Ran ;
Lu, Zhaosong ;
Zou, Hui .
JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2024, 33 (01) :231-241
[22]   An exact approach to sparse principal component analysis [J].
Farcomeni, Alessio .
COMPUTATIONAL STATISTICS, 2009, 24 (04) :583-604
[23]   SPARSE PRINCIPAL COMPONENT ANALYSIS WITH MISSING OBSERVATIONS [J].
Park, Seyoung ;
Zhao, Hongyu .
ANNALS OF APPLIED STATISTICS, 2019, 13 (02) :1016-1042
[24]   A New Basis for Sparse Principal Component Analysis [J].
Chen, Fan ;
Rohe, Karl .
JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2024, 33 (02) :421-434
[25]   An exact approach to sparse principal component analysis [J].
Alessio Farcomeni .
Computational Statistics, 2009, 24 :583-604
[26]   Certifiably optimal sparse principal component analysis [J].
Berk, Lauren ;
Bertsimasi, Dimitris .
MATHEMATICAL PROGRAMMING COMPUTATION, 2019, 11 (03) :381-420
[27]   Optimized Adaptive Iterative Sparse Principal Component Analysis Methodology for Fault Detection and Identification in Control Valves [J].
Zhang, Jiaxin ;
Samavedham, Lakshminarayanan ;
Rangaiah, Gade Pandu ;
Dong, Lichun .
2023 62ND ANNUAL CONFERENCE OF THE SOCIETY OF INSTRUMENT AND CONTROL ENGINEERS, SICE, 2023, :1475-1480
[28]   Iterative kernel principal component analysis for image modeling [J].
Kim, KI ;
Franz, MO ;
Schölkopf, B .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2005, 27 (09) :1351-1366
[29]   Eigenvectors from Eigenvalues Sparse Principal Component Analysis [J].
Frost, H. Robert .
JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2022, 31 (02) :486-501
[30]   Collaborative filtering based on iterative principal component analysis [J].
Kim, D ;
Yum, BJ .
EXPERT SYSTEMS WITH APPLICATIONS, 2005, 28 (04) :823-830