High Dimensional Principal Component Analysis with Contaminated Data

被引:0
|
作者
Xu, Huan [1 ]
Caramanis, Constantine [2 ]
Mannor, Shie [3 ]
机构
[1] McGill Univ, Dept Elect & Comp Engn, Montreal, PQ H3A 2T5, Canada
[2] Univ Texas Austin, Dept Elect & Comp Engn, Austin, TX USA
[3] McGill Univ, Dept Elect & Comp Engn, Dept Elect Engn, Montreal, PQ, Canada
关键词
LARGEST EIGENVALUE; ALGORITHMS;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
We consider the dimensionality-reduction problem (finding a subspace approximation of observed data) for contaminated data in the high dimensional regime, where the the number of observations is of the same magnitude as the number of variables of each observation, and the data set contains some (arbitrarily) corrupted observations. We propose a High-dimensional Robust Principal Component Analysis (HRPCA) algorithm that is tractable, robust to contaminated points, and easily kernelizable. The resulting subspace has a bounded deviation from the desired one, and unlike ordinary PCA algorithms, achieves optimality in the limit case where the proportion of corrupted points goes to zero. In this extended abstract we provide the setup, our algorithm, and a statement of the main theorems, and defer all the details and proofs to the full paper, [1].
引用
收藏
页码:246 / +
页数:2
相关论文
共 50 条
  • [1] Principal component analysis for sparse high-dimensional data
    Raiko, Tapani
    Ilin, Alexander
    Karhunen, Juha
    NEURAL INFORMATION PROCESSING, PART I, 2008, 4984 : 566 - 575
  • [2] Multilevel Functional Principal Component Analysis for High-Dimensional Data
    Zipunnikov, Vadim
    Caffo, Brian
    Yousem, David M.
    Davatzikos, Christos
    Schwartz, Brian S.
    Crainiceanu, Ciprian
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2011, 20 (04) : 852 - 873
  • [3] On principal component analysis for high-dimensional XCSR
    Behdad, Mohammad
    French, Tim
    Barone, Luigi
    Bennamoun, Mohammed
    EVOLUTIONARY INTELLIGENCE, 2012, 5 (02) : 129 - 138
  • [4] High Dimensional Model Representation With Principal Component Analysis
    Hajikolaei, Kambiz Haji
    Wang, G. Gary
    JOURNAL OF MECHANICAL DESIGN, 2014, 136 (01)
  • [5] The effect of principal component analysis on machine learning accuracy with high dimensional spectral data
    Howley, T
    Madden, MG
    O'Connell, ML
    Ryder, AG
    APPLICATIONS AND INNOVATIONS IN INTELLIGENT SYSTEMS XIII, 2006, : 209 - +
  • [6] Cauchy robust principal component analysis with applications to high-dimensional data sets
    Fayomi, Aisha
    Pantazis, Yannis
    Tsagris, Michail
    Wood, Andrew T. A.
    STATISTICS AND COMPUTING, 2024, 34 (01)
  • [7] Exploring high-dimensional biological data with sparse contrastive principal component analysis
    Boileau, Philippe
    Hejazi, Nima S.
    Dudoit, Sandrine
    BIOINFORMATICS, 2020, 36 (11) : 3422 - 3430
  • [8] Adaptive local Principal Component Analysis improves the clustering of high-dimensional data
    Migenda, Nico
    Moeller, Ralf
    Schenck, Wolfram
    PATTERN RECOGNITION, 2024, 146
  • [9] Cauchy robust principal component analysis with applications to high-dimensional data sets
    Aisha Fayomi
    Yannis Pantazis
    Michail Tsagris
    Andrew T. A. Wood
    Statistics and Computing, 2024, 34
  • [10] A note on principal component analysis for multi-dimensional data
    Sun, JG
    STATISTICS & PROBABILITY LETTERS, 2000, 46 (01) : 69 - 73