Phantom oscillations in principal component analysis

被引:11
|
作者
Shinn, Maxwell [1 ]
机构
[1] UCL, Univ Coll London UCL, Queen Sq Inst Neurol, London WC1E 6BT, England
基金
英国生物技术与生命科学研究理事会; 英国医学研究理事会;
关键词
PCA; oscillations; dimensionality reduction; data analysis; POPULATION; DYNAMICS;
D O I
10.1073/pnas.2311420120
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Principal component analysis (PCA) is a dimensionality reduction method that is known for being simple and easy to interpret. Principal components are often interpreted as low-dimensional patterns in high-dimensional space. However, this simple interpretation fails for timeseries, spatial maps, and other continuous data. In these cases, nonoscillatory data may have oscillatory principal components. Here, we show that two common properties of data cause oscillatory principal components: smoothness and shifts in time or space. These two properties implicate almost all neuroscience data. We show how the oscillations produced by PCA, which we call "phantom oscillations," impact data analysis. We also show that traditional cross validation does not detect phantom oscillations, so we suggest procedures that do. Our findings are supported by a collection of mathematical proofs. Collectively, our work demonstrates that patterns which emerge from high-dimensional data analysis may not the data.
引用
收藏
页数:11
相关论文
共 50 条
  • [21] Local and global principal component analysis for process monitoring
    Yu, Jianbo
    JOURNAL OF PROCESS CONTROL, 2012, 22 (07) : 1358 - 1373
  • [22] A preliminary geometric structure simplification for Principal Component Analysis
    Gu, Huamao
    Lin, Tong
    Wang, Xun
    NEUROCOMPUTING, 2019, 336 : 46 - 55
  • [23] Principal component analysis: the basic building block of chemometrics
    Cordella, Christophe
    ACTUALITE CHIMIQUE, 2010, (345): : 13 - 18
  • [24] Backwards Principal Component Analysis and Principal Nested Relations
    Damon, James
    Marron, J. S.
    JOURNAL OF MATHEMATICAL IMAGING AND VISION, 2014, 50 (1-2) : 107 - 114
  • [25] Schrodinger principal-component analysis: On the duality between principal-component analysis and the Schrodinger equation
    Liu, Ziming
    Qian, Sitian
    Wang, Yixuan
    Yan, Yuxuan
    Yang, Tianyi
    PHYSICAL REVIEW E, 2021, 104 (02)
  • [26] Spectral simulation study on the influence of the principal component analysis step on principal component regression
    Hasegawa, T
    APPLIED SPECTROSCOPY, 2006, 60 (01) : 95 - 98
  • [27] Multilinear Sparse Principal Component Analysis
    Lai, Zhihui
    Xu, Yong
    Chen, Qingcai
    Yang, Jian
    Zhang, David
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2014, 25 (10) : 1942 - 1950
  • [28] Principal component analysis for interval data
    Billard, L.
    Le-Rademacher, J.
    WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2012, 4 (06): : 535 - 540
  • [29] Efficient fair principal component analysis
    Mohammad Mahdi Kamani
    Farzin Haddadpour
    Rana Forsati
    Mehrdad Mahdavi
    Machine Learning, 2022, 111 : 3671 - 3702
  • [30] Streaming Sparse Principal Component Analysis
    Yang, Wenzhuo
    Xu, Huan
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 37, 2015, 37 : 494 - 503