Phantom oscillations in principal component analysis

被引:11
|
作者
Shinn, Maxwell [1 ]
机构
[1] UCL, Univ Coll London UCL, Queen Sq Inst Neurol, London WC1E 6BT, England
基金
英国生物技术与生命科学研究理事会; 英国医学研究理事会;
关键词
PCA; oscillations; dimensionality reduction; data analysis; POPULATION; DYNAMICS;
D O I
10.1073/pnas.2311420120
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Principal component analysis (PCA) is a dimensionality reduction method that is known for being simple and easy to interpret. Principal components are often interpreted as low-dimensional patterns in high-dimensional space. However, this simple interpretation fails for timeseries, spatial maps, and other continuous data. In these cases, nonoscillatory data may have oscillatory principal components. Here, we show that two common properties of data cause oscillatory principal components: smoothness and shifts in time or space. These two properties implicate almost all neuroscience data. We show how the oscillations produced by PCA, which we call "phantom oscillations," impact data analysis. We also show that traditional cross validation does not detect phantom oscillations, so we suggest procedures that do. Our findings are supported by a collection of mathematical proofs. Collectively, our work demonstrates that patterns which emerge from high-dimensional data analysis may not the data.
引用
收藏
页数:11
相关论文
共 50 条
  • [31] Sparse Generalised Principal Component Analysis
    Smallman, Luke
    Artemiou, Andreas
    Morgan, Jennifer
    PATTERN RECOGNITION, 2018, 83 : 443 - 455
  • [32] Principal component analysis in the wavelet domain
    Lim, Yaeji
    Kwon, Junhyeon
    Oh, Hee-Seok
    PATTERN RECOGNITION, 2021, 119
  • [33] Double robust principal component analysis
    Wang, Qianqian
    Gao, QuanXue
    Sun, Gan
    Ding, Chris
    NEUROCOMPUTING, 2020, 391 : 119 - 128
  • [34] Bayesian compressive principal component analysis
    Ma, Di
    Chen, Songcan
    FRONTIERS OF COMPUTER SCIENCE, 2020, 14 (04)
  • [35] Principal Component Analysis for Fingerprint Positioning
    Zhang, Yang
    Ren, Qianqian
    Li, Jinbao
    Pan, Yu
    ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2020, PT I, 2020, 12452 : 305 - 313
  • [36] Principal component analysis on interval data
    Gioia, Federica
    Lauro, Carlo N.
    COMPUTATIONAL STATISTICS, 2006, 21 (02) : 343 - 363
  • [37] Online multilinear principal component analysis
    Han, Le
    Wu, Zhen
    Zeng, Kui
    Yang, Xiaowei
    NEUROCOMPUTING, 2018, 275 : 888 - 896
  • [38] Automatic sparse principal component analysis
    Park, Heewon
    Yamaguchi, Rui
    Imoto, Seiya
    Miyano, Satoru
    CANADIAN JOURNAL OF STATISTICS-REVUE CANADIENNE DE STATISTIQUE, 2021, 49 (03): : 678 - 697
  • [39] Bilinear Probabilistic Principal Component Analysis
    Zhao, Jianhua
    Yu, Philip L. H.
    Kwok, James T.
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2012, 23 (03) : 492 - 503
  • [40] Adaptive robust principal component analysis
    Liu, Yang
    Gao, Xinbo
    Gao, Quanxue
    Shao, Ling
    Han, Jungong
    NEURAL NETWORKS, 2019, 119 : 85 - 92