Phantom oscillations in principal component analysis

被引:11
|
作者
Shinn, Maxwell [1 ]
机构
[1] UCL, Univ Coll London UCL, Queen Sq Inst Neurol, London WC1E 6BT, England
基金
英国生物技术与生命科学研究理事会; 英国医学研究理事会;
关键词
PCA; oscillations; dimensionality reduction; data analysis; POPULATION; DYNAMICS;
D O I
10.1073/pnas.2311420120
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Principal component analysis (PCA) is a dimensionality reduction method that is known for being simple and easy to interpret. Principal components are often interpreted as low-dimensional patterns in high-dimensional space. However, this simple interpretation fails for timeseries, spatial maps, and other continuous data. In these cases, nonoscillatory data may have oscillatory principal components. Here, we show that two common properties of data cause oscillatory principal components: smoothness and shifts in time or space. These two properties implicate almost all neuroscience data. We show how the oscillations produced by PCA, which we call "phantom oscillations," impact data analysis. We also show that traditional cross validation does not detect phantom oscillations, so we suggest procedures that do. Our findings are supported by a collection of mathematical proofs. Collectively, our work demonstrates that patterns which emerge from high-dimensional data analysis may not the data.
引用
收藏
页数:11
相关论文
共 50 条
  • [1] Weighted Principal Component Analysis
    Fan, Zizhu
    Liu, Ergen
    Xu, Baogen
    ARTIFICIAL INTELLIGENCE AND COMPUTATIONAL INTELLIGENCE, PT III, 2011, 7004 : 569 - 574
  • [2] Ensemble Principal Component Analysis
    Dorabiala, Olga
    Aravkin, Aleksandr Y.
    Kutz, J. Nathan
    IEEE ACCESS, 2024, 12 : 6663 - 6671
  • [4] Sea surface temperature patterns in the Tropical Atlantic: Principal component analysis and nonlinear principal component analysis
    Kenfack, Christian Sadem
    Mkankam, Francois Kamga
    Alory, Gael
    du Penhoat, Yves
    Hounkonnou, Mahouton Norbert
    Vondou, Derbetini Appolinaire
    Nfor, Bawe Gerard, Jr.
    TERRESTRIAL ATMOSPHERIC AND OCEANIC SCIENCES, 2017, 28 (03): : 395 - 410
  • [5] Data Analysis Using Principal Component Analysis
    Sehgal, Shrub
    Singh, Harpreet
    Agarwal, Mohit
    Bhasker, V.
    Shantanu
    2014 INTERNATIONAL CONFERENCE ON MEDICAL IMAGING, M-HEALTH & EMERGING COMMUNICATION SYSTEMS (MEDCOM), 2015, : 45 - 48
  • [6] Real Time Principal Component Analysis
    Chowdhury, Ranak Roy
    Adnan, Muhammad Abdullah
    Gupta, Rajesh K.
    2019 IEEE 35TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2019), 2019, : 1678 - 1681
  • [7] Global Modular Principal Component Analysis
    Kadappa, Vijayakumar
    Negi, Atul
    SIGNAL PROCESSING, 2014, 105 : 381 - 388
  • [8] Principal component analysis
    Bro, Rasmus
    Smilde, Age K.
    ANALYTICAL METHODS, 2014, 6 (09) : 2812 - 2831
  • [9] Segmented principal component transform-principal component analysis
    Barros, AS
    Rutledge, DN
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2005, 78 (1-2) : 125 - 137
  • [10] Principal component analysis based on graph embedding
    Ju, Fujiao
    Sun, Yanfeng
    Li, Jianqiang
    Zhang, Yaxiao
    Piao, Xinglin
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (05) : 7105 - 7116