Comparative Analysis of Principal Components Can be Misleading

被引:199
|
作者
Uyeda, Josef C. [1 ]
Caetano, Daniel S. [1 ]
Pennell, Matthew W. [1 ]
机构
[1] Univ Idaho, Inst Bioinformat & Evolutionary Studies, Dept Biol Sci, Moscow, ID 83844 USA
基金
美国国家科学基金会; 加拿大自然科学与工程研究理事会;
关键词
Brownian motion; early burst; multivariate evolution; Ornstein-Uhlenbeck; phylogenetic comparative methods; principal components analysis; quantitative genetics; CLIMATIC-NICHE EVOLUTION; TRAIT EVOLUTION; PHYLOGENETIC ANALYSIS; ADAPTIVE RADIATION; EARLY BURSTS; BODY-SIZE; ADAPTATION; SHAPE; DIVERSIFICATION; ALLOMETRY;
D O I
10.1093/sysbio/syv019
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Most existing methods for modeling trait evolution are univariate, although researchers are often interested in investigating evolutionary patterns and processes across multiple traits. Principal components analysis (PCA) is commonly used to reduce the dimensionality of multivariate data so that univariate trait models can be fit to individual principal components. The problem with using standard PCA on phylogenetically structured data has been previously pointed out yet it continues to be widely used in the literature. Here we demonstrate precisely how using standard PCA can mislead inferences: The first few principal components of traits evolved under constant-rate multivariate Brownian motion will appear to have evolved via an "early burst" process. A phylogenetic PCA (pPCA) has been proprosed to alleviate these issues. However, when the true model of trait evolution deviates from the model assumed in the calculation of the pPCA axes, we find that the use of pPCA suffers from similar artifacts as standard PCA. We show that data sets with high effective dimensionality are particularly likely to lead to erroneous inferences. Ultimately, all of the problems we report stem from the same underlying issue-by considering only the first few principal components as univariate traits, we are effectively examining a biased sample of a multivariate pattern. These results highlight the need for truly multivariate phylogenetic comparative methods. As these methods are still being developed, we discuss potential alternative strategies for using and interpreting models fit to univariate axes of multivariate data.
引用
收藏
页码:677 / 689
页数:13
相关论文
共 50 条
  • [1] SIZE-CORRECTION AND PRINCIPAL COMPONENTS FOR INTERSPECIFIC COMPARATIVE STUDIES
    Revell, Liam J.
    EVOLUTION, 2009, 63 (12) : 3258 - 3268
  • [2] Comparative Analysis of Urban Land Intensive Utilization Based on Principal Components
    Wang Yanliang
    Wang Yanqi
    Wang Wei
    PROCEEDINGS OF 2008 INTERNATIONAL CONFERENCE ON CONSTRUCTION & REAL ESTATE MANAGEMENT, VOLS 1 AND 2, 2008, : 1604 - 1606
  • [3] Phylogenetic Principal Components Analysis and Geometric Morphometrics
    Polly, P. David
    Lawing, A. Michelle
    Fabre, Anne-Claire
    Goswami, Anjali
    HYSTRIX-ITALIAN JOURNAL OF MAMMALOGY, 2013, 24 (01): : 33 - 41
  • [4] Comparison of Principal Components Analysis, Independent Components Analysis and Common Components Analysis
    Rutledge D.N.
    Journal of Analysis and Testing, 2018, 2 (3) : 235 - 248
  • [5] Principal components analysis and cyclostationarity
    Boudou, Alain
    Viguier-Pla, Sylvie
    JOURNAL OF MULTIVARIATE ANALYSIS, 2022, 189
  • [6] Local influence in principal components analysis
    Shi, L
    BIOMETRIKA, 1997, 84 (01) : 175 - 186
  • [7] PRINCIPAL COMPONENTS-ANALYSIS (PCA)
    MACKIEWICZ, A
    RATAJCZAK, W
    COMPUTERS & GEOSCIENCES, 1993, 19 (03) : 303 - 342
  • [8] Principal components of recurrence quantification analysis of EMG
    Mewett, DT
    Reynolds, KJ
    Nazeran, H
    PROCEEDINGS OF THE 23RD ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-4: BUILDING NEW BRIDGES AT THE FRONTIERS OF ENGINEERING AND MEDICINE, 2001, 23 : 1592 - 1595
  • [9] More on local influence in principal components analysis
    Li, B
    De Moor, B
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 1999, 28 (10) : 2487 - 2495
  • [10] PRINCIPAL COMPONENTS ANALYSIS FOR RIGHT CENSORED DATA
    Langworthy, Benjamin W.
    Cai, Jianwen
    Corty, Robert W.
    Kosorok, Michael R.
    Fine, Jason P.
    STATISTICA SINICA, 2023, 33 (03) : 1985 - 2016