Comparative Analysis of Principal Components Can be Misleading

Cited by: 199
Authors
Uyeda, Josef C. [1 ]
Caetano, Daniel S. [1 ]
Pennell, Matthew W. [1 ]
Affiliations
[1] Univ Idaho, Inst Bioinformat & Evolutionary Studies, Dept Biol Sci, Moscow, ID 83844 USA
Funding
National Science Foundation (USA); Natural Sciences and Engineering Research Council of Canada
Keywords
Brownian motion; early burst; multivariate evolution; Ornstein-Uhlenbeck; phylogenetic comparative methods; principal components analysis; quantitative genetics; CLIMATIC-NICHE EVOLUTION; TRAIT EVOLUTION; PHYLOGENETIC ANALYSIS; ADAPTIVE RADIATION; EARLY BURSTS; BODY-SIZE; ADAPTATION; SHAPE; DIVERSIFICATION; ALLOMETRY;
DOI
10.1093/sysbio/syv019
Chinese Library Classification
Q [Biological Sciences]
Discipline classification codes
07; 0710; 09
Abstract
Most existing methods for modeling trait evolution are univariate, although researchers are often interested in investigating evolutionary patterns and processes across multiple traits. Principal components analysis (PCA) is commonly used to reduce the dimensionality of multivariate data so that univariate trait models can be fit to individual principal components. The problem with using standard PCA on phylogenetically structured data has been previously pointed out, yet it continues to be widely used in the literature. Here we demonstrate precisely how using standard PCA can mislead inferences: The first few principal components of traits evolved under constant-rate multivariate Brownian motion will appear to have evolved via an "early burst" process. A phylogenetic PCA (pPCA) has been proposed to alleviate these issues. However, when the true model of trait evolution deviates from the model assumed in the calculation of the pPCA axes, we find that the use of pPCA suffers from similar artifacts as standard PCA. We show that data sets with high effective dimensionality are particularly likely to lead to erroneous inferences. Ultimately, all of the problems we report stem from the same underlying issue: by considering only the first few principal components as univariate traits, we are effectively examining a biased sample of a multivariate pattern. These results highlight the need for truly multivariate phylogenetic comparative methods. As these methods are still being developed, we discuss potential alternative strategies for using and interpreting models fit to univariate axes of multivariate data.
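The contrast between standard PCA and pPCA described in the abstract can be illustrated with a minimal sketch. It is not the authors' code: the balanced tree with unit branch lengths, the independent equal-rate traits, and all function names are assumptions made for illustration, and the pPCA step follows the commonly described formulation (a GLS estimate of the root state and a rate matrix weighted by the inverse phylogenetic covariance), which the abstract itself does not spell out.

```python
import numpy as np

def balanced_tree_cov(depth):
    """Phylogenetic covariance C for a balanced binary tree with unit
    branch lengths (hypothetical tree, chosen only for illustration).
    C[i, j] is the length of the root-to-ancestor path shared by tips i and j."""
    n = 2 ** depth
    C = np.empty((n, n))
    for i in range(n):
        for j in range(n):
            # Tips share their leading bits; the first differing bit marks the split.
            C[i, j] = depth - (i ^ j).bit_length()
    return C

def simulate_bm(C, n_traits, rng):
    """Constant-rate Brownian motion: each trait is drawn from N(0, C).
    Traits are independent with equal rates (an assumption of this sketch)."""
    L = np.linalg.cholesky(C)
    return L @ rng.standard_normal((C.shape[0], n_traits))

def standard_pca(X):
    """Ordinary PCA: center on the arithmetic mean and ignore the tree."""
    Xc = X - X.mean(axis=0)
    cov = Xc.T @ Xc / (X.shape[0] - 1)
    evals, evecs = np.linalg.eigh(cov)
    order = np.argsort(evals)[::-1]  # largest variance first
    return evals[order], Xc @ evecs[:, order]

def phylo_pca(X, C):
    """Phylogenetic PCA: center on the GLS root estimate and weight the
    trait covariance by the inverse of C before the eigendecomposition."""
    n = X.shape[0]
    Cinv = np.linalg.inv(C)
    ones = np.ones(n)
    root = (ones @ Cinv @ X) / (ones @ Cinv @ ones)  # phylogenetic mean
    Xc = X - root
    R = Xc.T @ Cinv @ Xc / (n - 1)  # estimated evolutionary rate matrix
    evals, evecs = np.linalg.eigh(R)
    order = np.argsort(evals)[::-1]
    return evals[order], Xc @ evecs[:, order]

rng = np.random.default_rng(0)
C = balanced_tree_cov(5)        # 32 tips
X = simulate_bm(C, 10, rng)     # 10 traits
pca_vals, pca_scores = standard_pca(X)
ppca_vals, ppca_scores = phylo_pca(X, C)
```

Either set of scores could then be passed, one axis at a time, to a univariate trait model; the abstract's warning is that fitting models only to the leading axes examines a biased sample of the multivariate pattern.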
Pages: 677-689
Page count: 13