High-resolution species trees without concatenation

被引:480
作者
Edwards, Scott V. [1 ]
Liu, Liang
Pearl, Dennis K.
机构
[1] Harvard Univ, Dept Organism & Evolutionary Biol, Cambridge, MA 02138 USA
[2] Harvard Univ, Museum Comparat Zool, Cambridge, MA 02138 USA
[3] Ohio State Univ, Dept Surg, Columbus, OH 43210 USA
关键词
coalescent theory; importance sampling; molecular clock; yeast;
D O I
10.1073/pnas.0607004104
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The vast majority of phylogenetic models focus on resolution of gene trees, despite the fact that phylogenies of species in which gene trees are embedded are of primary interest. We analyze a Bayesian model for estimating species trees that accounts for the stochastic variation expected for gene trees from multiple unlinked loci sampled from a single species history after a coalescent process. Application of the model to a 106-gene data set from yeast shows that the set of gene trees recovered by statistically acknowledging the shared but unknown species tree from which gene trees are sampled is much reduced compared with treating the history of each locus independently of an overarching species tree. The analysis also yields a concentrated posterior distribution of the yeast species tree whose mode is congruent with the concatenated gene tree but can do so with less than half the loci required by the concatenation method. Using simulations, we show that, with large numbers of loci, highly resolved species trees can be estimated under conditions in which concatenation of sequence data will positively mislead phylogeny, and when the proportion of gene trees matching the species tree is <10%. However, when gene tree/species tree congruence is high, species trees can be resolved with just two or three loci. These results make accessible an alternative paradigm for combining data in phylogenomics that focuses attention on the singularity of species histories and away from the idiosyncrasies and multiplicities of individual gene histories.
引用
收藏
页码:5936 / 5941
页数:6
相关论文
共 46 条
  • [1] Ané C, 2007, MOL BIOL EVOL, V24, P412
  • [2] [Anonymous], 2004, Inferring Phylogenies
  • [3] Avise J.C., 2000, PHYLOGEOGRAPHY HIST, DOI DOI 10.2307/J.CTV1NZFGJ7
  • [4] Avise John C., 1994, pi
  • [5] Maximum likelihood estimation of a migration matrix and effective population sizes in n subpopulations by using a coalescent approach
    Beerli, P
    Felsenstein, J
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2001, 98 (08) : 4563 - 4568
  • [6] Supertree construction in the genomic age
    Bininda-Emonds, ORP
    [J]. MOLECULAR EVOLUTION: PRODUCING THE BIOCHEMICAL DATA, PART B, 2005, 395 : 745 - 757
  • [7] Toward automatic reconstruction of a highly resolved tree of life
    Ciccarelli, FD
    Doerks, T
    von Mering, C
    Creevey, CJ
    Snel, B
    Bork, P
    [J]. SCIENCE, 2006, 311 (5765) : 1283 - 1287
  • [8] Choosing the best genes for the job: The case for stationary genes in genome-scale phylogenetics
    Collins, TM
    Fedrigo, O
    Naylor, GJP
    [J]. SYSTEMATIC BIOLOGY, 2005, 54 (03) : 493 - 500
  • [9] Cracraft J, 2004, ASSEMBLING THE TREE OF LIFE, P468
  • [10] Discordance of species trees with their most likely gene trees
    Degnan, James H.
    Rosenberg, Noah A.
    [J]. PLOS GENETICS, 2006, 2 (05): : 762 - 768