A matter of phylogenetic scale: Distinguishing incomplete lineage sorting from lateral gene transfer as the cause of gene tree discord in recent versus deep diversification histories

被引:25
作者
Knowles, L. Lacey [1 ]
Huang, Huateng [1 ]
Sukumaran, Jeet [1 ]
Smith, Stephen A. [1 ]
机构
[1] Univ Michigan, Dept Ecol & Evolutionary Biol, Museum Zool, 1109 Geddes Ave, Ann Arbor, MI 48109 USA
关键词
CLASSIPHY; coalescence; gene-tree discord; incomplete lineage sorting; lateral gene transfer; species tree; SPECIES TREES; LAND PLANTS; SIMULATION; COALESCENT; INCONGRUENCE; EVOLUTION; INFERENCE; ACCURACY; PACKAGE; ORIGIN;
D O I
10.1002/ajb2.1064
中图分类号
Q94 [植物学];
学科分类号
071001 ;
摘要
Premise of the Study: Discordant gene trees are commonly encountered when sequences from thousands of loci are applied to estimate phylogenetic relationships. Several processes contribute to this discord. Yet, we have no methods that jointly model different sources of conflict when estimating phylogenies. An alternative to analyzing entire genomes or all the sequenced loci is to identify a subset of loci for phylogenetic analysis. If we can identify data partitions that are most likely to reflect descent from a common ancestor (i.e., discordant loci that indeed reflect incomplete lineage sorting [ILS], as opposed to some other process, such as lateral gene transfer [LGT]), we can analyze this subset using powerful coalescent-based species-tree approaches. Methods: Test data sets were simulated where discord among loci could arise from ILS and LGT. Data sets where analyzed using the newly developed program CLASSIPHY (Huang etal., ) to assess whether our ability to distinguish the cause of discord among loci varied when ILS and LGT occurred in the recent versus deep past and whether the accuracy of these inferences were affected by the mutational process. Key Results: We show that accuracy of probabilistic classification of individual loci by the cause of discord differed when ILS and LGT events occurred more recently compared with the distant past and that the signal-to-noise ratio arising from the mutational process contributes to difficulties in inferring LGT data partitions. Conclusions: We discuss our findings in terms of the promise and limitations of identifying subsets of loci for species-tree inference that will not violate the underlying coalescent model (i.e., data partitions in which ILS, and not LGT, contributes to discord). We also discuss the empirical implications of our work given the many recalcitrant nodes in the tree of life (e.g., origins of angiosperms, amniotes, or Neoaves), and recent arguments for concatenating loci.
引用
收藏
页码:376 / 384
页数:9
相关论文
共 50 条
  • [1] [Anonymous], 2010, ESTIMATING SPECIES T
  • [2] Blischak P., 2017, BIORXIV, DOI [10. 1016/10. 1101/188037, DOI 10.1016/10.1101/188037]
  • [3] Genome-scale coestimation of species and gene trees
    Boussau, Bastien
    Szoellosi, Gergely J.
    Duret, Laurent
    Gouy, Manolo
    Tannier, Eric
    Daubin, Vincent
    [J]. GENOME RESEARCH, 2013, 23 (02) : 323 - 330
  • [4] Bayes Factors Unmask Highly Variable Information Content, Bias, and Extreme Influence in Phylogenomic Analyses
    Brown, Jeremy M.
    Thomson, Robert C.
    [J]. SYSTEMATIC BIOLOGY, 2017, 66 (04) : 517 - 530
  • [5] Conflicting Phylogenies for Early Land Plants are Caused by Composition Biases among Synonymous Substitutions
    Cox, Cymon J.
    Li, Blaise
    Foster, Peter G.
    Embley, T. Martin
    Civan, Peter
    [J]. SYSTEMATIC BIOLOGY, 2014, 63 (02) : 272 - 279
  • [6] The supermatrix approach to systematics
    de Queiroz, Alan
    Gatesy, John
    [J]. TRENDS IN ECOLOGY & EVOLUTION, 2007, 22 (01) : 34 - 41
  • [7] IS A NEW AND GENERAL THEORY OF MOLECULAR SYSTEMATICS EMERGING?
    Edwards, Scott V.
    [J]. EVOLUTION, 2009, 63 (01) : 1 - 19
  • [8] Extensive introgression in a malaria vector species complex revealed by phylogenomics
    Fontaine, Michael C.
    Pease, James B.
    Steele, Aaron
    Waterhouse, Robert M.
    Neafsey, Daniel E.
    Sharakhov, Igor V.
    Jiang, Xiaofang
    Hall, Andrew B.
    Catteruccia, Flaminia
    Kakani, Evdoxia
    Mitchell, Sara N.
    Wu, Yi-Chieh
    Smith, Hilary A.
    Love, R. Rebecca
    Lawniczak, Mara K.
    Slotman, Michel A.
    Emrich, Scott J.
    Hahn, Matthew W.
    Besansky, Nora J.
    [J]. SCIENCE, 2015, 347 (6217) : 1258524
  • [9] Modeling compositional heterogeneity
    Foster, PG
    [J]. SYSTEMATIC BIOLOGY, 2004, 53 (03) : 485 - 495
  • [10] Phylogenetic analysis at deep timescales: Unreliable gene trees, bypassed hidden support, and the coalescence/concatalescence conundrum
    Gatesy, John
    Springer, Mark S.
    [J]. MOLECULAR PHYLOGENETICS AND EVOLUTION, 2014, 80 : 231 - 266