FastNet: Fast and Accurate Statistical Inference of Phylogenetic Networks Using Large-Scale Genomic Sequence Data

被引:10
作者
Hejase, Hussein A. [1 ]
VandePol, Natalie [2 ]
Bonito, Gregory M. [2 ]
Liu, Kevin J. [3 ]
机构
[1] Cold Spring Harbor Lab, Simons Ctr Quantitat Biol, POB 100, Cold Spring Harbor, NY 11724 USA
[2] Michigan State Univ, Dept Plant Soil & Microbial Sci, E Lansing, MI 48824 USA
[3] Michigan State Univ, Dept Comp Sci & Engn, E Lansing, MI 48824 USA
来源
COMPARATIVE GENOMICS (RECOMB-CG 2018) | 2018年 / 11183卷
关键词
SPECIES TREE ESTIMATION; MODEL; EVOLUTION;
D O I
10.1007/978-3-030-00834-5_14
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
An emerging discovery in phylogenomics is that interspecific gene flow has played a major role in the evolution of many different organisms. To what extent is the Tree of Life not truly a tree reflecting strict "vertical" divergence, but rather a more general graph structure known as a phylogenetic network which also captures "horizontal" gene flow? The answer to this fundamental question not only depends upon densely sampled and divergent genomic sequence data, but also computational methods which are capable of accurately and efficiently inferring phylogenetic networks from large-scale genomic sequence datasets. Recent methodological advances have attempted to address this gap. However, in the 2016 performance study of Hejase and Liu, state-of-the-art methods fell well short of the scalability requirements of existing phylogenomic studies. The methodological gap remains: how can phylogenetic networks be accurately and efficiently inferred using genomic sequence data involving many dozens or hundreds of taxa? In this study, we address this gap by proposing a new phylogenetic divide-and-conquer method which we call FastNet. We conduct a performance study involving a range of evolutionary scenarios, and we demonstrate that FastNet outperforms state-of-the-art methods in terms of computational efficiency and topological accuracy.
引用
收藏
页码:242 / 259
页数:18
相关论文
共 49 条
  • [11] Testing for Ancient Admixture between Closely Related Populations
    Durand, Eric Y.
    Patterson, Nick
    Reich, David
    Slatkin, Montgomery
    [J]. MOLECULAR BIOLOGY AND EVOLUTION, 2011, 28 (08) : 2239 - 2252
  • [12] IS A NEW AND GENERAL THEORY OF MOLECULAR SYSTEMATICS EMERGING?
    Edwards, Scott V.
    [J]. EVOLUTION, 2009, 63 (01) : 1 - 19
  • [13] CASES IN WHICH PARSIMONY OR COMPATIBILITY METHODS WILL BE POSITIVELY MISLEADING
    FELSENSTEIN, J
    [J]. SYSTEMATIC ZOOLOGY, 1978, 27 (04): : 401 - 410
  • [14] Which Phylogenetic Networks are Merely Trees with Additional Arcs?
    Francis, Andrew R.
    Steel, Mike
    [J]. SYSTEMATIC BIOLOGY, 2015, 64 (05) : 768 - 777
  • [15] Dimensions of Horizontal Gene Transfer in Eukaryotic Microbial Pathogens
    Gluck-Thaler, Emile
    Slot, Jason C.
    [J]. PLoS Pathogens, 2015, 11 (10)
  • [16] A Draft Sequence of the Neandertal Genome
    Green, Richard E.
    Krause, Johannes
    Briggs, Adrian W.
    Maricic, Tomislav
    Stenzel, Udo
    Kircher, Martin
    Patterson, Nick
    Li, Heng
    Zhai, Weiwei
    Fritz, Markus Hsi-Yang
    Hansen, Nancy F.
    Durand, Eric Y.
    Malaspinas, Anna-Sapfo
    Jensen, Jeffrey D.
    Marques-Bonet, Tomas
    Alkan, Can
    Pruefer, Kay
    Meyer, Matthias
    Burbano, Hernan A.
    Good, Jeffrey M.
    Schultz, Rigo
    Aximu-Petri, Ayinuer
    Butthof, Anne
    Hoeber, Barbara
    Hoeffner, Barbara
    Siegemund, Madlen
    Weihmann, Antje
    Nusbaum, Chad
    Lander, Eric S.
    Russ, Carsten
    Novod, Nathaniel
    Affourtit, Jason
    Egholm, Michael
    Verna, Christine
    Rudan, Pavao
    Brajkovic, Dejana
    Kucan, Zeljko
    Gusic, Ivan
    Doronichev, Vladimir B.
    Golovanova, Liubov V.
    Lalueza-Fox, Carles
    de la Rasilla, Marco
    Fortea, Javier
    Rosas, Antonio
    Schmitz, Ralf W.
    Johnson, Philip L. F.
    Eichler, Evan E.
    Falush, Daniel
    Birney, Ewan
    Mullikin, James C.
    [J]. SCIENCE, 2010, 328 (5979) : 710 - 722
  • [17] Hein J., 2004, Gene Genealogies, Variation and Evolution: a Primer in Coalescent Theory
  • [18] A scalability study of phylogenetic network inference methods using empirical datasets and simulations involving a single reticulation
    Hejase, Hussein A.
    Liu, Kevin J.
    [J]. BMC BIOINFORMATICS, 2016, 17
  • [19] Generating samples under a Wright-Fisher neutral model of genetic variation
    Hudson, RR
    [J]. BIOINFORMATICS, 2002, 18 (02) : 337 - 338
  • [20] SUCCESS OF PHYLOGENETIC METHODS IN THE 4-TAXON CASE
    HUELSENBECK, JP
    HILLIS, DM
    [J]. SYSTEMATIC BIOLOGY, 1993, 42 (03) : 247 - 264