Inferring Phylogenetic Networks with Maximum Pseudolikelihood under Incomplete Lineage Sorting

Cited by: 339
Authors
Solis-Lemus, Claudia [1]
Ane, Cecile [1, 2]
Affiliations
[1] Univ Wisconsin, Dept Stat, Madison, WI 53706 USA
[2] Univ Wisconsin, Dept Bot, Madison, WI USA
Funding
National Science Foundation (USA)
Keywords
PSEUDO-LIKELIHOOD APPROACH; CONCORDANCE; INFERENCE; EVOLUTION; TREES; MODEL; IDENTIFIABILITY; HYBRIDIZATION;
DOI
10.1371/journal.pgen.1005896
Chinese Library Classification number
Q3 [Genetics]
Discipline classification code
071007; 090102
Abstract
Phylogenetic networks are necessary to represent the tree of life expanded by edges to represent events such as horizontal gene transfers, hybridizations or gene flow. Not all species follow the paradigm of vertical inheritance of their genetic material. While a great deal of research has flourished into the inference of phylogenetic trees, statistical methods to infer phylogenetic networks are still limited and under development. The main disadvantage of existing methods is a lack of scalability. Here, we present a statistical method to infer phylogenetic networks from multi-locus genetic data in a pseudolikelihood framework. Our model accounts for incomplete lineage sorting through the coalescent model, and for horizontal inheritance of genes through reticulation nodes in the network. Computation of the pseudolikelihood is fast and simple, and it avoids the burdensome calculation of the full likelihood, which can be intractable with many species. Moreover, estimation at the quartet level has the added computational benefit that it is easily parallelizable. Simulation studies comparing our method to a full likelihood approach show that our pseudolikelihood approach is much faster without compromising accuracy. We applied our method to reconstruct the evolutionary relationships among swordtails and platyfishes (Xiphophorus: Poeciliidae), a genus characterized by widespread hybridization.
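As a rough illustration of the quartet-level pseudolikelihood idea described in the abstract, the sketch below sums, over 4-taxon subsets, the log-probability of the observed quartet counts given expected quartet concordance factors. It is a minimal sketch, not the authors' implementation: it covers only the tree case (no reticulation), where a 4-taxon tree with internal branch length t in coalescent units gives the matching quartet probability 1 - (2/3)exp(-t) and each of the other two quartets probability (1/3)exp(-t). On a network the expected values also depend on reticulation parameters, which are not modeled here. The function names and the dictionary layout are assumptions made for this example.

```python
import math
from typing import Dict, Tuple

Quartet = Tuple[str, str, str, str]


def quartet_cfs_tree(t: float) -> Tuple[float, float, float]:
    """Expected quartet concordance factors on a 4-taxon species tree.

    t is the internal branch length in coalescent units. The first value
    is for the quartet that matches the tree; the other two are the
    discordant quartets, which are equally likely under the coalescent.
    """
    minor = math.exp(-t) / 3.0
    return (1.0 - 2.0 * minor, minor, minor)


def log_pseudolikelihood(
    counts: Dict[Quartet, Tuple[int, int, int]],
    branch_lengths: Dict[Quartet, float],
) -> float:
    """Sum of per-quartet log-likelihood terms (hypothetical data layout).

    counts[q] holds the number of gene trees showing each of the three
    quartet resolutions for 4-taxon set q, ordered to match
    quartet_cfs_tree. branch_lengths[q] is the internal branch length
    that the candidate tree induces on q.
    """
    total = 0.0
    for four_taxa, observed in counts.items():
        expected = quartet_cfs_tree(branch_lengths[four_taxa])
        # Each 4-taxon set contributes an independent term, so this loop
        # could be split across workers.
        total += sum(n * math.log(p) for n, p in zip(observed, expected) if n > 0)
    return total
```

Because every 4-taxon set contributes its own term, the loop body can be evaluated independently for each subset, which is the property the abstract refers to when it calls quartet-level estimation easily parallelizable.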
Pages: 21