Inferring Phylogenetic Networks with Maximum Pseudolikelihood under Incomplete Lineage Sorting

被引:335
作者
Solis-Lemus, Claudia [1 ]
Ane, Cecile [1 ,2 ]
机构
[1] Univ Wisconsin, Dept Stat, Madison, WI 53706 USA
[2] Univ Wisconsin, Dept Bot, Madison, WI USA
基金
美国国家科学基金会;
关键词
PSEUDO-LIKELIHOOD APPROACH; CONCORDANCE; INFERENCE; EVOLUTION; TREES; MODEL; IDENTIFIABILITY; HYBRIDIZATION;
D O I
10.1371/journal.pgen.1005896
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Phylogenetic networks are necessary to represent the tree of life expanded by edges to represent events such as horizontal gene transfers, hybridizations or gene flow. Not all species follow the paradigm of vertical inheritance of their genetic material. While a great deal of research has flourished into the inference of phylogenetic trees, statistical methods to infer phylogenetic networks are still limited and under development. The main disadvantage of existing methods is a lack of scalability. Here, we present a statistical method to infer phylogenetic networks from multi-locus genetic data in a pseudolikelihood framework. Our model accounts for incomplete lineage sorting through the coalescent model, and for horizontal inheritance of genes through reticulation nodes in the network. Computation of the pseudolikelihood is fast and simple, and it avoids the burdensome calculation of the full likelihood which can be intractable with many species. Moreover, estimation at the quartet-level has the added computational benefit that it is easily parallelizable. Simulation studies comparing our method to a full likelihood approach show that our pseudolikelihood approach is much faster without compromising accuracy. We applied our method to reconstruct the evolutionary relationships among swordtails and platyfishes (Xiphophorus: Poeciliidae), which is characterized by widespread hybridizations.
引用
收藏
页数:21
相关论文
共 47 条
[1]   Identifiability of a Markovian model of molecular evolution with gamma-distributed rates [J].
Allman, Elizabeth S. ;
Ane, Cecile ;
Rhodes, John A. .
ADVANCES IN APPLIED PROBABILITY, 2008, 40 (01) :229-249
[2]   Identifying the rooted species tree from the distribution of unrooted gene trees under the coalescent [J].
Allman, Elizabeth S. ;
Degnan, James H. ;
Rhodes, John A. .
JOURNAL OF MATHEMATICAL BIOLOGY, 2011, 62 (06) :833-862
[3]   The Identifiability of Covarion Models in Phylogenetics [J].
Allman, Elizabeth S. ;
Rhodes, John A. .
IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2009, 6 (01) :76-88
[4]  
Ane C., 2010, Estimating Species Trees: Practical and Theoretical Aspects, P35
[5]  
Ané C, 2007, MOL BIOL EVOL, V24, P412
[6]  
[Anonymous], INFERENCE ANCESTRAL
[7]   Weighted Quartets Phylogenetics [J].
Avni, Eliran ;
Cohen, Reuven ;
Snir, Sagi .
SYSTEMATIC BIOLOGY, 2015, 64 (02) :233-242
[8]   Slope heuristics: overview and implementation [J].
Baudry, Jean-Patrick ;
Maugis, Cathy ;
Michel, Bertrand .
STATISTICS AND COMPUTING, 2012, 22 (02) :455-470
[9]   Concordance trees, concordance factors, and the exploration of reticulate genealogy [J].
Baum, David A. .
TAXON, 2007, 56 (02) :417-426
[10]  
Bezanson J., 2014, JULIA FRESH APPROACH