Bayesian inference of phylogenetic networks from bi-allelic genetic markers

Cited by: 27
Authors
Zhu, Jiafan [1]
Wen, Dingqiao [1]
Yu, Yun [1]
Meudt, Heidi M. [2]
Nakhleh, Luay [1, 3]
Affiliations
[1] Rice Univ, Comp Sci, Houston, TX 77005 USA
[2] Museum New Zealand Te Papa Tongarewa, Wellington, New Zealand
[3] Rice Univ, BioSci, Houston, TX 77005 USA
Funding
National Science Foundation (USA)
Keywords
SPECIES TREES; MAXIMUM-LIKELIHOOD; COALESCENT MODEL; DNA-SEQUENCES; HYBRIDIZATION; INTROGRESSION; PHYLOGENOMICS; EVOLUTION; GENOME; LIGHT;
DOI
10.1371/journal.pcbi.1005932
Chinese Library Classification number
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Phylogenetic networks are rooted, directed, acyclic graphs that model reticulate evolutionary histories. Recently, statistical methods were devised for inferring such networks from either gene tree estimates or the sequence alignments of multiple unlinked loci. Bi-allelic markers, most notably single nucleotide polymorphisms (SNPs) and amplified fragment length polymorphisms (AFLPs), provide a powerful source of genome-wide data. In a recent paper, a method called SNAPP was introduced for statistical inference of species trees from unlinked bi-allelic markers. The generative process assumed by the method combined a model of evolution for the bi-allelic markers with the multispecies coalescent. A novel component of the method was a polynomial-time algorithm for exact computation of the likelihood of a fixed species tree via integration over all possible gene trees for a given marker. Here we report on a method for Bayesian inference of phylogenetic networks from bi-allelic markers. Our method significantly extends the algorithm for exact computation of phylogenetic network likelihood via integration over all possible gene trees. Unlike the case of species trees, the algorithm is no longer polynomial-time on all instances of phylogenetic networks. Furthermore, the method utilizes a reversible-jump MCMC technique to sample the posterior of phylogenetic networks given bi-allelic marker data. Our method performs well in terms of accuracy and robustness, as we demonstrate on simulated data as well as on a data set of multiple New Zealand species of the plant genus Ourisia (Plantaginaceae). We implemented the method in the publicly available, open-source PhyloNet software package.
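The abstract's definition of a phylogenetic network as a rooted, directed, acyclic graph can be made concrete with a small sketch. The following is an illustration only, not code from PhyloNet or from the paper: the function name `analyze_network`, the edge-list representation, and the toy network are all assumptions introduced here. It checks that a set of (parent, child) edges has exactly one root and no directed cycle, and reports the reticulation nodes (nodes with more than one parent), which are what distinguish a network from a species tree.

```python
from collections import defaultdict, deque


def analyze_network(edges):
    """Validate a rooted DAG given as (parent, child) edges (hypothetical helper).

    Returns a dict with the root, the leaves, and the reticulation nodes.
    Raises ValueError if there is not exactly one root or if a cycle exists.
    """
    children = defaultdict(list)
    indegree = defaultdict(int)
    nodes = set()
    for parent, child in edges:
        children[parent].append(child)
        indegree[child] += 1
        nodes.update((parent, child))

    # A rooted network has exactly one node without parents.
    roots = [n for n in nodes if indegree[n] == 0]
    if len(roots) != 1:
        raise ValueError(f"expected exactly one root, found {len(roots)}")

    # Kahn's algorithm: every node is visited only if there is no directed cycle.
    remaining = {n: indegree[n] for n in nodes}
    queue = deque(roots)
    visited = 0
    while queue:
        node = queue.popleft()
        visited += 1
        for child in children.get(node, []):
            remaining[child] -= 1
            if remaining[child] == 0:
                queue.append(child)
    if visited != len(nodes):
        raise ValueError("graph contains a directed cycle")

    return {
        "root": roots[0],
        "leaves": sorted(n for n in nodes if not children.get(n)),
        # Reticulation nodes have two or more parents (e.g. hybridization).
        "reticulations": sorted(n for n in nodes if indegree[n] >= 2),
    }


# Toy network (an assumption for illustration): taxon B descends from
# node "h", which inherits from both "u" and "v".
EXAMPLE_EDGES = [
    ("root", "u"), ("root", "v"),
    ("u", "A"), ("u", "h"),
    ("v", "h"), ("v", "C"),
    ("h", "B"),
]

if __name__ == "__main__":
    print(analyze_network(EXAMPLE_EDGES))
```

In this toy network, `h` is the single reticulation node. A species tree is the special case in which the list of reticulation nodes is empty.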
Pages: 32