Bayesian inference of phylogenetic networks from bi-allelic genetic markers

被引:27
作者
Zhu, Jiafan [1 ]
Wen, Dingqiao [1 ]
Yu, Yun [1 ]
Meudt, Heidi M. [2 ]
Nakhleh, Luay [1 ,3 ]
机构
[1] Rice Univ, Comp Sci, Houston, TX 77005 USA
[2] Museum New Zealand Te Papa Tongarewa, Wellington, New Zealand
[3] Rice Univ, BioSci, Houston, TX 77005 USA
基金
美国国家科学基金会;
关键词
SPECIES TREES; MAXIMUM-LIKELIHOOD; COALESCENT MODEL; DNA-SEQUENCES; HYBRIDIZATION; INTROGRESSION; PHYLOGENOMICS; EVOLUTION; GENOME; LIGHT;
D O I
10.1371/journal.pcbi.1005932
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Phylogenetic networks are rooted, directed, acyclic graphs that model reticulate evolutionary histories. Recently, statistical methods were devised for inferring such networks from either gene tree estimates or the sequence alignments of multiple unlinked loci. Bi-allelic markers, most notably single nucleotide polymorphisms (SNPs) and amplified fragment length polymorphisms (AFLPs), provide a powerful source of genome-wide data. In a recent paper, a method called SNAPP was introduced for statistical inference of species trees from unlinked bi-allelic markers. The generative process assumed by the method combined both a model of evolution for the bi-allelic markers, as well as the multispecies coalescent. A novel component of the method was a polynomial-time algorithm for exact computation of the likelihood of a fixed species tree via integration over all possible gene trees for a given marker. Here we report on a method for Bayesian inference of phylogenetic networks from bi-allelic markers. Our method significantly extends the algorithm for exact computation of phylogenetic network likelihood via integration over all possible gene trees. Unlike the case of species trees, the algorithm is no longer polynomial-time on all instances of phylogenetic networks. Furthermore, the method utilizes a reversible-jump MCMC technique to sample the posterior of phylogenetic networks given bi-allelic marker data. Our method has a very good performance in terms of accuracy and robustness as we demonstrate on simulated data, as well as a data set of multiple New Zealand species of the plant genus Ourisia (Plantaginaceae). We implemented the method in the publicly available, open-source PhyloNet software package.
引用
收藏
页数:32
相关论文
共 18 条
  • [11] Bayesian inference of selection in a heterogeneous environment from genetic time-series data
    Gompert, Zachariah
    MOLECULAR ECOLOGY, 2016, 25 (01) : 121 - 134
  • [12] Identifying genetic markers for a range of phylogenetic utility-From species to family level
    Choi, Bokyung
    Crisp, Michael D.
    Cook, Lyn G.
    Meusemann, Karen
    Edwards, Robert D.
    Toon, Alicia
    Kulheim, Carsten
    PLOS ONE, 2019, 14 (08):
  • [13] Inference of Gene Regulatory Networks from Genetic Perturbations with Linear Regression Model
    Dong, Zijian
    Song, Tiecheng
    Yuan, Chuang
    PLOS ONE, 2013, 8 (12):
  • [14] Genetic divergence and phylogenetic relationship of the rabbitfish Siganus rivulatus inferred from microsatellite and mitochondrial markers
    Hashem, Medhat H.
    Alshaya, Dalal S.
    Jalal, Areej S.
    Abdelsalam, Nader R.
    Abd El-Azeem, Reham M.
    Khaled, Ahmed E.
    Al-Abedi, Amani S.
    Mansour, Abdallah T.
    AlSaqufi, Ahmed S.
    Shafi, Manal E.
    Hassanien, Hesham A.
    JOURNAL OF KING SAUD UNIVERSITY SCIENCE, 2022, 34 (04)
  • [15] NestedBD: Bayesian inference of phylogenetic trees from single-cell copy number profiles under a birth-death model
    Liu, Yushu
    Edrisi, Mohammadamin
    Yan, Zhi
    Ogilvie, Huw A.
    Nakhleh, Luay
    ALGORITHMS FOR MOLECULAR BIOLOGY, 2024, 19 (01)
  • [16] Testing the phylogenetic position of a parasitic plant (Cuscuta, Convolvulaceae, Asteridae):: Bayesian inference and the parametric bootstrap on data drawn from three genomes
    Stefanovic, S
    Olmstead, RG
    SYSTEMATIC BIOLOGY, 2004, 53 (03) : 384 - 399
  • [17] The value of idiosyncratic markers and changes to conserved tRNA sequences from the mitochondrial genome of hard ticks (Acari: Ixodida: Ixodidae) for phylogenetic inference
    Murrell, A
    Campbell, NJH
    Barker, SC
    SYSTEMATIC BIOLOGY, 2003, 52 (03) : 296 - 310
  • [18] TiDeTree: a Bayesian phylogenetic framework to estimate single-cell trees and population dynamic parameters from genetic lineage tracing data
    Seidel, Sophie
    Stadler, Tanja
    PROCEEDINGS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES, 2022, 289 (1986)