HyDe: A Python']Python Package for Genome-Scale Hybridization Detection

被引:174
作者
Blischak, Paul D. [1 ]
Chifman, Julia [2 ]
Wolfe, Andrea D. [1 ]
Kubatko, Laura S. [1 ,3 ]
机构
[1] Ohio State Univ, Dept Evolut Ecol & Organismal Biol, 318 W 12th Ave, Columbus, OH 43210 USA
[2] Amer Univ, Dept Math & Stat, Washington, DC 20016 USA
[3] Ohio State Univ, Dept Stat, Columbus, OH 43210 USA
基金
美国国家科学基金会; 美国国家卫生研究院;
关键词
ABBA-BABA; coalescence; gene flow; hybridization; phylogenetic invariants; HYBRID SPECIATION; COALESCENT MODEL; INTROGRESSION; INFERENCE; EVOLUTION; HISTORIES; PHYLOGENY; ADMIXTURE; SEQUENCE; DNA;
D O I
10.1093/sysbio/syy023
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
The analysis of hybridization and gene flow among closely related taxa is a common goal for researchers studying speciation and phylogeography. Many methods for hybridization detection use simple site pattern frequencies from observed genomic data and compare them to null models that predict an absence of gene flow. The theory underlying the detection of hybridization using these site pattern probabilities exploits the relationship between the coalescent process for gene trees within population trees and the process of mutation along the branches of the gene trees. For certain models, site patterns are predicted to occur in equal frequency (i.e., their difference is 0), producing a set of functions called phylogenetic invariants. In this article, we introduce HyDe, a software package for detecting hybridization using phylogenetic invariants arising under the coalescent model with hybridization. HyDe is written in Python and can be used interactively or through the command line using pre-packaged scripts. We demonstrate the use of HyDe on simulated data, as well as on two empirical data sets from the literature. We focus in particular on identifying individual hybrids within population samples and on distinguishing between hybrid speciation and gene flow.
引用
收藏
页码:821 / 829
页数:9
相关论文
共 40 条
[1]  
Allman E. S., 2016, SYST BIOL, V66, P620
[2]   Adaptive Genetic Exchange: A Tangled History of Admixture and Evolutionary Innovation [J].
Arnold, Michael L. ;
Kunte, Krushnamegh .
TRENDS IN ECOLOGY & EVOLUTION, 2017, 32 (08) :601-611
[3]   A genomic view of introgression and hybrid speciation [J].
Baack, Eric J. ;
Rieseberg, Loren H. .
CURRENT OPINION IN GENETICS & DEVELOPMENT, 2007, 17 (06) :513-518
[4]   Cython: The Best of Both Worlds [J].
Behnel, Stefan ;
Bradshaw, Robert ;
Citro, Craig ;
Dalcin, Lisandro ;
Seljebotn, Dag Sverre ;
Smith, Kurt .
COMPUTING IN SCIENCE & ENGINEERING, 2011, 13 (02) :31-39
[5]   INVARIANTS OF PHYLOGENIES IN A SIMPLE CASE WITH DISCRETE STATES [J].
CAVENDER, JA ;
FELSENSTEIN, J .
JOURNAL OF CLASSIFICATION, 1987, 4 (01) :57-71
[6]   Identifiability of the unrooted species tree topology under the coalescent model with time-reversible substitution processes, site-specific rate variation, and invariable sites [J].
Chifman, Julia ;
Kubatko, Laura .
JOURNAL OF THEORETICAL BIOLOGY, 2015, 374 :35-47
[7]   Quartet Inference from SNP Data Under the Coalescent Model [J].
Chifman, Julia ;
Kubatko, Laura .
BIOINFORMATICS, 2014, 30 (23) :3317-3324
[8]   PHYLOGENOMICS REVEALS EXTENSIVE RETICULATE EVOLUTION IN XIPHOPHORUS FISHES [J].
Cui, Rongfeng ;
Schumer, Molly ;
Kruesi, Karla ;
Walter, Ronald ;
Andolfatto, Peter ;
Rosenthal, Gil G. .
EVOLUTION, 2013, 67 (08) :2166-2179
[9]   Testing for Ancient Admixture between Closely Related Populations [J].
Durand, Eric Y. ;
Patterson, Nick ;
Reich, David ;
Slatkin, Montgomery .
MOLECULAR BIOLOGY AND EVOLUTION, 2011, 28 (08) :2239-2252
[10]   Inferring Phylogeny and Introgression using RADseq Data: An Example from Flowering Plants (Pedicularis: Orobanchaceae) [J].
Eaton, Deren A. R. ;
Ree, Richard H. .
SYSTEMATIC BIOLOGY, 2013, 62 (05) :689-706