Impact of Model Violations on the Inference of Species Boundaries Under the Multispecies Coalescent

Cited by: 73
Authors
Barley, Anthony J. [1]
Brown, Jeremy M. [2,3]
Thomson, Robert C. [1]
Affiliations
[1] Univ Hawaii, Dept Biol, 2538 McCarthy Mall,Edmondson Hall 216, Honolulu, HI 96822 USA
[2] Louisiana State Univ, Dept Biol Sci, 202 Life Sci Bldg, Baton Rouge, LA 70803 USA
[3] Louisiana State Univ, Museum Nat Sci, 202 Life Sci Bldg, Baton Rouge, LA 70803 USA
Funding
National Science Foundation (USA)
Keywords
Population structure; gene flow; demographic changes; posterior prediction; simulation; genetics; GENE FLOW; TREE ESTIMATION; DELIMITATION; DIVERSIFICATION; SIMULATIONS; STATISTICS; DIVERGENCE; SPECIATION; SELECTION; PROGRAM
DOI
10.1093/sysbio/syx073
Chinese Library Classification number
Q [Biological Sciences]
Discipline classification codes
07; 0710; 09
Abstract
The use of genetic data for identifying species-level lineages across the tree of life has received increasing attention in the field of systematics over the past decade. The multispecies coalescent (MSC) model provides a framework for understanding the process of lineage divergence and has become widely adopted for delimiting species. However, because these studies lack an explicit assessment of model fit, in many cases the accuracy of the inferred species boundaries is unknown. This is concerning given the large amount of empirical data and theory that highlight the complexity of the speciation process. Here, we seek to fill this gap by using simulation to characterize the sensitivity of inference under the MSC to several violations of model assumptions thought to be common in empirical data. We also assess the fit of the MSC model to empirical data in the context of species delimitation. Our results show substantial variation in model fit across data sets. Posterior predictive tests find the poorest model performance in data sets that were hypothesized to be impacted by model violations. We also show that while inferences assuming the MSC are robust to minor model violations, such inferences can be biased under some biologically plausible scenarios. Taken together, these results suggest that researchers can identify individual data sets in which species delimitation under the MSC is likely to be problematic, thereby highlighting the cases where additional lines of evidence to identify species boundaries are particularly important to collect. Our study supports a growing body of work highlighting the importance of model checking in phylogenetics, and the usefulness of tailoring tests of model fit to assess the reliability of particular inferences.
Pages: 269-284
Number of pages: 16