Phylogenetic estimation error can decrease the accuracy of species delimitation: a Bayesian implementation of the general mixed Yule-coalescent model

Cited by: 462
Authors
Reid, Noah M. [1]
Carstens, Bryan C. [1, 2]
Affiliations
[1] Louisiana State Univ, Dept Biol Sci, Baton Rouge, LA 70803 USA
[2] Ohio State Univ, Dept Ecol Evolut & Organismal Biol, Columbus, OH 43210 USA
Source
BMC EVOLUTIONARY BIOLOGY | 2012 / Vol. 12
Funding
National Science Foundation (USA);
Keywords
Species delimitation; GMYC; Bayesian phylogenetics; DNA barcoding; BARCODING GAP; DNA BARCODES; EVOLUTION; DIVERSITY; MIGRATION; RATES; DIVERSIFICATION; DIVERGENCE; SPECIATION; TAXONOMY;
DOI
10.1186/1471-2148-12-196
Chinese Library Classification number
Q [Biological Sciences];
Discipline classification code
07 ; 0710 ; 09 ;
Abstract
Background: Species are considered the fundamental unit in many ecological and evolutionary analyses, yet accurate, complete, accessible taxonomic frameworks with which to identify them are often unavailable to researchers. In such cases DNA sequence-based species delimitation has been proposed as a means of estimating species boundaries for further analysis. Several methods have been proposed to accomplish this. Here we present a Bayesian implementation of an evolutionary model-based method, the general mixed Yule-coalescent model (GMYC). Our implementation integrates over the parameters of the model and uncertainty in phylogenetic relationships using the output of widely available phylogenetic models and Markov-Chain Monte Carlo (MCMC) simulation in order to produce marginal probabilities of species identities.
Results: We conducted simulations testing the effects of species evolutionary history, levels of intraspecific sampling and number of nucleotides sequenced. We also re-analyze the dataset used to introduce the original GMYC model. We found that the model results are improved with addition of DNA sequence and increased sampling, although these improvements have limits. The most important factor in the success of the model is the underlying phylogenetic history of the species under consideration. Recent and rapid divergences result in higher amounts of uncertainty in the model and eventually cause the model to fail to accurately assess uncertainty in species limits.
Conclusion: Our results suggest that the GMYC model can be useful under a wide variety of circumstances, particularly in cases where divergences are deeper, or taxon sampling is incomplete, as in many studies of ecological communities, but that, in accordance with expectations from coalescent theory, rapid, recent radiations may yield inaccurate results.
Our implementation differs from existing ones in two ways: it accounts for important sources of uncertainty (in the phylogeny and in the parameters specific to the model), and it allows the specification of informative prior distributions that can increase the precision of the model. We have incorporated this model into a user-friendly R package available on the authors' websites.
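The "marginal probabilities of species identities" mentioned in the abstract can be illustrated with a minimal sketch. It assumes the posterior has already been sampled and that each sample is a mapping from sequence name to a species label; the function name `coassignment_probabilities` and the toy data are hypothetical and are not taken from the authors' R package. The marginal probability that two sequences are conspecific is then estimated as the fraction of samples in which they share a label.

```python
from collections import Counter
from itertools import combinations


def coassignment_probabilities(samples):
    """Estimate pairwise conspecificity probabilities from posterior samples.

    `samples` is a list of dicts (hypothetical format), one per MCMC sample,
    each mapping a sequence name to the species label it received.
    """
    names = sorted(samples[0])
    counts = Counter()
    for assignment in samples:
        for a, b in combinations(names, 2):
            # Count the samples in which both sequences fall in one species.
            if assignment[a] == assignment[b]:
                counts[(a, b)] += 1
    n = len(samples)
    return {pair: counts[pair] / n for pair in combinations(names, 2)}


# Toy posterior: four sampled delimitations of three sequences (made-up data).
samples = [
    {"s1": 0, "s2": 0, "s3": 1},
    {"s1": 0, "s2": 0, "s3": 0},
    {"s1": 0, "s2": 1, "s3": 1},
    {"s1": 0, "s2": 0, "s3": 1},
]
probs = coassignment_probabilities(samples)
for pair, p in probs.items():
    print(pair, p)
```

Averaging over sampled delimitations in this way is what lets uncertainty in the tree and in the model parameters carry through to the reported species limits, rather than conditioning on a single point estimate.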
Pages: 11