Evaluating the Adequacy of Molecular Clock Models Using Posterior Predictive Simulations

Cited by: 27
Authors
Duchene, David A. [1]
Duchene, Sebastian [2, 3]
Holmes, Edward C. [2, 3]
Ho, Simon Y. W. [2]
Affiliations
[1] Australian Natl Univ, Res Sch Biol, Canberra, ACT, Australia
[2] Univ Sydney, Sch Biol Sci, Sydney, NSW 2006, Australia
[3] Univ Sydney, Sydney Med Sch, Marie Bashir Inst Infect Dis & Biosecur, Charles Perkins Ctr, Sydney, NSW 2006, Australia
Funding
UK Medical Research Council; Australian Research Council
Keywords
model adequacy; posterior predictive simulations; Bayesian phylogenetics; molecular clock; evolutionary rates; model selection; PHYLOGENETIC ANALYSIS; SUBSTITUTION; EVOLUTION; INFERENCE; CALIBRATION; CHOICE; UNCERTAINTY; FREQUENTIST; COALESCENT; SELECTION;
DOI
10.1093/molbev/msv154
Chinese Library Classification
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification codes
071010; 081704
Abstract
Molecular clock models are commonly used to estimate evolutionary rates and timescales from nucleotide sequences. The goal of these models is to account for rate variation among lineages, such that they are assumed to be adequate descriptions of the processes that generated the data. A common approach for selecting a clock model for a data set of interest is to examine a set of candidates and to select the model that provides the best statistical fit. However, this can lead to unreliable estimates if all the candidate models are actually inadequate. For this reason, a method of evaluating absolute model performance is critical. We describe a method that uses posterior predictive simulations to assess the adequacy of clock models. We test the power of this approach using simulated data and find that the method is sensitive to bias in the estimates of branch lengths, which tends to occur when using underparameterized clock models. We also compare the performance of the multinomial test statistic, originally developed to assess the adequacy of substitution models, but find that it has low power in identifying the adequacy of clock models. We illustrate the performance of our method using empirical data sets from coronaviruses, simian immunodeficiency virus, killer whales, and marine turtles. Our results indicate that methods of investigating model adequacy, including the one proposed here, should be routinely used in combination with traditional model selection in evolutionary studies. This will reveal whether a broader range of clock models needs to be considered in phylogenetic analysis.
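The general logic of a posterior predictive check can be sketched in a few lines. The example below is a generic illustration, not the authors' implementation: it assumes that a test statistic has already been computed once on the empirical data and once on each data set simulated from the posterior. The function names, the two-tailed decision rule, and the `alpha` threshold are all hypothetical choices made for illustration.

```python
from typing import Sequence


def posterior_predictive_p_value(observed: float, simulated: Sequence[float]) -> float:
    """Return the fraction of simulated statistics that are >= the observed one."""
    if len(simulated) == 0:
        raise ValueError("need at least one simulated statistic")
    return sum(1 for s in simulated if s >= observed) / len(simulated)


def looks_adequate(observed: float, simulated: Sequence[float], alpha: float = 0.05) -> bool:
    """Hypothetical two-tailed rule: flag the model as inadequate when the
    observed statistic falls in either extreme tail of the simulated values."""
    p = posterior_predictive_p_value(observed, simulated)
    return alpha / 2 <= p <= 1 - alpha / 2
```

In this sketch, an observed statistic that sits near the middle of the simulated distribution gives a p-value close to 0.5, while a value far outside the simulated range gives a p-value near 0 or 1 and would be flagged as evidence that the model is inadequate.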
Pages: 2986-2995
Page count: 10
Related papers
50 in total
  • [1] Assessing the Adequacy of Morphological Models Using Posterior Predictive Simulations
    Mulvey, Laura P. A.
    May, Michael R.
    Brown, Jeremy M.
    Hoehna, Sebastian
    Wright, April M.
    Warnock, Rachel C. M.
    SYSTEMATIC BIOLOGY, 2024, : 34 - 52
  • [2] Phylodynamic Model Adequacy Using Posterior Predictive Simulations
    Duchene, Sebastian
    Bouckaert, Remco
    Duchene, David A.
    Stadler, Tanja
    Drummond, Alexei J.
    SYSTEMATIC BIOLOGY, 2019, 68 (02) : 358 - 364
  • [3] Assessing the performance of DNA barcoding using posterior predictive simulations
    Barley, Anthony J.
    Thomson, Robert C.
    MOLECULAR ECOLOGY, 2016, 25 (09) : 1944 - 1957
  • [4] Testing the molecular clock using mechanistic models of fossil preservation and molecular evolution
    Warnock, Rachel C. M.
    Yang, Ziheng
    Donoghue, Philip C. J.
    PROCEEDINGS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES, 2017, 284 (1857)
  • [5] Assessing model adequacy for Bayesian Skyline plots using posterior predictive simulation
    Fonseca, Emanuel M.
    Duckett, Drew J.
    Almeida, Filipe G.
    Smith, Megan L.
    Thome, Maria Tereza C.
    Carstens, Bryan C.
    PLOS ONE, 2022, 17 (07):
  • [6] A framework for evaluating predictive models
    Tan, Yee-Leng
    Saffari, Seyed Ehsan
    Tan, Nigel Choon Kiat
    JOURNAL OF CLINICAL EPIDEMIOLOGY, 2022, 150 : 188 - 190
  • [7] Using multiple relaxed-clock models to estimate evolutionary timescales from DNA sequence data
    Duchene, Sebastian
    Ho, Simon Y. W.
    MOLECULAR PHYLOGENETICS AND EVOLUTION, 2014, 77 : 65 - 70
  • [8] Evaluating molecular clock calibrations using Bayesian analyses with soft and hard bounds
    Sanders, Kate L.
    Lee, Michael S. Y.
    BIOLOGY LETTERS, 2007, 3 (03) : 275 - 279
  • [9] Evaluating the predictive abilities of community occupancy models using AUC while accounting for imperfect detection
    Zipkin, Elise F.
    Grant, Evan H. Campbell
    Fagan, William F.
    ECOLOGICAL APPLICATIONS, 2012, 22 (07) : 1962 - 1972
  • [10] Detection of Implausible Phylogenetic Inferences Using Posterior Predictive Assessment of Model Fit
    Brown, Jeremy M.
    SYSTEMATIC BIOLOGY, 2014, 63 (03) : 334 - 348