Evaluating the Adequacy of Molecular Clock Models Using Posterior Predictive Simulations

被引:27
作者
Duchene, David A. [1 ]
Duchene, Sebastian [2 ,3 ]
Holmes, Edward C. [2 ,3 ]
Ho, Simon Y. W. [2 ]
机构
[1] Australian Natl Univ, Res Sch Biol, Canberra, ACT, Australia
[2] Univ Sydney, Sch Biol Sci, Sydney, NSW 2006, Australia
[3] Univ Sydney, Sydney Med Sch, Marie Bashir Inst Infect Dis & Biosecur, Charles Perkins Ctr, Sydney, NSW 2006, Australia
基金
英国医学研究理事会; 澳大利亚研究理事会;
关键词
model adequacy; posterior predictive simulations; Bayesian phylogenetics; molecular clock; evolutionary rates; model selection; PHYLOGENETIC ANALYSIS; SUBSTITUTION; EVOLUTION; INFERENCE; CALIBRATION; CHOICE; UNCERTAINTY; FREQUENTIST; COALESCENT; SELECTION;
D O I
10.1093/molbev/msv154
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Molecular clock models are commonly used to estimate evolutionary rates and timescales from nucleotide sequences. The goal of these models is to account for rate variation among lineages, such that they are assumed to be adequate descriptions of the processes that generated the data. A common approach for selecting a clock model for a data set of interest is to examine a set of candidates and to select the model that provides the best statistical fit. However, this can lead to unreliable estimates if all the candidate models are actually inadequate. For this reason, a method of evaluating absolute model performance is critical. We describe a method that uses posterior predictive simulations to assess the adequacy of clock models. We test the power of this approach using simulated data and find that the method is sensitive to bias in the estimates of branch lengths, which tends to occur when using underparameterized clock models. We also compare the performance of the multinomial test statistic, originally developed to assess the adequacy of substitution models, but find that it has low power in identifying the adequacy of clock models. We illustrate the performance of our method using empirical data sets from coronaviruses, simian immunodeficiency virus, killer whales, and marine turtles. Our results indicate that methods of investigating model adequacy, including the one proposed here, should be routinely used in combination with traditional model selection in evolutionary studies. This will reveal whether a broader range of clock models to be considered in phylogenetic analysis.
引用
收藏
页码:2986 / 2995
页数:10
相关论文
共 50 条
  • [21] Evaluating Predictive Models of Student Success: Closing the Methodological Gap
    Gardner, Josh
    Brooks, Christopher
    JOURNAL OF LEARNING ANALYTICS, 2018, 5 (02): : 105 - 125
  • [22] CALIBRATION AND RANKING OF COARSE-GRAINED MODELS IN MOLECULAR SIMULATIONS USING BAYESIAN FORMALISM
    Meidani, Hadi
    Hooper, Justin B.
    Bedrov, Dmitry
    Kirby, Robert M.
    INTERNATIONAL JOURNAL FOR UNCERTAINTY QUANTIFICATION, 2017, 7 (02) : 99 - 115
  • [23] Accelerated Bayesian Inference for Molecular Simulations using Local Gaussian Process Surrogate Models
    Shanks, Brennon L.
    Sullivan, Harry W.
    Shazed, Abdur R.
    Hoepfner, Michael P.
    JOURNAL OF CHEMICAL THEORY AND COMPUTATION, 2024, 20 (09) : 3798 - 3808
  • [24] Choice of generalized linear mixed models using predictive crossvalidation
    Braun, Julia
    Bove, Daniel Sabanes
    Held, Leonhard
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2014, 75 : 190 - 202
  • [25] Predictive Models Based on Molecular Images and Molecular Descriptors for Drug Screening
    Mamada, Hideaki
    Takahashi, Mari
    Ogino, Mizuki
    Nomura, Yukihiro
    Uesawa, Yoshihiro
    ACS OMEGA, 2023, 8 (40): : 37186 - 37195
  • [26] Comparing Gaussian Graphical Models With the Posterior Predictive Distribution and Bayesian Model Selection
    Williams, Donald R.
    Rast, Philippe
    Pericchi, Luis R.
    Mulder, Joris
    PSYCHOLOGICAL METHODS, 2020, 25 (05) : 653 - 672
  • [27] Detecting Episodic Evolution through Bayesian Inference of Molecular Clock Models
    Tay, John H.
    Baele, Guy
    Duchene, Sebastian
    MOLECULAR BIOLOGY AND EVOLUTION, 2023, 40 (10)
  • [28] A Dirichlet Process Covarion Mixture Model and Its Assessments Using Posterior Predictive Discrepancy Tests
    Zhou, Yan
    Brinkmann, Henner
    Rodrigue, Nicolas
    Lartillot, Nicolas
    Philippe, Herve
    MOLECULAR BIOLOGY AND EVOLUTION, 2010, 27 (02) : 371 - 384
  • [29] ClockstaR: choosing the number of relaxed-clock models in molecular phylogenetic analysis
    Duchene, Sebastian
    Molak, Martyna
    Ho, Simon Y. W.
    BIOINFORMATICS, 2014, 30 (07) : 1017 - 1019
  • [30] Application of Bayesian framework for evaluation of streamflow simulations using multiple climate models
    Achieng, Kevin O.
    Zhu, Jianting
    JOURNAL OF HYDROLOGY, 2019, 574 : 1110 - 1128