Markov-Modulated Continuous-Time Markov Chains to Identify Site- and Branch-Specific Evolutionary Variation in BEAST

被引:9
作者
Baele, Guy [1 ]
Gill, Mandev S. [1 ]
Bastide, Paul [1 ]
Lemey, Philippe [1 ]
Suchard, Marc A. [2 ,3 ,4 ]
机构
[1] Katholieke Univ Leuven, Dept Microbiol Immunol & Transplantat, Rega Inst, Herestr 49, B-3000 Leuven, Belgium
[2] Univ Calif Los Angeles, Jonathan & Karin Fielding Sch Publ Hlth, Dept Biostat, Los Angeles, CA 90095 USA
[3] Univ Calif Los Angeles, David Geffen Sch Med, Dept Biomath, UCLA, Los Angeles, CA 90095 USA
[4] Univ Calif Los Angeles, Dept Human Genet, David Geffen Sch Med, UCLA, Los Angeles, CA 90095 USA
基金
欧洲研究理事会; 英国惠康基金; 美国国家科学基金会; 美国国家卫生研究院;
关键词
Bayesian inference; BEAGLE; BEAST; covarion; heterotachy; Markov-modulated models; phylogenetics; DNA-SEQUENCES; MODEL;
D O I
10.1093/sysbio/syaa037
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Markov models of character substitution on phylogenies form the foundation of phylogenetic inference frameworks. Early models made the simplifying assumption that the substitution process is homogeneous over time and across sites in the molecular sequence alignment. While standard practice adopts extensions that accommodate heterogeneity of substitution rates across sites, heterogeneity in the process over time in a site-specific manner remains frequently overlooked. This is problematic, as evolutionary processes that act at the molecular level are highly variable, subjecting different sites to different selective constraints over time, impacting their substitution behavior. We propose incorporating time variability through Markov-modulated models (MMMs), which extend covarion-like models and allow the substitution process (including relative character exchange rates as well as the overall substitution rate) at individual sites to vary across lineages. We implement a general MMM framework in BEAST, a popular Bayesian phylogenetic inference software package, allowing researchers to compose a wide range of MMMs through flexible XML specification. Using examples from bacterial, viral, and plastid genome evolution, we show that MMMs impact phylogenetic tree estimation and can substantially improve model fit compared to standard substitution models. Through simulations, we show that marginal likelihood estimation accurately identifies the generative model and does not systematically prefer the more parameter-rich MMMs. To mitigate the increased computational demands associated with MMMs, our implementation exploits recent developments in BEAGLE, a high-performance computational library for phylogenetic inference.
引用
收藏
页码:181 / 189
页数:9
相关论文
共 22 条
  • [1] Covarion structure in plastid genome evolution:: A new statistical test
    Ané, C
    Burleigh, JG
    McMahon, MM
    Sanderson, MJ
    [J]. MOLECULAR BIOLOGY AND EVOLUTION, 2005, 22 (04) : 914 - 924
  • [2] BEAGLE 3: Improved Performance, Scaling, and Usability for a High-Performance Computing Library for Statistical Phylogenetics
    Ayres, Daniel
    Cummings, Michael
    Baele, Guy
    Darling, Aaron
    Lewis, Paul
    Swofford, David
    Huelsenbeck, John
    Lemey, Philippe
    Rambaut, Andrew
    Suchard, Marc
    [J]. SYSTEMATIC BIOLOGY, 2019, 68 (06) : 1052 - 1061
  • [3] Genealogical Working Distributions for Bayesian Model Testing with Phylogenetic Uncertainty
    Baele, Guy
    Lemey, Philippe
    Suchard, Marc A.
    [J]. SYSTEMATIC BIOLOGY, 2016, 65 (02) : 250 - 264
  • [4] A Bayesian compound stochastic process for modeling nonstationary and nonhomogeneous sequence evolution
    Blanquart, Samuel
    Lartillot, Nicolas
    [J]. MOLECULAR BIOLOGY AND EVOLUTION, 2006, 23 (11) : 2058 - 2071
  • [5] EMBLEY TM, 1993, SYST APPL MICROBIOL, V16, P25
  • [6] EVOLUTIONARY TREES FROM DNA-SEQUENCES - A MAXIMUM-LIKELIHOOD APPROACH
    FELSENSTEIN, J
    [J]. JOURNAL OF MOLECULAR EVOLUTION, 1981, 17 (06) : 368 - 376
  • [7] THE MARKOV-MODULATED POISSON-PROCESS (MMPP) COOKBOOK
    FISCHER, W
    MEIERHELLSTERN, K
    [J]. PERFORMANCE EVALUATION, 1993, 18 (02) : 149 - 171
  • [8] GASCUEL O, 2007, RECONSTRUCTING EVOLU, P65
  • [9] Modeling the site-specific variation of selection patterns along lineages
    Guindon, S
    Rodrigo, AG
    Dyer, KA
    Huelsenbeck, JP
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2004, 101 (35) : 12957 - 12962
  • [10] A Phylogenetic Approach Finds Abundant Interlocus Gene Conversion in Yeast
    Ji, Xiang
    Griffing, Alexander
    Thorne, Jeffrey L.
    [J]. MOLECULAR BIOLOGY AND EVOLUTION, 2016, 33 (09) : 2469 - 2476