On the importance of assessing topological convergence in Bayesian phylogenetic inference

被引:1
作者
Brusselmans, Marius [1 ]
Carvalho, Luiz Max [2 ]
L. Hong, Samuel [1 ]
Gao, Jiansi [3 ]
Matsen, Frederick A. [4 ,5 ,6 ]
Rambaut, Andrew [7 ]
Lemey, Philippe [1 ]
Suchard, Marc A. [8 ]
Dudas, Gytis [9 ]
Baele, Guy [1 ]
机构
[1] Katholieke Univ Leuven, Rega Inst, Dept Microbiol Immunol & Transplantat, Leuven, Belgium
[2] Getulio Vargas Fdn, Sch Appl Math, Praia Botafogo 190, BR-22250900 Rio De Janeiro, Brazil
[3] Fred Hutchinson Canc Ctr, Computat Biol Program, Seattle, WA 98109 USA
[4] Howard Hughes Med Inst, Fred Hutchinson Canc Res Ctr, Computat Biol Program, Seattle, WA USA
[5] Univ Washington, Dept Genome Sci, Seattle, WA USA
[6] Univ Washington, Dept Stat, Seattle, WA USA
[7] Univ Edinburgh, Inst Ecol & Evolut, Edinburgh EH9 3FL, Scotland
[8] Univ Calif Los Angeles, Fielding Sch Publ Hlth, Dept Biostat, Los Angeles, CA 90095 USA
[9] Vilnius Univ, Inst Biotechnol, Life Sci Ctr, Vilnius, Lithuania
基金
欧盟地平线“2020”; 英国惠康基金; 美国国家卫生研究院; 欧洲研究理事会;
关键词
effective sample size; topologies; Bayesian inference; phylogenetics; phylodynamics; convergence; mixing; EBOV; HIV; ALGORITHMS;
D O I
10.1093/ve/veae081
中图分类号
Q93 [微生物学];
学科分类号
071005 ; 100705 ;
摘要
Modern phylogenetics research is often performed within a Bayesian framework, using sampling algorithms such as Markov chain Monte Carlo (MCMC) to approximate the posterior distribution. These algorithms require careful evaluation of the quality of the gener-ated samples. Within the field of phylogenetics, one frequently adopted diagnostic approach is to evaluate the effective sample size and to investigate trace graphs of the sampled parameters. A major limitation of these approaches is that they are developed for continuous parameters and therefore incompatible with a crucial parameter in these inferences: the tree topology. Several recent advancements have aimed at extending these diagnostics to topological space. In this reflection paper, we present two case studies-one on Ebola virus and one on HIV-illustrating how these topological diagnostics can contain information not found in standard diagnostics, and how decisions regarding which of these diagnostics to compute can impact inferences regarding MCMC convergence and mixing. Our results show the importance of running multiple replicate analyses and of carefully assessing topological convergence using the output of these replicate analyses. To this end, we illustrate different ways of assessing and visualizing the topological convergence of these replicates. Given the major importance of detecting convergence and mixing issues in Bayesian phylogenetic analyses, the lack of a unified approach to this problem warrants further action, especially now that additional tools are becoming available to researchers.
引用
收藏
页数:13
相关论文
共 26 条
[1]  
[Anonymous], 2023, R Foundation for Statistical Computing
[2]   Phylogenetic and phylodynamic approaches to understanding and combating the early SARS-CoV-2 pandemic [J].
Attwood, Stephen W. ;
Hill, Sarah C. ;
Aanensen, David M. ;
Connor, Thomas R. ;
Pybus, Oliver G. .
NATURE REVIEWS GENETICS, 2022, 23 (09) :547-562
[3]  
Brooks S, 2011, CH CRC HANDB MOD STA, pXIX
[4]   A Metric on Phylogenetic Tree Shapes [J].
Colijn, C. ;
Plazzotta, G. .
SYSTEMATIC BIOLOGY, 2018, 67 (01) :113-126
[5]   Virus genomes reveal factors that spread and sustained the Ebola epidemic [J].
Dudas, Gytis ;
Carvalho, Luiz Max ;
Bedford, Trevor ;
Tatem, Andrew J. ;
Baele, Guy ;
Faria, Nuno R. ;
Park, Daniel J. ;
Ladner, Jason T. ;
Arias, Armando ;
Asogun, Danny ;
Bielejec, Filip ;
Caddy, Sarah L. ;
Cotten, Matthew ;
D'Ambrozio, Jonathan ;
Dellicour, Simon ;
Di Caro, Antonino ;
Diclaro, Joseph W. ;
Duraffour, Sophie ;
Elmore, Michael J. ;
Fakoli, Lawrence S. ;
Faye, Ousmane ;
Gilbert, Merle L. ;
Gevao, Sahr M. ;
Gire, Stephen ;
Gladden-Young, Adrianne ;
Gnirke, Andreas ;
Goba, Augustine ;
Grant, Donald S. ;
Haagmans, Bart L. ;
Hiscox, Julian A. ;
Jah, Umaru ;
Kugelman, Jeffrey R. ;
Liu, Di ;
Lu, Jia ;
Malboeuf, Christine M. ;
Mate, Suzanne ;
Matthews, David A. ;
Matranga, Christian B. ;
Meredith, Luke W. ;
Qu, James ;
Quick, Joshua ;
Pas, Suzan D. ;
Phan, My V. T. ;
Pollakis, Georgios ;
Reusken, Chantal B. ;
Sanchez-Lockhart, Mariano ;
Schaffner, Stephen F. ;
Schieffelin, John S. ;
Sealfon, Rachel S. ;
Simon-Loriere, Etienne .
NATURE, 2017, 544 (7650) :309-+
[6]   Convergence assessment for Bayesian phylogenetic analysis using MCMC simulation [J].
Fabreti, Luiza Guimaraes ;
Hoehna, Sebastian .
METHODS IN ECOLOGY AND EVOLUTION, 2022, 13 (01) :77-90
[7]   GRAPH DRAWING BY FORCE-DIRECTED PLACEMENT [J].
FRUCHTERMAN, TMJ ;
REINGOLD, EM .
SOFTWARE-PRACTICE & EXPERIENCE, 1991, 21 (11) :1129-1164
[8]   Improving Bayesian Population Dynamics Inference: A Coalescent-Based Model for Multiple Loci [J].
Gill, Mandev S. ;
Lemey, Philippe ;
Faria, Nuno R. ;
Rambaut, Andrew ;
Shapiro, Beth ;
Suchard, Marc A. .
MOLECULAR BIOLOGY AND EVOLUTION, 2013, 30 (03) :713-724
[9]   In Search of Covariates of HIV-1 Subtype B Spread in the United States-A Cautionary Tale of Large-Scale Bayesian Phylogeography [J].
Hong, Samuel L. ;
Dellicour, Simon ;
Vrancken, Bram ;
Suchard, Marc A. ;
Pyne, Michael T. ;
Hillyard, David R. ;
Lemey, Philippe ;
Baele, Guy .
VIRUSES-BASEL, 2020, 12 (02)
[10]   Mapping Phylogenetic Trees to Reveal Distinct Patterns of Evolution [J].
Kendall, Michelle ;
Colijn, Caroline .
MOLECULAR BIOLOGY AND EVOLUTION, 2016, 33 (10) :2735-2743