Improving the performance of Bayesian phylogenetic inference under relaxed clock models

被引:28
作者
Zhang, Rong [1 ]
Drummond, Alexei [1 ,2 ]
机构
[1] Univ Auckland, Sch Comp Sci, Auckland, New Zealand
[2] Univ Auckland, Sch Biol Sci, Auckland, New Zealand
关键词
Bayesian MCMC; Bayesian phylogenetics; Proposal kernel; Genetic distances; Divergence times; Evolutionary rates; MOLECULAR EVOLUTION; DIVERGENCE TIMES; DNA-SEQUENCES; PROPOSALS; HISTORY;
D O I
10.1186/s12862-020-01609-4
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background Bayesian MCMC has become a common approach for phylogenetic inference. But the growing size of molecular sequence data sets has created a pressing need to improve the computational efficiency of Bayesian phylogenetic inference algorithms. Results This paper develops a new algorithm to improve the efficiency of Bayesian phylogenetic inference for models that include a per-branch rate parameter. In a Markov chain Monte Carlo algorithm, the presented proposal kernel changes evolutionary rates and divergence times at the same time, under the constraint that the implied genetic distances remain constant. Specifically, the proposal operates on the divergence time of an internal node and the three adjacent branch rates. For the root of a phylogenetic tree, there are three strategies discussed, named Simple Distance, Small Pulley and Big Pulley. Note that Big Pulley is able to change the tree topology, which enables the operator to sample all the possible rooted trees consistent with the implied unrooted tree. To validate its effectiveness, a series of experiments have been performed by implementing the proposed operator in the BEAST2 software. Conclusions The results demonstrate that the proposed operator is able to improve the performance by giving better estimates for a given chain length and by using less running time for a given level of accuracy. Measured by effective samples per hour, use of the proposed operator results in overall mixing more efficient than the current operators in BEAST2. Especially for large data sets, the improvement is up to half an order of magnitude.
引用
收藏
页数:28
相关论文
共 50 条
[41]   An automated framework for fast cognate detection and Bayesian phylogenetic inference in computational historical linguistics [J].
Rama, Taraka ;
List, Johann-Mattis .
57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), 2019, :6225-6235
[42]   Calibrated Birth-Death Phylogenetic Time-Tree Priors for Bayesian Inference [J].
Heled, Joseph ;
Drummond, Alexei J. .
SYSTEMATIC BIOLOGY, 2015, 64 (03) :369-383
[43]   Using Parsimony-Guided Tree Proposals to Accelerate Convergence in Bayesian Phylogenetic Inference [J].
Zhang, Chi ;
Huelsenbeck, John P. ;
Ronquist, Fredrik .
SYSTEMATIC BIOLOGY, 2020, 69 (05) :1016-1032
[44]   Geometric ergodicity of a Metropolis-Hastings algorithm for Bayesian inference of phylogenetic branch lengths [J].
Spade, David A. .
COMPUTATIONAL STATISTICS, 2020, 35 (04) :2043-2076
[45]   DM-PhyClus: a Bayesian phylogenetic algorithm for infectious disease transmission cluster inference [J].
Villandre, Luc ;
Labbe, Aurelie ;
Brenner, Bluma ;
Roger, Michel ;
Stephens, David A. .
BMC BIOINFORMATICS, 2018, 19
[46]   siMBa-a simple graphical user interface for the Bayesian phylogenetic inference program MrBayes [J].
Mishra, Bagdevi ;
Thines, Marco .
MYCOLOGICAL PROGRESS, 2014, 13 (04) :1255-1258
[47]   Bayesian Inference Under the Multispecies Coalescent with Ancient DNA Sequences [J].
Nagel, Anna A. ;
Flouri, Tomas ;
Yang, Ziheng ;
Rannala, Bruce .
SYSTEMATIC BIOLOGY, 2024, 73 (06) :964-978
[48]   Low-Parameter Phylogenetic Inference Under the General Markov Model [J].
Holland, Barbara R. ;
Jarvis, Peter D. ;
Sumner, Jeremy G. .
SYSTEMATIC BIOLOGY, 2013, 62 (01) :78-92
[49]   Classification of molecular sequence data using Bayesian phylogenetic mixture models [J].
Loza-Reyes, E. ;
Hum, M. A. ;
Robinson, A. .
COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2014, 75 :81-95
[50]   Investigating the performance of AIC in selecting phylogenetic models [J].
Jhwueng, Dwueng-Chwuan ;
Huzurbazar, Snehalata ;
O'Meara, Brian C. ;
Liu, Liang .
STATISTICAL APPLICATIONS IN GENETICS AND MOLECULAR BIOLOGY, 2014, 13 (04) :459-475