Improving the performance of Bayesian phylogenetic inference under relaxed clock models

Cited by: 28
Authors
Zhang, Rong [1]
Drummond, Alexei [1, 2]
Affiliations
[1] Univ Auckland, Sch Comp Sci, Auckland, New Zealand
[2] Univ Auckland, Sch Biol Sci, Auckland, New Zealand
Keywords
Bayesian MCMC; Bayesian phylogenetics; Proposal kernel; Genetic distances; Divergence times; Evolutionary rates; MOLECULAR EVOLUTION; DIVERGENCE TIMES; DNA-SEQUENCES; PROPOSALS; HISTORY
DOI
10.1186/s12862-020-01609-4
Chinese Library Classification (CLC) number
Q [Biological Sciences]
Discipline classification code
07; 0710; 09
Abstract
Background: Bayesian MCMC has become a common approach for phylogenetic inference, but the growing size of molecular sequence data sets has created a pressing need to improve the computational efficiency of Bayesian phylogenetic inference algorithms.
Results: This paper develops a new algorithm to improve the efficiency of Bayesian phylogenetic inference for models that include a per-branch rate parameter. Within a Markov chain Monte Carlo algorithm, the presented proposal kernel changes evolutionary rates and divergence times simultaneously, under the constraint that the implied genetic distances remain constant. Specifically, the proposal operates on the divergence time of an internal node and the three adjacent branch rates. For the root of a phylogenetic tree, three strategies are discussed, named Simple Distance, Small Pulley and Big Pulley. Big Pulley can change the tree topology, which enables the operator to sample all possible rooted trees consistent with the implied unrooted tree. To validate the operator's effectiveness, a series of experiments was performed with an implementation in the BEAST2 software.
Conclusions: The results demonstrate that the proposed operator improves performance, giving better estimates for a given chain length and requiring less running time for a given level of accuracy. Measured by effective samples per hour, the proposed operator yields more efficient overall mixing than the current operators in BEAST2. The improvement is most pronounced for large data sets, where it is up to half an order of magnitude.
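The internal-node move described in the abstract can be illustrated with a minimal sketch. It is not the BEAST2 implementation: the function name, the uniform sliding window, and the argument names are hypothetical, and the log-Jacobian term is simply what the change of variables implies for this sketch. Times are node ages, so the parent is older than the node and the node is older than its two children.

```python
import math
import random


def constant_distance_move(t, t_parent, t_child1, t_child2,
                           r, r_child1, r_child2,
                           window=0.1, rng=random):
    """Propose a new age for an internal node and rescale the three
    adjacent branch rates so that each genetic distance is unchanged.

    Hypothetical sketch: names and the uniform window are assumptions.
    Returns None when the proposed age violates the tree ordering.
    """
    # Genetic distance on a branch = rate * duration.
    d_parent = r * (t_parent - t)         # branch above the node
    d_child1 = r_child1 * (t - t_child1)  # branch to the first child
    d_child2 = r_child2 * (t - t_child2)  # branch to the second child

    # Slide the node age by a symmetric uniform step.
    t_new = t + rng.uniform(-window, window)
    if not (max(t_child1, t_child2) < t_new < t_parent):
        return None  # proposal rejected: node must stay between children and parent

    # Recompute the rates so that rate * new duration equals the old distance.
    r_new = d_parent / (t_parent - t_new)
    r_child1_new = d_child1 / (t_new - t_child1)
    r_child2_new = d_child2 / (t_new - t_child2)

    # Log of the Jacobian of (t, r, r1, r2) -> (t', r', r1', r2'),
    # which enters the acceptance ratio because the rates are rescaled.
    log_jacobian = (math.log((t_parent - t) / (t_parent - t_new))
                    + math.log((t - t_child1) / (t_new - t_child1))
                    + math.log((t - t_child2) / (t_new - t_child2)))

    return t_new, (r_new, r_child1_new, r_child2_new), log_jacobian
```

Because the three distances are held fixed, the sequence likelihood is unchanged by such a move in this sketch, so acceptance would depend only on the priors on rates and times and on the Jacobian term.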
Pages: 28