Systematic Exploration of the High Likelihood Set of Phylogenetic Tree Topologies

被引:5
作者
Whidden, Chris [1 ]
Claywell, Brian C. [1 ]
Fisher, Thayer [2 ]
Magee, Andrew F. [3 ]
Fourment, Mathieu [4 ]
Matsen, Frederick A. [1 ]
机构
[1] Fred Hutchinson Canc Res Ctr, 1100 Fairview Ave N,Mail Stop M1-B514, Seattle, WA 98109 USA
[2] Univ Washington, Dept Biostat, Seattle, WA 98195 USA
[3] Univ Washington, Dept Biol, Seattle, WA 98195 USA
[4] Univ Technol Sydney, Ithree Inst, Sydney, NSW, Australia
基金
美国国家卫生研究院; 美国国家科学基金会;
关键词
Bayesian phylogenetics; consensus trees; phylogenetic islands; phylogenetic tree topology; systematic search; MULTIPLE GENE LOCI; MOLECULAR PHYLOGENY; PROPOSALS; INFERENCE; SURFACE; MODELS; TOOL;
D O I
10.1093/sysbio/syz047
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Bayesian Markov chain Monte Carlo explores tree space slowly, in part because it frequently returns to the same tree topology. An alternative strategy would be to explore tree space systematically, and never return to the same topology. In this article, we present an efficient parallelized method to map out the high likelihood set of phylogenetic tree topologies via systematic search, which we show to be a good approximation of the high posterior set of tree topologies on the data sets analyzed. Here, "likelihood" of a topology refers to the tree likelihood for the corresponding tree with optimized branch lengths. We call this method "phylogenetic topographer" (PT). The PT strategy is very simple: starting in a number of local topology maxima (obtained by hill-climbing from random starting points), explore out using local topology rearrangements, only continuing through topologies that are better than some likelihood threshold below the best observed topology. We show that the normalized topology likelihoods are a useful proxy for the Bayesian posterior probability of those topologies. By using a nonblocking hash table keyed on unique representations of tree topologies, we avoid visiting topologies more than once across all concurrent threads exploring tree space. We demonstrate that PT can be used directly to approximate a Bayesian consensus tree topology. When combined with an accurate means of evaluating per-topology marginal likelihoods, PT gives an alternative procedure for obtaining Bayesian posterior distributions on phylogenetic tree topologies.
引用
收藏
页码:280 / 293
页数:14
相关论文
共 41 条
[1]  
[Anonymous], 2017, Proceedings of the 34th International Conference on Machine Learning
[2]  
[Anonymous], 2011, DRAFT MRBAYES VERSIO
[3]  
[Anonymous], 2014, EFFECT MILD MOXIBUST
[4]  
[Anonymous], 2006, GENETIC ALGORITHM AP
[5]   Variational Inference: A Review for Statisticians [J].
Blei, David M. ;
Kucukelbir, Alp ;
McAuliffe, Jon D. .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2017, 112 (518) :859-877
[6]  
Dobra A., 2009, STAT METHODOL, DOI [DOI 10.1016/J.STAMET.2009.04.002, DOI 10.1016/j.stamet.2009.04.002]
[7]  
Flouri T., 2018, LIBPLL
[8]   19 Dubious Ways to Compute the Marginal Likelihood of a Phylogenetic Tree Topology [J].
Fourment, Mathieu ;
Magee, Andrew F. ;
Whidden, Chris ;
Bilge, Arman ;
Matsen, Frederick A. ;
Minin, Vladimir N. .
SYSTEMATIC BIOLOGY, 2020, 69 (02) :209-220
[9]   Molecular evidence for Acanthocephala as a subtaxon of Rotifera [J].
Garey, JR ;
Near, TJ ;
Nonnemacher, MR ;
Nadler, SA .
JOURNAL OF MOLECULAR EVOLUTION, 1996, 43 (03) :287-292
[10]  
HEDGES SB, 1990, MOL BIOL EVOL, V7, P607