MrBayes sMC3: Accelerating Bayesian inference of phylogenetic trees

被引:2
作者
Kuan, Lidia [1 ]
Pratas, Frederico [1 ]
Sousa, Leonel [1 ]
Tomas, Pedro [1 ]
机构
[1] Univ Lisbon, Inst Super Tecn, INESC ID, Rua Alves Redol 9, P-1000029 Lisbon, Portugal
关键词
MrBayes; GPU; CUDA; phylogenetic inference; MAXIMUM-LIKELIHOOD; DNA-SEQUENCES; INSECTA; MODELS;
D O I
10.1177/1094342016652461
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
MrBayes is a popular software package for Bayesian phylogenetic inference, which uses an iterative approach to derive an evolutionary tree for a collection of species whose DNA sequences are known. Computationally, MrBayes is characterized by a large number of iterations, each composed of a set of tasks that isolated are not very time-consuming, but are globally computationally demanding. To accelerate the latest MrBayes 3.2, this paper presents MrBayes sMC(3), which relies on the computational power of an heterogeneous CPU+GPU platform. For this, MrBayes sMC(3) exploits both task and data-level parallelism while minimizing the overheads associated with kernel launches and CPU-GPU data transfers. Experimental results indicate that the proposed parallel approach, together with the proposed set of optimizations, allow for an application acceleration of up to 10x regarding the original MrBayes, and up to 3x regarding the Beagle Library. Furthermore, by analyzing the convergence rate of MrBayes sMC(3) with that of the state-of-the-art approaches, a significant reduction in execution time is observed.
引用
收藏
页码:246 / 265
页数:20
相关论文
共 29 条
[1]  
[Anonymous], 2005, PHYLIP (phylogeny inference package) version 3.6
[2]  
Ayres DL, 2012, SYST BIOL, V61, P170, DOI [10.1093/sysbio/syr100, 10.1093/sysbio/sys029]
[3]   Efficient Implementation of MrBayes on Multi-GPU [J].
Bao, Jie ;
Xia, Hongju ;
Zhou, Jianfu ;
Liu, Xiaoguang ;
Wang, Gang .
MOLECULAR BIOLOGY AND EVOLUTION, 2013, 30 (06) :1471-1479
[4]   EVOLUTIONARY TREES FROM DNA-SEQUENCES - A MAXIMUM-LIKELIHOOD APPROACH [J].
FELSENSTEIN, J .
JOURNAL OF MOLECULAR EVOLUTION, 1981, 17 (06) :368-376
[5]   Performance models for asynchronous data transfers on consumer Graphics Processing Units [J].
Gomez-Luna, Juan ;
Maria Gonzalez-Linares, Jose ;
Ignacio Benavides, Jose ;
Guil, Nicolas .
JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2012, 72 (09) :1117-1126
[6]   Multi-Kernel Auto-Tuning on GPUs: Performance and Energy-Aware Optimization [J].
Guerreiro, Joao ;
Ilic, Aleksandar ;
Roma, Nuno ;
Tomas, Pedro .
23RD EUROMICRO INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED, AND NETWORK-BASED PROCESSING (PDP 2015), 2015, :438-445
[7]  
Harris Mark, 2007, NVIDIA DEV TECHNOLOG, V2, P4
[8]   MONTE-CARLO SAMPLING METHODS USING MARKOV CHAINS AND THEIR APPLICATIONS [J].
HASTINGS, WK .
BIOMETRIKA, 1970, 57 (01) :97-&
[9]   MRBAYES: Bayesian inference of phylogenetic trees [J].
Huelsenbeck, JP ;
Ronquist, F .
BIOINFORMATICS, 2001, 17 (08) :754-755
[10]   Cache-aware Roofline model: Upgrading the loft [J].
Ilic, Aleksandar ;
Pratas, Frederico ;
Sousa, Leonel .
IEEE COMPUTER ARCHITECTURE LETTERS, 2014, 13 (01) :21-24