Accelerating Large-Scale Biological Database Search on Xeon Phi-based Neo-Heterogeneous Architectures

被引:0
作者
Lan, Haidong [1 ]
Liu, Weiguo [1 ]
Schmidt, Bertil [2 ]
Wang, Bingqiang [3 ]
机构
[1] Shandong Univ, Minist Educ, Engn Res Ctr Digital Media Technol, Sch Comp Sci & Technol, Jinan 250100, Peoples R China
[2] Johannes Gutenberg Univ Mainz, D-55122 Mainz, Germany
[3] Natl Supercomp Ctr Shenzhen, Shenzhen, Peoples R China
来源
PROCEEDINGS 2015 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE | 2015年
关键词
Smith-Waterman; neo-heterogeneous architecture; dynamic programming; sequence alignment; Xeon Phi; Knights Corner vector instructions; ALGORITHMS; ALIGNMENT;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In this paper we present new parallelization techniques for searching large-scale biological sequence databases with the Smith-Waterman algorithm on Xeon Phi-based neo-heterogenous architectures. In order to make full use of the compute power of both the multi-core CPU and the many-core Xeon Phi hardware, we use a collaborative computing scheme as well as hybrid parallelism. At the CPU side, we employ SSE intrinsics and multi-threading to implement SIMD parallelism. At the Xeon Phi side, we use Knights Corner vector instructions to gain more data parallelism. We have presented two dynamic task distribution schemes (thread level and device level) in order to achieve better load balancing. Furthermore, a multi-threaded asynchronous scheme is used to overlap communication and computation between CPUs and Xeon Phis. Evaluations on real protein sequence databases show that our method achieves a peak overall performance up to 220 GCUPS on a neo-heterogeneous platform consisting of two Intel ES-2620 CPUs and two Intel Xeon Phi 7110P cards. It also exhibits good scalability in terms of database size and query length. Our implementation is available at: http://turbo0628.github.ioILSBDS/.
引用
收藏
页码:503 / 510
页数:8
相关论文
共 25 条
[1]  
Alpern B, 1995, SUPERCOMP PROC, P536
[2]  
[Anonymous], P 1 IEEE INT WORKSH
[3]  
[Anonymous], 13 ACM INT S FIELD P
[4]  
[Anonymous], BMC RES NOTES
[5]  
[Anonymous], 25 IEEE INT C APPL S
[6]  
[Anonymous], IEEE INT PAR DISTR P
[7]  
[Anonymous], 28 IEEE INT PAR DIST
[8]  
[Anonymous], 2013, BMC BIOINFORMATICS
[9]  
[Anonymous], BMC BIOINFORMATICS
[10]   Computational biology and high-performance computing [J].
Bader, DA .
COMMUNICATIONS OF THE ACM, 2004, 47 (11) :34-41