A scalable multiple pairwise protein sequence alignment acceleration using hybrid CPU-GPU approach

被引:11
作者
Alawneh, Luay [1 ]
Shehab, Mohammed A. [2 ]
Al-Ayyoub, Mahmoud [1 ]
Jararweh, Yaser [1 ]
Al-Sharif, Ziad A. [1 ]
机构
[1] Jordan Univ Sci & Technol, Irbid, Jordan
[2] Concordia Univ, Montreal, PQ, Canada
来源
CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS | 2020年 / 23卷 / 04期
关键词
Bioinformatics; Needleman-Wunsch; Smith-Waterman; Parallel programming; Dynamic parallelism; CUDA; SMITH-WATERMAN ALGORITHM; SIMILARITY; CLUSTALW; SEARCH; MAFFT; BLAST;
D O I
10.1007/s10586-019-03035-8
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Bioinformatics is an interdisciplinary field that applies trending techniques in information technology, mathematics, and statistics in studying large biological data. Bioinformatics involves several computational techniques such as sequence and structural alignment, data mining, macromolecular geometry, prediction of protein structure and gene finding. Protein structure and sequence analysis are vital to the understanding of cellular processes. Understanding cellular processes contributes to the development of drugs for metabolic pathways. Protein sequence alignment is concerned with identifying the similarities and the relationships among different protein structures. In this paper, we target two well-known protein sequence alignment algorithms, the Needleman-Wunsch and the Smith-Waterman algorithms. These two algorithms are computationally expensive which hinders their applicability for large data sets. Thus, we propose a hybrid parallel approach that combines the capabilities of multi-core CPUs and the power of contemporary GPUs, and significantly speeds up the execution of the target algorithms. The validity of our approach is tested on real protein sequences. Moreover, the scalability of the approach is verified on randomly generated sequences with predefined similarity levels. The results showed that the proposed hybrid approach was up to 242 times faster than the sequential approach.
引用
收藏
页码:2677 / 2688
页数:12
相关论文
共 45 条
[1]  
Abualigah LM, 2016, INT CONF COMP SCI, P1, DOI DOI 10.1109/CSIT.2016.7549467
[2]  
Al-Ayyoub M, 2016, MULTIMED TOOLS APP, P1
[3]  
Alberts B., 2002, Molecular Biology of the Cell
[4]   BASIC LOCAL ALIGNMENT SEARCH TOOL [J].
ALTSCHUL, SF ;
GISH, W ;
MILLER, W ;
MYERS, EW ;
LIPMAN, DJ .
JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) :403-410
[5]  
[Anonymous], 2017, CISC VIS NETW IND GL, V6
[6]  
[Anonymous], 2014, Professional CUDA C Programming
[7]  
[Anonymous], 2002, CISC VIS NETW IND GL, V6
[8]  
[Anonymous], 2012, CUDA PROGRAMMING DEV
[9]  
[Anonymous], 2012, TECHNICAL REPORTS
[10]   LAGAN and Multi-LAGAN: Efficient tools for large-scale multiple alignment of genomic DNA [J].
Brudno, M ;
Do, CB ;
Cooper, GM ;
Kim, MF ;
Davydov, E ;
Green, ED ;
Sidow, A ;
Batzoglou, S .
GENOME RESEARCH, 2003, 13 (04) :721-731