A scalable parallel algorithm for global sequence alignment with customizable scoring scheme

被引:3
作者
Sadiq, Muhammad Umair [1 ,2 ]
Yousaf, Muhammad Murtaza [3 ]
机构
[1] Govt Coll Univ, Dept Comp Sci, Lahore, Punjab, Pakistan
[2] Univ Punjab, Fac Comp & Informat Technol, Lahore, Punjab, Pakistan
[3] Univ Punjab, Fac Comp & Informat Technol, Dept Software Engn, Lahore, Punjab, Pakistan
关键词
dynamic programming; global alignment; GPUs; high-performance computing (HPC); scoring scheme; sequence alignment; SPACE;
D O I
10.1002/cpe.7888
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Sequence alignment is a critical computational problem in various domains, including genomics, proteomics, and natural language processing. The Needleman-Wunsch (NW) algorithm is a classical dynamic programming approach for finding the optimal global alignment between two sequences. However, its quadratic time and space complexity make it impractical for aligning large-scale sequences, which are increasingly common in modern applications. In this article, we propose a parallel variation of the NW algorithm that enables scalable global sequence alignment with customizable scoring schemes. Our approach re-formulates the dependencies in the NW algorithm to enable parallel execution, thereby leveraging the computational power of modern parallel architectures, such as graphics processing unit (GPU). Furthermore, our algorithm supports arbitrary linear scoring schemes, which allows us to use domain-specific knowledge to improve alignment accuracy. We establish the correctness of our algorithm and evaluate its performance using real DNA and user trajectory sequences on GPUs. Our parallel algorithm has shown impressive results in our experiments, with a peak performance of 27.99 GCUPS (giga cell updates per second) and a maximum speedup of 48.18 times compared to the traditional sequential implementation. Additionally, our algorithm demonstrates remarkable scalability, enabling the alignment of sequences of any length while ensuring balanced work distribution and optimal utilization of resources. Our primary objective is to harness the computational capabilities of a single GPU and fully utilize the processing power of multi-core CPUs.
引用
收藏
页数:22
相关论文
共 67 条
[1]   Sequence analysis and optimal matching methods in sociology - Review and prospect [J].
Abbott, A ;
Tsay, A .
SOCIOLOGICAL METHODS & RESEARCH, 2000, 29 (01) :3-33
[2]   A scalable multiple pairwise protein sequence alignment acceleration using hybrid CPU-GPU approach [J].
Alawneh, Luay ;
Shehab, Mohammed A. ;
Al-Ayyoub, Mahmoud ;
Jararweh, Yaser ;
Al-Sharif, Ziad A. .
CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2020, 23 (04) :2677-2688
[3]   Parallel biological sequence comparison using prefix computations [J].
Aluru, S ;
Futamura, N ;
Mehrotra, K .
IPPS/SPDP 1999: 13TH INTERNATIONAL PARALLEL PROCESSING SYMPOSIUM & 10TH SYMPOSIUM ON PARALLEL AND DISTRIBUTED PROCESSING, PROCEEDINGS, 1999, :653-659
[4]  
Alves CER, 2003, LECT NOTES COMPUT SC, V2668, P249
[5]  
[Anonymous], 2015, 2015 IEEE ACS 12 INT
[6]  
[Anonymous], Information, National Center for Biotechnology Pubchem Element Summary for Atomicnumber 33, Arsenic
[7]   Modified Needleman-Wunsch algorithm for clinical pathway clustering [J].
Aspland, Emma ;
Harper, Paul R. ;
Gartner, Daniel ;
Webb, Philip ;
Barrett-Lee, Peter .
JOURNAL OF BIOMEDICAL INFORMATICS, 2021, 115
[8]   Post-correction of OCR Errors Using PyEnchant Spelling Suggestions Selected Through a Modified Needleman-Wunsch Algorithm [J].
Cappelatti, Ewerton ;
Heidrich, Regina De Oliveira ;
Oliveira, Ricardo ;
Monticelli, Cintia ;
Rodrigues, Ronaldo ;
Goulart, Rodrigo ;
Velho, Eduardo .
HCI INTERNATIONAL 2018 - POSTERS' EXTENDED ABSTRACTS, PT I, 2018, 850 :3-10
[9]   Analysis and experimental evaluation of the Needleman-Wunsch algorithm for trajectory comparison [J].
Cavojsky, Maros ;
Drozda, Martin ;
Balogh, Zoltan .
EXPERT SYSTEMS WITH APPLICATIONS, 2021, 165
[10]   Comparison of User Trajectories with the Needleman-Wunsch Algorithm [J].
Cavojsky, Maros ;
Drozda, Martin .
MOBILE COMPUTING, APPLICATIONS, AND SERVICES, MOBICASE 2019, 2019, 290 :141-154