A Block-Based Systolic Array on an HBM2 FPGA for DNA Sequence Alignment

被引:11
作者
Ben Abdelhamid, Riadh [1 ]
Yamaguchi, Yoshiki [2 ]
机构
[1] Univ Tsukuba, Grad Sch Syst & Informat Engn, 1-1-1 Ten Ou Dai, Tsukuba, Ibaraki 3058573, Japan
[2] Univ Tsukuba, Fac Engn Informat & Syst, 1-1-1 Ten Ou Dai, Tsukuba, Ibaraki 3058573, Japan
来源
APPLIED RECONFIGURABLE COMPUTING. ARCHITECTURES, TOOLS, AND APPLICATIONS, ARC 2020 | 2020年 / 12083卷
关键词
DNA sequence alignment; Smith-Waterman algorithm; Systolic array; HBM2; High Level Synthesis; Reconfigurable High Performance Computing; SEARCH;
D O I
10.1007/978-3-030-44534-8_23
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Revealing the optimal local similarity between a pair of genomic sequences is one of the most fundamental issues in bioinformatics. The Smith-Waterman algorithm is a method that was developed for that specific purpose. With the continuous advances in the computer field, this method becomes widely used to an extent where it expanded its reach to cover a broad range of applications, even in areas such as network packet inspections and pattern matching. This algorithm is based on Dynamic Programming and is guaranteed to find the optimal local sequence alignment between two base pairs. The computational complexity is O(mn), where m and n are defined as the number of the elements of a query and a database sequence, respectively. Researchers have investigated several manners to accelerate the calculation using CPU, GPU, Cell B.E., and FPGA. Most of them have proposed a data-reuse approach because the Smith-Waterman algorithm has rather high "bytes per operation"; in other words, the Smith-Waterman algorithm requires large memory bandwidth. In this paper, we try to minimize the impact of the memory bandwidth bottleneck through the implementation of a block-based systolic array approach that maximizes the usage of memory banks in HBM2 (High Bandwidth Memory). The proposed approach demonstrates a higher performance in terms of GCUPS (Giga Cell Update Per Second) compared to one of the best cases reported in previous works, and also achieves a significant improvement in power efficiency. For example, our implementation could reach 429.39 GCUPS while achieving a power efficiency of 7.68 GCUPS/W. With a different configuration, it could reach 316.73 GCUPS while hitting a peak power efficiency of 8.86 GCUPS/W.
引用
收藏
页码:298 / 313
页数:16
相关论文
共 21 条
[11]  
Nurdin Dayana Saiful, 2018, MATEC Web of Conferences, V150, DOI 10.1051/matecconf/201815006009
[12]   Faster Smith-Waterman database searches with inter-sequence SIMD parallelisation [J].
Rognes, Torbjorn .
BMC BIOINFORMATICS, 2011, 12
[13]   SWIFOLD: Smith-Waterman implementation on FPGA with OpenCL for long DNA sequences [J].
Rucci, Enzo ;
Garcia, Carlos ;
Botella, Guillermo ;
De Giusti, Armando ;
Naiouf, Marcelo ;
Prieto-Matias, Manuel .
BMC SYSTEMS BIOLOGY, 2018, 12
[14]   OSWALD: OpenCL Smith-Waterman on Altera's FPGA for Large Protein Databases [J].
Rucci, Enzo ;
Garcia, Carlos ;
Botella, Guillermo ;
De Giusti, Armando E. ;
Naiouf, Marcelo ;
Prieto-Matias, Manuel .
INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2018, 32 (03) :337-350
[15]  
Sandes E., 2014, IEEE ACM INT SYMP, P160
[16]   IDENTIFICATION OF COMMON MOLECULAR SUBSEQUENCES [J].
SMITH, TF ;
WATERMAN, MS .
JOURNAL OF MOLECULAR BIOLOGY, 1981, 147 (01) :195-197
[17]   FPGASW: Accelerating Large-Scale Smith-Waterman Sequence Alignment Application with Backtracking on FPGA Linear Systolic Array [J].
Xia Fei ;
Zou Dan ;
Lu Lina ;
Man Xin ;
Zhang Chunlei .
INTERDISCIPLINARY SCIENCES-COMPUTATIONAL LIFE SCIENCES, 2018, 10 (01) :176-188
[18]  
Xilinx, Vivado HLS Optimization Methodology Guide
[19]  
xilinx, Alveo U280 Data Center Accelerator Card
[20]  
Yamaguchi Y, 2011, LECT NOTES COMPUT SC, V6578, P181, DOI 10.1007/978-3-642-19475-7_20