A Block-Based Systolic Array on an HBM2 FPGA for DNA Sequence Alignment

被引:11
作者
Ben Abdelhamid, Riadh [1 ]
Yamaguchi, Yoshiki [2 ]
机构
[1] Univ Tsukuba, Grad Sch Syst & Informat Engn, 1-1-1 Ten Ou Dai, Tsukuba, Ibaraki 3058573, Japan
[2] Univ Tsukuba, Fac Engn Informat & Syst, 1-1-1 Ten Ou Dai, Tsukuba, Ibaraki 3058573, Japan
来源
APPLIED RECONFIGURABLE COMPUTING. ARCHITECTURES, TOOLS, AND APPLICATIONS, ARC 2020 | 2020年 / 12083卷
关键词
DNA sequence alignment; Smith-Waterman algorithm; Systolic array; HBM2; High Level Synthesis; Reconfigurable High Performance Computing; SEARCH;
D O I
10.1007/978-3-030-44534-8_23
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Revealing the optimal local similarity between a pair of genomic sequences is one of the most fundamental issues in bioinformatics. The Smith-Waterman algorithm is a method that was developed for that specific purpose. With the continuous advances in the computer field, this method becomes widely used to an extent where it expanded its reach to cover a broad range of applications, even in areas such as network packet inspections and pattern matching. This algorithm is based on Dynamic Programming and is guaranteed to find the optimal local sequence alignment between two base pairs. The computational complexity is O(mn), where m and n are defined as the number of the elements of a query and a database sequence, respectively. Researchers have investigated several manners to accelerate the calculation using CPU, GPU, Cell B.E., and FPGA. Most of them have proposed a data-reuse approach because the Smith-Waterman algorithm has rather high "bytes per operation"; in other words, the Smith-Waterman algorithm requires large memory bandwidth. In this paper, we try to minimize the impact of the memory bandwidth bottleneck through the implementation of a block-based systolic array approach that maximizes the usage of memory banks in HBM2 (High Bandwidth Memory). The proposed approach demonstrates a higher performance in terms of GCUPS (Giga Cell Update Per Second) compared to one of the best cases reported in previous works, and also achieves a significant improvement in power efficiency. For example, our implementation could reach 429.39 GCUPS while achieving a power efficiency of 7.68 GCUPS/W. With a different configuration, it could reach 316.73 GCUPS while hitting a peak power efficiency of 8.86 GCUPS/W.
引用
收藏
页码:298 / 313
页数:16
相关论文
共 21 条
[1]   BASIC LOCAL ALIGNMENT SEARCH TOOL [J].
ALTSCHUL, SF ;
GISH, W ;
MILLER, W ;
MYERS, EW ;
LIPMAN, DJ .
JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) :403-410
[2]  
Chen P., 2013, P 2013 INT C, P480
[3]   Parasail: SIMD C library for global, semi-global, and local pairwise sequence alignments [J].
Daily, Jeff .
BMC BIOINFORMATICS, 2016, 16
[4]   CUDAlign 4.0: Incremental Speculative Traceback for Exact Chromosome-Wide Alignment in GPU Clusters [J].
de Oliveira Sandes, Edans Flavius ;
Miranda, Guillermo ;
Martorell, Xavier ;
Ayguade, Eduard ;
Teodoro, George ;
Magalhaes Melo, Alba Cristina .
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2016, 27 (10) :2838-2850
[5]  
Di Tucci L., 2017, DES AUT TEST EUROPE, P716, DOI 10.23919DATE.2017.7927082
[6]  
Hasan L., 2008, A systolic array architecture for the SmithWaterman algorithm with high performance cell design, P35
[7]  
Houtgast E., 2017, IEEE INT C BIOINF BI
[8]  
KUNG HT, 1982, COMPUTER, V15, P37, DOI 10.1109/MC.1982.1653825
[9]   CUDASW++3.0: accelerating Smith-Waterman protein database search by coupling CPU and GPU SIMD instructions [J].
Liu, Yongchao ;
Wirawan, Adrianto ;
Schmidt, Bertil .
BMC BIOINFORMATICS, 2013, 14 :117
[10]   A GENERAL METHOD APPLICABLE TO SEARCH FOR SIMILARITIES IN AMINO ACID SEQUENCE OF 2 PROTEINS [J].
NEEDLEMAN, SB ;
WUNSCH, CD .
JOURNAL OF MOLECULAR BIOLOGY, 1970, 48 (03) :443-+