Fast Genome Analysis Leveraging Exact String Matching

被引:4
作者
Branchini, Beatrice [1 ]
Breschi, Sofia [1 ]
Zeni, Alberto [1 ]
Santambrogio, Marco D. [1 ]
机构
[1] Politecn Milan, Dipartimento Elettron Informaz & Bioingn DEIB, Milan, Italy
来源
2022 IEEE 36TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW 2022) | 2022年
关键词
ALIGNMENT;
D O I
10.1109/IPDPSW55747.2022.00032
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Genome assembly is one of the most challenging tasks in bioinformatics, as it is the key to many applications. One of the fundamental tasks in genome assembly is exact sequence alignment. This process enables the identification of recurrent patterns and mutations inside the DNA, which can substantially support clinicians in providing a quicker diagnosis and producing individual-specific drugs. However, this procedure represents a bottleneck in genome analysis as it is computationally intensive and time-consuming. In this scenario, the efficiency of the chosen algorithm to perform this operation also plays a crucial role to speed up the analysis process. In this paper, we present a high-performance, energy-efficient FPGA implementation of the Knuth Morris Pratt (KMP) algorithm. Our multi-core architecture can parallelize the alignment procedure of the sequences, significantly reducing the execution time while still maintaining high flexibility. Experimental results show that our implementation on a Xilinx Alveo U280 achieves up to 2.68x speedup and up to 7.46 x improvement in energy efficiency against Bowtie2, a State-of-the-Art application for sequence alignment run on a 40-thread Intel Xeon processor. Finally, our design also outperforms hardware-accelerated applications of the KMP present the State of the Art by up to 19.38x and 15.63x in terms of throughput and energy efficiency respectively.
引用
收藏
页码:136 / 139
页数:4
相关论文
共 15 条
[1]  
Baker Z., 2004, P 2004 ACMSIGDA 12 I, P223, DOI 10.1145/968280.968312
[2]   The Path to Personalized Medicine [J].
Hamburg, Margaret A. ;
Collins, Francis S. .
NEW ENGLAND JOURNAL OF MEDICINE, 2010, 363 (04) :301-304
[3]  
Jain KK, 2002, CURR OPIN MOL THER, V4, P548
[4]  
Kim Y, 2020, DES AUT TEST EUROPE, P115, DOI 10.23919/DATE48585.2020.9116397
[5]  
Knuth D. E., 1977, SIAM Journal on Computing, V6, P323, DOI 10.1137/0206024
[6]  
Langmead B, 2012, NAT METHODS, V9, P357, DOI [10.1038/NMETH.1923, 10.1038/nmeth.1923]
[7]   Ultrafast and memory-efficient alignment of short DNA sequences to the human genome [J].
Langmead, Ben ;
Trapnell, Cole ;
Pop, Mihai ;
Salzberg, Steven L. .
GENOME BIOLOGY, 2009, 10 (03)
[8]  
Lei SM, 2016, IEEE TRUST BIG, P1190, DOI [10.1109/TrustCom.2016.191, 10.1109/TrustCom.2016.0193]
[9]  
Moghaddam A, 2012, IEEE CONF OPEN SYST, P11
[10]  
National Center for Biotechnology Information U.S. National Library of Medicine, SEQUENCE READ ARCHIV