Accelerating the Next Generation Long Read Mapping with the FPGA-Based System

被引:36
作者
Chen, Peng [1 ]
Wang, Chao [1 ]
Li, Xi [2 ]
Zhou, Xuehai [2 ]
机构
[1] Univ Sci & Technol China, Dept Comp Sci, Hefei 230027, Anhui, Peoples R China
[2] Univ Sci & Technol China, Suzhou Inst, Suzhou 215123, Jiangsu, Peoples R China
基金
美国国家科学基金会;
关键词
Sequence alignment; long read mapping; BWA-SW; hardware acceleration; Smith-Waterman; BURROWS-WHEELER TRANSFORM; ALIGNMENT PROGRAM; TOOL;
D O I
10.1109/TCBB.2014.2326876
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
To compare the newly determined sequences against the subject sequences stored in the databases is a critical job in the bioinformatics. Fortunately, recent survey reports that the state-of-the-art aligners are already fast enough to handle the ultra amount of short sequence reads in the reasonable time. However, for aligning the long sequence reads (>400 bp) generated by the next generation sequencing (NGS) technology, it is still quite inefficient with present aligners. Furthermore, the challenge becomes more and more serious as the lengths and the amounts of the sequence reads are both keeping increasing with the improvement of the sequencing technology. Thus, it is extremely urgent for the researchers to enhance the performance of the long read alignment. In this paper, we propose a novel FPGA-based system to improve the efficiency of the long read mapping. Compared to the state-of-the-art long read aligner BWA-SW, our accelerating platform could achieve a high performance with almost the same sensitivity. Experiments demonstrate that, for reads with lengths ranging from 512 up to 4,096 base pairs, the described system obtains a 10 x -48x speedup for the bottleneck of the software. As to the whole mapping procedure, the FPGA-based platform could achieve a 1.8 x -3.3x speedup versus the BWA-SW aligner, reducing the alignment cycles from weeks to days.
引用
收藏
页码:840 / 852
页数:13
相关论文
共 39 条
[1]   BASIC LOCAL ALIGNMENT SEARCH TOOL [J].
ALTSCHUL, SF ;
GISH, W ;
MILLER, W ;
MYERS, EW ;
LIPMAN, DJ .
JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) :403-410
[2]  
[Anonymous], IEEE T COMPUTERS
[3]  
Benkrid K., 2009, IEEE T VERY LARGE SC, V17, P1859
[4]  
Chen P., 2013, P ACM SIGDA INT S FI, P271
[5]   A FPGA-based high performance acceleration platform for the Next Generation long read mapping [J].
Chen, Peng ;
Wang, Chao ;
Li, Xi ;
Zhou, Xuehai .
2013 IEEE 15TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS & 2013 IEEE INTERNATIONAL CONFERENCE ON EMBEDDED AND UBIQUITOUS COMPUTING (HPCC_EUC), 2013, :308-315
[6]  
Chen P, 2013, PROCEEDINGS OF THE 2013 INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE TECHNOLOGY (FPT), P480, DOI 10.1109/FPT.2013.6718421
[7]   PerM: efficient mapping of short sequencing reads with periodic full sensitive spaced seeds [J].
Chen, Yangho ;
Souaiaia, Tade ;
Chen, Ting .
BIOINFORMATICS, 2009, 25 (19) :2514-2521
[8]  
Cox A.J., 2007, ELAND EFFICIENT LOCA
[9]  
Faes P., 2006, PROC 1 INT C SCALABL
[10]  
Fernandez E., 2010, Proceedings 2010 International Conference on Field Programmable Logic and Applications (FPL 2010), P360, DOI 10.1109/FPL.2010.78