Dataflow acceleration of Smith-Waterman with Traceback for high throughput Next Generation Sequencing

被引:16
作者
Koliogeorgi, K. [1 ]
Voss, N. [2 ]
Fytraki, S. [2 ]
Xydis, S. [1 ]
Gaydadjiev, G. [2 ]
Soudris, D. [1 ]
机构
[1] NTUA, ECE, Microproc & Digital Syst Lab, Athens, Greece
[2] Maxeler Technol, London, England
来源
2019 29TH INTERNATIONAL CONFERENCE ON FIELD-PROGRAMMABLE LOGIC AND APPLICATIONS (FPL) | 2019年
基金
欧盟地平线“2020”;
关键词
Next Generation Sequencing; Reconfigurable Acceleration; Dataflow Computing; Bowtie2; Smith Waterman; Traceback; READ ALIGNMENT;
D O I
10.1109/FPL.2019.00021
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Smith-Waterman algorithm is widely adopted by most popular DNA sequence aligners. The inherent algorithm computational intensity and the vast amount of NGS input data it operates on, create a bottleneck in genomic analysis flows for short-read alignment. FPGA architectures have been extensively leveraged to alleviate the problem, each one adopting a different approach. In existing solutions, effective co-design of the NGS short-read alignment still remains an open issue, mainly due to narrow view on real integration aspects, such as system wide communication and accelerator call overheads. In this paper, we propose a dataflow architecture for Smith-Waterman Matrix-fill and Traceback alignment stages, to perform short-read alignment on NGS data. The architectural decision of moving both stages on chip extinguishes the communication overhead, and coupled with radical software restructuring, allows for efficient integration into widely-used Bowtie2 aligner. This approach delivers x18 speedup over the respective Bowtie2 standalone components, while our co-designed Bowtie2 demonstrates a 35% boost in performance.
引用
收藏
页码:74 / 80
页数:7
相关论文
共 26 条
[1]  
[Anonymous], 2006, FIELD PROGRAMMABLE L
[2]  
[Anonymous], ALIGNING SEQUENCE RE, DOI DOI 10.48550/ARXIV.1303.3997
[3]  
Arram J, 2013, LECT NOTES COMPUT SC, V7806, P13, DOI 10.1007/978-3-642-36812-7_2
[4]   Reconfigurable Acceleration of Short Read Mapping [J].
Arram, James ;
Tsoi, Kuen Hung ;
Luk, Wayne ;
Jiang, Peiyong .
2013 IEEE 21ST ANNUAL INTERNATIONAL SYMPOSIUM ON FIELD-PROGRAMMABLE CUSTOM COMPUTING MACHINES (FCCM), 2013, :210-217
[5]   A Highly Parameterized and Efficient FPGA-Based Skeleton for Pairwise Biological Sequence Alignment [J].
Benkrid, Khaled ;
Liu, Ying ;
Benkrid, AbdSamad .
IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2009, 17 (04) :561-570
[6]   Preemptive Genotyping for Personalized Medicine: Design of the Right Drug, Right Dose, Right Time-Using Genomic Data to Individualize Treatment Protocol [J].
Bielinski, Suzette J. ;
Olson, Janet E. ;
Pathak, Jyotishman ;
Weinshilboum, Richard M. ;
Wang, Liewei ;
Lyke, Kelly J. ;
Ryu, Euijung ;
Targonski, Paul V. ;
Van Norstrand, Michael D. ;
Hathcock, Matthew A. ;
Takahashi, Paul Y. ;
McCormick, Jennifer B. ;
Johnson, Kiley J. ;
Maschke, Karen J. ;
Vitek, Carolyn R. Rohrer ;
Ellingson, Marissa S. ;
Wieben, Eric D. ;
Farrugia, Gianrico ;
Morrisette, Jody A. ;
Kruckeberg, Keri J. ;
Bruflat, Jamie K. ;
Peterson, Lisa M. ;
Blommel, Joseph H. ;
Skierka, Jennifer M. ;
Ferber, Matthew J. ;
Black, John L. ;
Baudhuin, Linnea M. ;
Klee, Eric W. ;
Ross, Jason L. ;
Veldhuizen, Tamra L. ;
Schultz, Cloann G. ;
Caraballo, Pedro J. ;
Freimuth, Robert R. ;
Chute, Christopher G. ;
Kullo, Iftikhar J. .
MAYO CLINIC PROCEEDINGS, 2014, 89 (01) :25-33
[7]  
Chen DC, 2007, SPRINGER SER OPTIM A, V7, P39
[8]  
Chen Yu-Ting., 2016, Proceedings of the 8th USENIX Conference on Hot Topics in Cloud Computing, P64
[9]   A framework for variation discovery and genotyping using next-generation DNA sequencing data [J].
DePristo, Mark A. ;
Banks, Eric ;
Poplin, Ryan ;
Garimella, Kiran V. ;
Maguire, Jared R. ;
Hartl, Christopher ;
Philippakis, Anthony A. ;
del Angel, Guillermo ;
Rivas, Manuel A. ;
Hanna, Matt ;
McKenna, Aaron ;
Fennell, Tim J. ;
Kernytsky, Andrew M. ;
Sivachenko, Andrey Y. ;
Cibulskis, Kristian ;
Gabriel, Stacey B. ;
Altshuler, David ;
Daly, Mark J. .
NATURE GENETICS, 2011, 43 (05) :491-+
[10]  
Guccione SA, 2002, LECT NOTES COMPUT SC, V2438, P1168