Accelerated Seeding for Genome Sequence Alignment with Enumerated Radix Trees

被引:20
作者
Subramaniyan, Arun [1 ]
Wadden, Jack [1 ]
Goliya, Kush [1 ]
Ozog, Nathan [1 ]
Wu, Xiao [1 ]
Narayanasamy, Satish [1 ]
Blaauw, David [1 ]
Das, Reetuparna [1 ]
机构
[1] Univ Michigan, Ann Arbor, MI 48109 USA
来源
2021 ACM/IEEE 48TH ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE (ISCA 2021) | 2021年
关键词
Genomics; Sequence Alignment; Bioinformatics; Computer Architecture; READ ALIGNMENT; FM-INDEX; MRSFAST; SNP;
D O I
10.1109/ISCA52012.2021.00038
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Read alignment is a time-consuming step in genome sequencing analysis. The most widely used software for read alignment, BWA-MEM, and the recently published faster version BWA-MEM2 are based on the seed-and-extend paradigm for read alignment. The seeding step of read alignment is a major bottleneck contributing similar to 40% to the overall execution time of BWA-MEM2 when aligning whole human genome reads from the Platinum Genomes dataset. This is because both BWA-MEM and BWA-MEM2 use a compressed index structure called the FN1D-Index, which results in high bandwidth requirements, primarily due to its character-by-character processing of reads. For instance, to seed each read (101 DNA base-pairs stored in 37.8 bytes), the FMD-Index solution in BWA-MEM2 requires similar to 48.5 KB of index data. We propose a novel indexing data structure named Enumerated Radix Tree (ERT) and design a custom sling accelerator based on it. ERT improves bandwidth efficiency of BWA-MEM2 by 4.5x while guaranteeing 100% identical output to the original software, and still fitting in 64 GB DRAM. Overall, the proposed seeding accelerator implemented on AWS Fl FPGA (f1.4xlarge) improves seeding throughput of BWA-MEM2 by 3.3x. When combined with seed-extension accelerators, we observe a 2.1x improvement in overall read alignment throughput over BWAMEM2. The software implementation of ERT is integrated into BWA-MEM2 (ert branch: https://github.com/bwa-mem2/bwa-mem2/tree/ert) and is open sourced for the benefit of the research community.
引用
收藏
页码:388 / 401
页数:14
相关论文
共 53 条
[1]   Hobbes: optimized gram-based methods for efficient read alignment [J].
Ahmadi, Athena ;
Behm, Alexander ;
Honnalli, Nagesh ;
Li, Chen ;
Weng, Lingjie ;
Xie, Xiaohui .
NUCLEIC ACIDS RESEARCH, 2012, 40 (06) :e41
[2]  
Ahmed N, 2015, ICCAD-IEEE ACM INT, P240, DOI 10.1109/ICCAD.2015.7372576
[3]   A Scalable Processing-in-Memory Accelerator for Parallel Graph Processing [J].
Ahn, Junwhan ;
Hong, Sungpack ;
Yoo, Sungjoo ;
Mutlu, Onur ;
Choi, Kiyoung .
2015 ACM/IEEE 42ND ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE (ISCA), 2015, :105-117
[4]   Personalized copy number and segmental duplication maps using next-generation sequencing [J].
Alkan, Can ;
Kidd, Jeffrey M. ;
Marques-Bonet, Tomas ;
Aksay, Gozde ;
Antonacci, Francesca ;
Hormozdiari, Fereydoun ;
Kitzman, Jacob O. ;
Baker, Carl ;
Malig, Maika ;
Mutlu, Onur ;
Sahinalp, S. Cenk ;
Gibbs, Richard A. ;
Eichler, Evan E. .
NATURE GENETICS, 2009, 41 (10) :1061-U29
[5]   Shouji: a fast and efficient pre-alignment filter for sequence alignment [J].
Alser, Mohammed ;
Hassan, Hasan ;
Kumar, Akash ;
Mutlu, Onur ;
Alkan, Can .
BIOINFORMATICS, 2019, 35 (21) :4255-4263
[6]   GateKeeper: a new hardware architecture for accelerating pre-alignment in DNA short read mapping [J].
Alser, Mohammed ;
Hassan, Hasan ;
Xin, Hongyi ;
Ergin, Oguz ;
Mutlu, Onur ;
Alkan, Can .
BIOINFORMATICS, 2017, 33 (21) :3355-3363
[7]  
[Anonymous], 2010, P 2010 ACM SIGMOD IN, DOI DOI 10.1145/1807167.1807206
[8]  
Boehm M., EFFICIENT IN MEMORY
[9]   GenASM: A High-Performance, Low-Power Approximate String Matching Acceleration Framework for Genome Sequence Analysis [J].
Cali, Damla Senol ;
Kalsi, Gurpreet S. ;
Bingol, Zulal ;
Firtina, Can ;
Subramanian, Lavanya ;
Kim, Jeremie S. ;
Ausavarungnirun, Rachata ;
Alser, Mohammed ;
Gomez-Luna, Juan ;
Boroumand, Amirali ;
Nori, Anant ;
Scibisz, Allison ;
Subramoney, Sreenivas ;
Alkan, Can ;
Ghose, Saugata ;
Mutlu, Onur .
2020 53RD ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE (MICRO 2020), 2020, :951-966
[10]   Boosting the FM-Index on the GPU: Effective Techniques to Mitigate Random Memory Access [J].
Chacon, Alejandro ;
Marco-Sola, Santiago ;
Espinosa, Antonio ;
Ribeca, Paolo ;
Carlos Moure, Juan .
IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2015, 12 (05) :1048-1059