TALCO: Tiling Genome Sequence Alignment using Convergence of Traceback Pointers

被引：5

作者：

Walia, Sumit ^{[1
]}

Ye, Cheng ^{[1
]}

Bera, Arkid ^{[1
]}

Lodhavia, Dhruvi ^{[1
]}

Turakhia, Yatish ^{[1
]}

机构：

[1] Univ Calif San Diego, Dept Elect & Comp Engn, La Jolla, CA 92093 USA

来源：

2024 IEEE INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE COMPUTER ARCHITECTURE, HPCA 2024 | 2024年

关键词：

SMITH-WATERMAN IMPLEMENTATION; ALGORITHM; DNA; ACCELERATOR; SEARCH;

D O I：

10.1109/HPCA57654.2024.00044

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

Pairwise sequence alignment is one of the most fundamental and computationally intensive steps in genome analysis. With the improving costs and throughput of third-generation sequencing technologies and the growing availability of whole-genome datasets, longer alignments are becoming more common in the field of bioinformatics. However, the high memory demands of long alignments create significant obstacles to hardware acceleration. Banding techniques allow recovering high-quality alignments with lower memory, but they also require more memory for long alignments than what is typically available on-chip in hardware accelerators. Recently, tiling-based hardware accelerators have made remarkable strides in accelerating sequence alignment, achieving three to four orders of magnitude improvement in alignment throughput over software tools without any restrictions on alignment length. However, it is crucial to note that existing tiling heuristics can cause the alignment quality to degrade, which is a critical concern for the wider adoption of accelerators in the field of bioinformatics. To address this issue, this paper describes TALCO - a novel method for tiling long sequence alignments, that, similar to prior tiling techniques, maintains a constant memory footprint during the acceleration step independent of alignment length. However, unlike previous techniques, TALCO also ensures optimal alignments under banding constraints. TALCO does this by leveraging the convergence of traceback paths beyond a tile to a single point on the boundary of that tile - a strategy that generalizes well to a broad set of sequence alignment algorithms. We demonstrate the advantages of TALCO by applying it to two different and widely-used banded sequence alignment algorithms, X-Drop and WFA-Adapt. To the best of our knowledge, this is the first time that a tiling technique is being applied to a non-classical algorithm for sequence alignment, such as WFA-Adapt. The TALCO tiling strategy is beneficial to both software and hardware. When implemented in software, the TALCO strategy reduces the memory requirements for X-Drop and WFA-Adapt algorithms by up to 39x and 57x, respectively, and when implemented as ASIC accelerator, it provides up to 1,900x and 2,000x improvement in alignment throughput/watt over CPU baselines implementing the same algorithms. Compared to state-of-the-art GPU and ASIC baselines implementing tiling heuristics, TALCO provides up to 50x and 1.1x improvement in alignment throughput, respectively, while also maintaining a higher alignment quality. Code availability: https://github.com/TurakhiaLab/TALCO.

引用

页码：498 / 514

页数：17

共 112 条

[1] GPU acceleration of Darwin read overlapper for de novo assembly of long DNA reads [J].

Ahmed, Nauman ;

Qiu, Tong Dong ;

Bertels, Koen ;

Al-Ars, Zaid .

BMC BIOINFORMATICS, 2020, 21 (Suppl 13)

[2] GASAL2: a GPU accelerated sequence alignment library for high-throughput NGS data [J].

Ahmed, Nauman ;

Levy, Jonathan ;

Ren, Shanshan ;

Mushtaq, Hamid ;

Bertels, Koen ;

Al-Ars, Zaid .

BMC BIOINFORMATICS, 2019, 20 (01)

[3] INVITED: Toward an Open-Source Digital Flow: First Learnings from the OpenROAD Project [J].

Ajayi, Tutu ;

Chhabria, Vidya A. ;

Fogaca, Mateus ;

Hashemi, Soheil ;

Hosny, Abdelrahman ;

Kahng, Andrew B. ;

Kim, Minsoo ;

Lee, Jeongsup ;

Mallappa, Uday ;

Neseem, Marina ;

Pradipta, Geraldo ;

Reda, Sherief ;

Saligane, Mehdi ;

Sapatnekar, Sachin S. ;

Sechen, Carl ;

Shalan, Mohamed ;

Swartz, William ;

Wang, Lutong ;

Wang, Zhehong ;

Woo, Mingyu ;

Xu, Bangqi .

PROCEEDINGS OF THE 2019 56TH ACM/EDAC/IEEE DESIGN AUTOMATION CONFERENCE (DAC), 2019,

[4] Dynamic programming algorithms for RNA secondary structure prediction with pseudoknots [J].

Akutsu, T .

DISCRETE APPLIED MATHEMATICS, 2000, 104 (1-3) :45-62

[5]

Allred J, 2009, INT PARALL DISTRIB P, P2989

[6] SneakySnake: a fast and accurate universal genome pre-alignment filter for CPUs, GPUs and FPGAs [J].

Alser, Mohammed ;

Shahroodi, Taha ;

Gomez-Luna, Juan ;

Alkan, Can ;

Mutlu, Onur .

BIOINFORMATICS, 2020, 36 (22-23) :5282-5290

[7] Accelerating Genome Analysis: A Primer on an Ongoing Journey [J].

Alser, Mohammed ;

Bingol, Zulal ;

Cali, Damla Senol ;

Kim, Jeremie ;

Ghose, Saugata ;

Alkan, Can ;

Mutlu, Onur .

IEEE MICRO, 2020, 40 (05) :65-75

[8] GateKeeper: a new hardware architecture for accelerating pre-alignment in DNA short read mapping [J].

Alser, Mohammed ;

Hassan, Hasan ;

Xin, Hongyi ;

Ergin, Oguz ;

Mutlu, Onur ;

Alkan, Can .

BIOINFORMATICS, 2017, 33 (21) :3355-3363

[9] BASIC LOCAL ALIGNMENT SEARCH TOOL [J].

ALTSCHUL, SF ;

GISH, W ;

MILLER, W ;

MYERS, EW ;

LIPMAN, DJ .

JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) :403-410

[10]

Amazon Web Service, Overview of aws ec2 fpga development kit

← 1 2 3 4 5 6 7 8 9 10 →