Adapting the GACT-X Aligner to Accelerate Minimap2 in an FPGA Cloud Instance

被引:5
作者
Teng, Carolina [1 ]
Achjian, Renan Weege [2 ]
Wang, Jiang Chau [1 ]
Fonseca, Fernando Josepetti [1 ]
机构
[1] Univ Sao Paulo, Sch Engn, Dept Elect Syst Engn, BR-05508010 Sao Paulo, Brazil
[2] Univ Sao Paulo, Inst Biomed Sci, Dept Parasitol, BR-05508000 Sao Paulo, Brazil
来源
APPLIED SCIENCES-BASEL | 2023年 / 13卷 / 07期
关键词
field programmable gate arrays; cloud computing; FPGA cloud; Minimap2; Smith-Waterman-Gotoh; coprocessors; acceleration; genomics; bioinformatics; hybrid systems; ALIGNMENT;
D O I
10.3390/app13074385
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
In genomic analysis, long reads are an emerging type of data processed by assembly algorithms to recover the complete genome sample. They are, on average, one or two orders of magnitude longer than short reads from the previous generation, which provides important advantages in information quality. However, longer sequences bring new challenges to computer processing, undermining the performance of assembly algorithms developed for short reads. This issue is amplified by the exponential growth of genetic data generation and by the slowdown of transistor technology progress, illustrated by Moore's Law. Minimap2 is the current state-of-the-art long-read assembler and takes dozens of CPU hours to assemble a human genome with clinical standard coverage. One of its bottlenecks, the alignment stage, has not been successfully accelerated on FPGAs in the literature. GACT-X is an alignment algorithm developed for FPGA implementation, suitable for any size input sequence. In this work, GACT-X was adapted to work as the aligner of Minimap2, and these are integrated and implemented in an FPGA cloud platform. The measurements for accuracy and speed-up are presented for three different datasets in different combinations of numbers of kernels and threads. The integrated solution's performance limitations due to data transfer are also analyzed and discussed.
引用
收藏
页数:22
相关论文
共 46 条
[41]   Introducing difference recurrence relations for faster semi-global alignment of long sequences [J].
Suzuki, Hajime ;
Kasahara, Masahiro .
BMC BIOINFORMATICS, 2018, 19
[42]   Accelerating the base-level alignment step of DNA assembling in Minimap2 Algorithm using FPGA [J].
Teng, Carolina ;
Achjian, Renan W. ;
Braga, Caio C. ;
Zuffo, Marcelo K. ;
Chau, Wang J. .
2021 IEEE 12TH LATIN AMERICA SYMPOSIUM ON CIRCUITS AND SYSTEM (LASCAS), 2021,
[43]   Mechanisms and enzymes involved in SARS coronavirus genome expression [J].
Thiel, V ;
Ivanov, KA ;
Putics, A ;
Hertzig, T ;
Schelle, B ;
Bayer, S ;
Weissbrich, B ;
Snijder, EJ ;
Rabenau, H ;
Doerr, HW ;
Gorbalenya, AE ;
Ziebuhr, J .
JOURNAL OF GENERAL VIROLOGY, 2003, 84 :2305-2315
[44]  
tmux, US
[45]   Darwin-WGA: A Co-processor Provides Increased Sensitivity in Whole Genome Alignments with High Speedup [J].
Turakhia, Yatish ;
Goenka, Sneha D. ;
Bejerano, Gill ;
Dally, William J. .
2019 25TH IEEE INTERNATIONAL SYMPOSIUM ON HIGH PERFORMANCE COMPUTER ARCHITECTURE (HPCA), 2019, :359-372
[46]   When FPGA Meets Cloud: A First Look at Performance [J].
Wang, Xiuxiu ;
Niu, Yipei ;
Liu, Fangming ;
Xu, Zichen .
IEEE TRANSACTIONS ON CLOUD COMPUTING, 2022, 10 (02) :1344-1357