Next-Generation Massively Parallel Short-Read Mapping on FPGAs

被引:0
|
作者
Knodel, Oliver [1 ]
Preusser, Thomas B. [1 ]
Spallek, Rainer G. [1 ]
机构
[1] Tech Univ Dresden, Dept Comp Sci, Dresden, Germany
关键词
Short-Read Mapping; Sequence Alignment; FPGA; ALIGNMENT; GENOME; TOOL;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The mapping of DNA sequences to huge genome databases is an essential analysis task in modern molecular biology. Having linearized reference genomes available, the alignment of short DNA reads obtained from the sequencing of an individual genome against such a database provides a powerful diagnostic and analysis tool. In essence, this task amounts to a simple string search tolerating a certain number of mismatches to account for the diversity of individuals. The complexity of this process arises from the sheer size of the reference genome. It is further amplified by current next-generation sequencing technologies, which produce a huge number of increasingly short reads. These short reads hurt established alignment heuristics like BLAST severely. This paper proposes an FPGA-based custom computation, which performs the alignment of short DNA reads in a timely manner by the use of tremendous concurrency for reasonable costs. The special measures to achieve an extremely efficient and compact mapping of the computation to a Xilinx FPGA architecture are described. The presented approach also surpasses all software heuristics in the quality of its results. It guarantees to find all alignment locations of a read in the database while also allowing a freely adjustable character mismatch threshold. On the contrary, advanced fast alignment heuristics like Bowtie and Maq can only tolerate small mismatch maximums with a quick deterioration of the probability to detect existing valid alignments. The performance comparison with these widely used software tools also demonstrates that the proposed FPGA computation achieves its guaranteed exact results in very competitive time.
引用
收藏
页码:195 / 201
页数:7
相关论文
共 50 条
  • [1] Next-generation sequencing of newborn screening genes: the accuracy of short-read mapping
    Trier, C.
    Fournous, G.
    Strand, J. M.
    Stray-Pedersen, A.
    Pettersen, R. D.
    Rowe, A. D.
    NPJ GENOMIC MEDICINE, 2020, 5 (01)
  • [2] Next-generation sequencing of newborn screening genes: the accuracy of short-read mapping
    C. Trier
    G. Fournous
    J. M. Strand
    A. Stray-Pedersen
    R. D. Pettersen
    A. D. Rowe
    npj Genomic Medicine, 5
  • [3] MOSAIK: A Hash-Based Algorithm for Accurate Next-Generation Sequencing Short-Read Mapping
    Lee, Wan-Ping
    Stromberg, Michael P.
    Ward, Alistair
    Stewart, Chip
    Garrison, Erik P.
    Marth, Gabor T.
    PLOS ONE, 2014, 9 (03):
  • [4] An approach for determination of copy number variation using short-read next-generation sequencing
    Reeves, K.
    Bourbon, M.
    Hurd, D.
    Reid, J.
    Molha, D.
    Houniet, D.
    Cousin, J.
    EUROPEAN JOURNAL OF HUMAN GENETICS, 2018, 26 : 649 - 649
  • [5] Detection of somatic structural variants from short-read next-generation sequencing data
    Gong, Tingting
    Hayes, Vanessa M.
    Chan, Eva K. F.
    BRIEFINGS IN BIOINFORMATICS, 2021, 22 (03)
  • [6] Tandem repeat genotyping using massively parallel second generation sequencing: comparison of short-read and long-read technologies
    Radvanszky, Jan
    Lojova, Ingrid
    Kucharik, Marcel
    Balaz, Andrej
    Kvapilova, Katerina
    Kvapil, Petr
    Brzon, Ondrej
    Kasny, Martin
    Duranova, Terezia
    Forgacova, Natalia
    Hrnciar, Matej
    Holesova, Zuzana
    Martis, Jozef
    Sitarcik, Jozef
    Budis, Jaroslav
    Szemes, Tomas
    EUROPEAN JOURNAL OF HUMAN GENETICS, 2024, 32 : 1784 - 1785
  • [7] Fast and accurate HLA typing from short-read next-generation sequence data with xHLA
    Xie, Chao
    Yeo, Zhen Xuan
    Wong, Marie
    Piper, Jason
    Long, Tao
    Kirkness, Ewen F.
    Biggs, William H.
    Bloom, Ken
    Spellman, Stephen
    Vierra-Green, Cynthia
    Brady, Colleen
    Scheuermann, Richard H.
    Telenti, Amalio
    Howard, Sally
    Brewerton, Suzanne
    Turpaz, Yaron
    Venter, J. Craig
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2017, 114 (30) : 8059 - 8064
  • [8] Accurate estimation of short read mapping quality for next-generation genome sequencing
    Ruffalo, Matthew
    Koyutuerk, Mehmet
    Ray, Soumya
    LaFramboise, Thomas
    BIOINFORMATICS, 2012, 28 (18) : I349 - I355
  • [9] Comparison of short-read and long-read next-generation sequencing technologies for determining HIV-1 drug resistance
    Vellas, Camille
    Doudou, Amira
    Mohamed, Sofiane
    Raymond, Stephanie
    Jeanne, Nicolas
    Latour, Justine
    Demmou, Sofia
    Ranger, Noemie
    Gonzalez, Dimitri
    Delobel, Pierre
    Izopet, Jacques
    JOURNAL OF MEDICAL VIROLOGY, 2024, 96 (10)
  • [10] Citation Classic: Massively Parallel ("Next-Generation") DNA Sequencing
    Rothberg, Bonnie E. Gould
    Rothberg, Jonathan M.
    CLINICAL CHEMISTRY, 2015, 61 (07) : 997 - 998