Next-Generation Massively Parallel Short-Read Mapping on FPGAs

被引:0
|
作者
Knodel, Oliver [1 ]
Preusser, Thomas B. [1 ]
Spallek, Rainer G. [1 ]
机构
[1] Tech Univ Dresden, Dept Comp Sci, Dresden, Germany
关键词
Short-Read Mapping; Sequence Alignment; FPGA; ALIGNMENT; GENOME; TOOL;
D O I
暂无
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The mapping of DNA sequences to huge genome databases is an essential analysis task in modern molecular biology. Having linearized reference genomes available, the alignment of short DNA reads obtained from the sequencing of an individual genome against such a database provides a powerful diagnostic and analysis tool. In essence, this task amounts to a simple string search tolerating a certain number of mismatches to account for the diversity of individuals. The complexity of this process arises from the sheer size of the reference genome. It is further amplified by current next-generation sequencing technologies, which produce a huge number of increasingly short reads. These short reads hurt established alignment heuristics like BLAST severely. This paper proposes an FPGA-based custom computation, which performs the alignment of short DNA reads in a timely manner by the use of tremendous concurrency for reasonable costs. The special measures to achieve an extremely efficient and compact mapping of the computation to a Xilinx FPGA architecture are described. The presented approach also surpasses all software heuristics in the quality of its results. It guarantees to find all alignment locations of a read in the database while also allowing a freely adjustable character mismatch threshold. On the contrary, advanced fast alignment heuristics like Bowtie and Maq can only tolerate small mismatch maximums with a quick deterioration of the probability to detect existing valid alignments. The performance comparison with these widely used software tools also demonstrates that the proposed FPGA computation achieves its guaranteed exact results in very competitive time.
引用
收藏
页码:195 / 201
页数:7
相关论文
共 50 条
  • [31] Next-generation sequencing and massively parallel analysis of gene expression: uses in clinical diagnostics
    Cullen, Paul
    Hoffmann, Georg
    Klein, Hanns-Georg
    Funke, Harald
    LABORATORIUMSMEDIZIN-JOURNAL OF LABORATORY MEDICINE, 2010, 34 (06): : 349 - 356
  • [32] Massively parallel analysis of single-molecule dynamics on next-generation sequencing chips
    Rivera, J. Aguirre
    Mao, G.
    Sabantsev, A.
    Panfilov, M.
    Hou, Q.
    Lindell, M.
    Chanez, C.
    Ritort, F.
    Jinek, M.
    Deindl, S.
    SCIENCE, 2024, 385 (6711) : 892 - 898
  • [33] mrsFAST: a cache-oblivious algorithm for short-read mapping
    Faraz Hach
    Fereydoun Hormozdiari
    Can Alkan
    Farhad Hormozdiari
    Inanc Birol
    Evan E Eichler
    S Cenk Sahinalp
    Nature Methods, 2010, 7 : 576 - 577
  • [34] mrsFAST: a cache-oblivious algorithm for short-read mapping
    Hach, Faraz
    Hormozdiari, Fereydoun
    Alkan, Can
    Hormozdiari, Farhad
    Birol, Inanc
    Eichler, Evan E.
    Sahinalp, S. Cenk
    NATURE METHODS, 2010, 7 (08) : 576 - 577
  • [35] ART: a next-generation sequencing read simulator
    Huang, Weichun
    Li, Leping
    Myers, Jason R.
    Marth, Gabor T.
    BIOINFORMATICS, 2012, 28 (04) : 593 - 594
  • [36] Closing the Gap - Detection of 5q-Spinal Muscular Atrophy by Short-Read Next-Generation Sequencing and Unexpected Results in a Diagnostic Patient Cohort
    Kleinle, Stephanie
    Scholz, Veronika
    Benet-Pages, Anna
    Wohlfrom, Tobias
    Gehling, Stefanie
    Scharf, Florentine
    Rost, Simone
    Prott, Eva-Christina
    Grinzinger, Susanne
    Hotter, Anna
    Haug, Verena
    Niemeier, Sabine
    Wiethoff-Ubrig, Lucia
    Hagenacker, Tim
    Goldhahn, Klaus
    von Moers, Arpad
    Walter, Maggie C.
    Reilich, Peter
    Eggermann, Katja
    Kraft, Florian
    Kurth, Ingo
    Erdmann, Hannes
    Holinski-Feder, Elke
    Neuhann, Teresa
    Abicht, Angela
    JOURNAL OF NEUROMUSCULAR DISEASES, 2023, 10 (05) : 835 - 846
  • [37] Software for pre-processing Illumina next-generation sequencing short read sequences
    Chen, Chuming
    Khaleel, Sari S.
    Huang, Hongzhan
    Wu, Cathy H.
    SOURCE CODE FOR BIOLOGY AND MEDICINE, 2014, 9 (01):
  • [38] Sequence and structural variation in a human genome uncovered by short-read, massively parallel ligation sequencing using two-base encoding
    McKernan, Kevin Judd
    Peckham, Heather E.
    Costa, Gina L.
    McLaughlin, Stephen F.
    Fu, Yutao
    Tsung, Eric F.
    Clouser, Christopher R.
    Duncan, Cisyla
    Ichikawa, Jeffrey K.
    Lee, Clarence C.
    Zhang, Zheng
    Ranade, Swati S.
    Dimalanta, Eileen T.
    Hyland, Fiona C.
    Sokolsky, Tanya D.
    Zhang, Lei
    Sheridan, Andrew
    Fu, Haoning
    Hendrickson, Cynthia L.
    Li, Bin
    Kotler, Lev
    Stuart, Jeremy R.
    Malek, Joel A.
    Manning, Jonathan M.
    Antipova, Alena A.
    Perez, Damon S.
    Moore, Michael P.
    Hayashibara, Kathleen C.
    Lyons, Michael R.
    Beaudoin, Robert E.
    Coleman, Brittany E.
    Laptewicz, Michael W.
    Sannicandro, Adam E.
    Rhodes, Michael D.
    Gottimukkala, Rajesh K.
    Yang, Shan
    Bafna, Vineet
    Bashir, Ali
    MacBride, Andrew
    Alkan, Can
    Kidd, Jeffrey M.
    Eichler, Evan E.
    Reese, Martin G.
    De la Vega, Francisco M.
    Blanchard, Alan P.
    GENOME RESEARCH, 2009, 19 (09) : 1527 - 1541
  • [39] Compressive mapping for next-generation sequencing
    Deniz Yorukoglu
    Yun William Yu
    Jian Peng
    Bonnie Berger
    Nature Biotechnology, 2016, 34 : 374 - 376
  • [40] Next-generation peptide sequencing The concept of massively parallel single-molecule protein sequencing emerges
    Tang, Lei
    NATURE METHODS, 2018, 15 (12) : 997 - 997