Alignment of High-Throughput Sequencing Data Inside In-Memory Databases

被引:3
作者
Firnkorn, Daniel [1 ]
Knaup-Gregori, Petra [1 ]
Bermejo, Justo Lorenzo [1 ]
Ganzinger, Matthias [1 ]
机构
[1] Inst Med Biometry & Informat, Heidelberg, Germany
来源
E-HEALTH - FOR CONTINUITY OF CARE | 2014年 / 205卷
关键词
In-Memory-Technology; DNA-Alignment; HANA; high-throughput sequencing; stored procedures;
D O I
10.3233/978-1-61499-432-9-476
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In times of high-throughput DNA sequencing techniques, performance-capable analysis of DNA sequences is of high importance. Computer supported DNA analysis is still an intensive time-consuming task. In this paper we explore the potential of a new In-Memory database technology by using SAP's High Performance Analytic Appliance (HANA). We focus on read alignment as one of the first steps in DNA sequence analysis. In particular, we examined the widely used Burrows-Wheeler Aligner (BWA) and implemented stored procedures in both, HANA and the free database system MySQL, to compare execution time and memory management. To ensure that the results are comparable, MySQL has been running in memory as well, utilizing its integrated memory engine for database table creation. We implemented stored procedures, containing exact and inexact searching of DNA reads within the reference genome GRCh37. Due to technical restrictions in SAP HANA concerning recursion, the inexact matching problem could not be implemented on this platform. Hence, performance analysis between HANA and MySQL was made by comparing the execution time of the exact search procedures. Here, HANA was approximately 27 times faster than MySQL which means, that there is a high potential within the new In-Memory concepts, leading to further developments of DNA analysis procedures in the future.
引用
收藏
页码:476 / 480
页数:5
相关论文
共 50 条
  • [41] A Pipeline for the Error-Free Identification of Somatic Alu Insertions in High-Throughput Sequencing Data
    G. A. Nugmanov
    A. Y. Komkov
    M. V. Saliutina
    A. A. Minervina
    Y. B. Lebedev
    I. Z. Mamedov
    [J]. Molecular Biology, 2019, 53 : 138 - 146
  • [42] Effect of k-tuple length on sample-comparison with high-throughput sequencing data
    Wang, Ying
    Lei, Xiaoye
    Wang, Shun
    Wang, Zicheng
    Song, Nianfeng
    Zeng, Feng
    Chen, Ting
    [J]. BIOCHEMICAL AND BIOPHYSICAL RESEARCH COMMUNICATIONS, 2016, 469 (04) : 1021 - 1027
  • [43] MICRA: an automatic pipeline for fast characterization of microbial genomes from high-throughput sequencing data
    Ségolène Caboche
    Gaël Even
    Alexandre Loywick
    Christophe Audebert
    David Hot
    [J]. Genome Biology, 18
  • [44] MICRA: an automatic pipeline for fast characterization of microbial genomes from high-throughput sequencing data
    Caboche, Segolene
    Even, Gael
    Loywick, Alexandre
    Audebert, Christophe
    Hot, David
    [J]. GENOME BIOLOGY, 2017, 18
  • [45] Evaluating amplicon high-throughput sequencing data of microalgae living in melting snow: improvements and limitations
    Lutz, Stefanie
    Prochazkova, Lenka
    Benning, Liane G.
    Nedbalova, Linda
    Remias, Daniel
    [J]. FOTTEA, 2019, 19 (02) : 115 - 131
  • [46] Characterization of microbiota of naturally fermented sauerkraut by high-throughput sequencing
    Zhang, Shuang
    Zhang, Yichen
    Wu, Lihong
    Zhang, Lili
    Wang, Song
    [J]. FOOD SCIENCE AND BIOTECHNOLOGY, 2023, 32 (06) : 855 - 862
  • [47] High-throughput sequencing: a breakthrough in molecular diagnosis for precision medicine
    Dongare, Dipali Barku
    Nishad, Shaik Shireen
    Mastoli, Sakshi Y.
    Saraf, Shubhini A.
    Srivastava, Nidhi
    Dey, Abhishek
    [J]. FUNCTIONAL & INTEGRATIVE GENOMICS, 2025, 25 (01)
  • [48] Commercial high-throughput sequencing and its applications in DNA analysis
    Peng, Hai
    Zhang, Jing
    [J]. BIOLOGIA, 2009, 64 (01) : 20 - 26
  • [49] Characterization of microbiota of naturally fermented sauerkraut by high-throughput sequencing
    Shuang Zhang
    Yichen Zhang
    Lihong Wu
    Lili Zhang
    Song Wang
    [J]. Food Science and Biotechnology, 2023, 32 : 855 - 862
  • [50] Application of High-Throughput Sequencing in Medicinal Plant Transcriptome Studies
    Hao, Da-Cheng
    Chen, Shi-Lin
    Xiao, Pei-Gen
    Liu, Ming
    [J]. DRUG DEVELOPMENT RESEARCH, 2012, 73 (08) : 487 - 498