Alignment of High-Throughput Sequencing Data Inside In-Memory Databases

Cited by: 3
Authors
Firnkorn, Daniel [1]
Knaup-Gregori, Petra [1]
Bermejo, Justo Lorenzo [1]
Ganzinger, Matthias [1]
Affiliations
[1] Inst Med Biometry & Informat, Heidelberg, Germany
Source
E-HEALTH - FOR CONTINUITY OF CARE | 2014 / Vol. 205
Keywords
In-Memory-Technology; DNA-Alignment; HANA; high-throughput sequencing; stored procedures
DOI
10.3233/978-1-61499-432-9-476
Chinese Library Classification (CLC) number
TP39 [Computer applications]
Discipline classification code
081203; 0835
Abstract
With high-throughput DNA sequencing techniques now in wide use, efficient analysis of DNA sequences is of high importance. Computer-supported DNA analysis is still a time-consuming task. In this paper we explore the potential of in-memory database technology using SAP's High Performance Analytic Appliance (HANA). We focus on read alignment as one of the first steps in DNA sequence analysis. In particular, we examined the widely used Burrows-Wheeler Aligner (BWA) and implemented stored procedures in both HANA and the free database system MySQL to compare execution time and memory management. To ensure that the results are comparable, MySQL was also run in memory, using its integrated memory engine for database table creation. We implemented stored procedures for exact and inexact searching of DNA reads within the reference genome GRCh37. Due to technical restrictions in SAP HANA concerning recursion, inexact matching could not be implemented on this platform. Hence, the performance of HANA and MySQL was compared using the execution time of the exact search procedures. Here, HANA was approximately 27 times faster than MySQL, which indicates high potential in the new in-memory concepts and may lead to further development of DNA analysis procedures in the future.
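The abstract does not reproduce the stored procedures themselves. As a rough illustration of what exact searching in a BWA-style aligner involves, the following Python sketch performs backward search over a Burrows-Wheeler transform. It is an assumption-laden toy, not the authors' SQL code: the function names `build_fm_index` and `exact_match` are hypothetical, the suffix array is built by naive sorting, and the reference is a short string rather than GRCh37.

```python
def build_fm_index(reference):
    """Build a naive FM-index (suffix array, C table, Occ table) for a short reference."""
    text = reference + "$"  # sentinel, lexicographically smaller than A, C, G, T
    # Naive suffix array: fine for a toy string, far too slow for a real genome.
    sa = sorted(range(len(text)), key=lambda i: text[i:])
    # BWT: the character preceding each sorted suffix (wraps to "$" for suffix 0).
    bwt = "".join(text[i - 1] for i in sa)
    alphabet = sorted(set(text))
    # C[c]: number of characters in the text strictly smaller than c.
    C, total = {}, 0
    for c in alphabet:
        C[c] = total
        total += text.count(c)
    # occ[c][i]: number of occurrences of c in bwt[:i].
    occ = {c: [0] * (len(bwt) + 1) for c in alphabet}
    for i, ch in enumerate(bwt):
        for c in alphabet:
            occ[c][i + 1] = occ[c][i] + (1 if ch == c else 0)
    return sa, C, occ


def exact_match(read, sa, C, occ):
    """Return sorted 0-based positions where `read` occurs exactly in the reference."""
    if not read:
        return []
    lo, hi = 0, len(sa)  # half-open suffix-array interval [lo, hi)
    for ch in reversed(read):  # backward search: extend the match to the left
        if ch not in C:
            return []
        lo = C[ch] + occ[ch][lo]
        hi = C[ch] + occ[ch][hi]
        if lo >= hi:  # interval became empty: no exact occurrence
            return []
    return sorted(sa[i] for i in range(lo, hi))


if __name__ == "__main__":
    sa, C, occ = build_fm_index("GATTACAGATTACA")
    print(exact_match("TTAC", sa, C, occ))
```

The exact search above is a plain loop over the read. Inexact matching in BWA is usually formulated as a recursive search over the same interval structure, which is the part the abstract reports could not be implemented in HANA because of its restrictions on recursion.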
Pages: 476-480
Page count: 5