Alignment of High-Throughput Sequencing Data Inside In-Memory Databases

被引:3
作者
Firnkorn, Daniel [1 ]
Knaup-Gregori, Petra [1 ]
Bermejo, Justo Lorenzo [1 ]
Ganzinger, Matthias [1 ]
机构
[1] Inst Med Biometry & Informat, Heidelberg, Germany
来源
E-HEALTH - FOR CONTINUITY OF CARE | 2014年 / 205卷
关键词
In-Memory-Technology; DNA-Alignment; HANA; high-throughput sequencing; stored procedures;
D O I
10.3233/978-1-61499-432-9-476
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In times of high-throughput DNA sequencing techniques, performance-capable analysis of DNA sequences is of high importance. Computer supported DNA analysis is still an intensive time-consuming task. In this paper we explore the potential of a new In-Memory database technology by using SAP's High Performance Analytic Appliance (HANA). We focus on read alignment as one of the first steps in DNA sequence analysis. In particular, we examined the widely used Burrows-Wheeler Aligner (BWA) and implemented stored procedures in both, HANA and the free database system MySQL, to compare execution time and memory management. To ensure that the results are comparable, MySQL has been running in memory as well, utilizing its integrated memory engine for database table creation. We implemented stored procedures, containing exact and inexact searching of DNA reads within the reference genome GRCh37. Due to technical restrictions in SAP HANA concerning recursion, the inexact matching problem could not be implemented on this platform. Hence, performance analysis between HANA and MySQL was made by comparing the execution time of the exact search procedures. Here, HANA was approximately 27 times faster than MySQL which means, that there is a high potential within the new In-Memory concepts, leading to further developments of DNA analysis procedures in the future.
引用
收藏
页码:476 / 480
页数:5
相关论文
共 50 条
  • [31] Detection of tobacco virus by siRNA high-throughput sequencing
    Wang, Haojun
    Wang, Fang
    Yao, Zhongda
    Guo, Dongfeng
    Shu, Junsheng
    Acta Tabacaria Sinica, 2015, 21 (03) : 107 - 110
  • [32] The application of high-throughput sequencing technology in corneal diseases
    Zhao, Jing yi
    He, Yu xi
    Wu, Mei liang
    Wang, Rui qing
    INTERNATIONAL OPHTHALMOLOGY, 2024, 44 (01)
  • [33] Advances, practice, and clinical perspectives in high-throughput sequencing
    Park, S-J
    Saito-Adachi, M.
    Komiyama, Y.
    Nakai, K.
    ORAL DISEASES, 2016, 22 (05) : 353 - 364
  • [34] Environmental bio-monitoring with high-throughput sequencing
    Wang, Jing
    McLenachan, Patricia A.
    Biggs, Patrick J.
    Winder, Linton H.
    Schoenfeld, Barbara I. K.
    Narayan, Vinay V.
    Phiri, Bernard J.
    Lockhart, Peter J.
    BRIEFINGS IN BIOINFORMATICS, 2013, 14 (05) : 575 - 588
  • [35] High-Throughput Sequencing: A Roadmap Toward Community Ecology
    Poisot, Timothee
    Pequin, Berangere
    Gravel, Dominique
    ECOLOGY AND EVOLUTION, 2013, 3 (04): : 1125 - 1139
  • [36] Analysis of Paramyxovirus Transcription and Replication by High-Throughput Sequencing
    Wignall-Fleming, Elizabeth B.
    Hughes, David J.
    Vattipally, Sreenu
    Modha, Sejal
    Goodbourn, Steve
    Davison, Andrew J.
    Randall, Richard E.
    JOURNAL OF VIROLOGY, 2019, 93 (17)
  • [37] Application of high-throughput sequencing technology in dairy product
    Zhang, Heping
    Zheng, Yi
    Journal of Chinese Institute of Food Science and Technology, 2015, 15 (03) : 1 - 7
  • [38] The Role of High Performance, Grid and Cloud Computing in High-Throughput Sequencing
    Lightbody, Gaye
    Browne, Fiona
    Zheng, Huiru
    Haberland, Valeriia
    Blayney, Jaine
    2016 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2016, : 890 - 895
  • [39] Reads2Vec: Efficient Embedding of Raw High-Throughput Sequencing Reads Data
    Chourasia, Prakash
    Ali, Sarwan
    Ciccolella, Simone
    Vedova, Gianluca Della
    Patterson, Murray
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2023, 30 (04) : 469 - 491
  • [40] A Pipeline for the Error-Free Identification of Somatic Alu Insertions in High-Throughput Sequencing Data
    Nugmanov, G. A.
    Komkov, A. Y.
    Saliutina, M. V.
    Minervina, A. A.
    Lebedev, Y. B.
    Mamedov, I. Z.
    MOLECULAR BIOLOGY, 2019, 53 (01) : 138 - 146