The efficient algorithm for mapping next generation sequencing reads to reference genome

被引:0
作者
Pankiewicz, Patryk [1 ]
Kusmirek, Wiktor [1 ]
Nowak, Robert M. [1 ]
机构
[1] Warsaw Univ Technol, Inst Comp Sci, Nowowiejska 15-19, PL-00665 Warsaw, Poland
来源
PHOTONICS APPLICATIONS IN ASTRONOMY, COMMUNICATIONS, INDUSTRY, AND HIGH-ENERGY PHYSICS EXPERIMENTS 2019 | 2019年 / 11176卷
关键词
next generation sequencing; k-spectrum; algorithm; reads alignment; C plus; BBMap; SEARCH;
D O I
10.1117/12.2536653
中图分类号
O43 [光学];
学科分类号
070207 ; 0803 ;
摘要
One of the main problem related to genomics is finding similarities between different species represented by DNA sequences. The dynamic programming algorithms (Needleman-Wunsch, Smith-Waterman) give a good measure of similarity, but are not efficient for big data sets. In this study we present the new heuristic algorithm based on common parts of reads. The approach can handle all types of sequencing errors: insertions, deletions and replacements. Our algorithm result is similar to other well known tools. The presented algorithm is implemented in C++, it uses Boost libraries, it internally use threads for parallel computing. This algorithm is a part of the DNA assembler 'dnaasm'. Source code, demo application and supplementary materials are available at project homepage: http://dnaasm.sourceforge.net.
引用
收藏
页数:7
相关论文
共 8 条
  • [1] BASIC LOCAL ALIGNMENT SEARCH TOOL
    ALTSCHUL, SF
    GISH, W
    MILLER, W
    MYERS, EW
    LIPMAN, DJ
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) : 403 - 410
  • [2] [Anonymous], Boost C++ Library
  • [3] Bushnell B., 2014, Tech Rep
  • [4] Linking De Novo Assembly Results with Long DNA Reads Using the dnaasm-link Application
    Kusmirek, Wiktor
    Franus, Wiktor
    Nowak, Robert
    [J]. BIOMED RESEARCH INTERNATIONAL, 2019, 2019
  • [5] De novo assembly of bacterial genomes with repetitive DNA regions by dnaasm application
    Kusmirek, Wiktor
    Nowak, Robert
    [J]. BMC BIOINFORMATICS, 2018, 19
  • [6] A GENERAL METHOD APPLICABLE TO SEARCH FOR SIMILARITIES IN AMINO ACID SEQUENCE OF 2 PROTEINS
    NEEDLEMAN, SB
    WUNSCH, CD
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1970, 48 (03) : 443 - +
  • [7] IDENTIFICATION OF COMMON MOLECULAR SUBSEQUENCES
    SMITH, TF
    WATERMAN, MS
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1981, 147 (01) : 195 - 197
  • [8] Sung Wing-Kin., 2017, Algorithms for Next-generation Sequencing