A parallel hash-based method for local sequence alignment

被引:2
作者
Esmat, Aghaee-Meybodi [1 ]
Amin, Nezarat [2 ]
Sima, Emadi [1 ]
Reza, Ghaffari Mohammad [3 ]
机构
[1] Islamic Azad Univ, Dept Comp Engn, Yazd Branch, Yazd, Iran
[2] Masaryk Univ, Inst Comp Sci, Brno, Czech Republic
[3] Agr Res Educ & Extens Org, Dept Syst Biol, Agr Biotechnol Res Inst Iran, Tehran, Iran
关键词
DNA sequencing; hash table; local alignment; sequence alignment; string matching; READ ALIGNMENT; SEARCH; ACID;
D O I
10.1002/cpe.6568
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Algorithms utilize an index-based aligning strategy, like a hash table, which typically entails the seed-and-extend method and is a time-consuming task. Here, we developed a hash-based search algorithm based on the SSAHA method without the use of seed-and-extend to conduct search and alignment faster than previous methods with multiple processors. In the proposed method by using the overlapping method in query and reference sequences, the accuracy and sensitivity increased. Further, the speed also increased by creating a hash table for the reference sequence when it was placed in the memory. Furthermore, by evaluating three datasets of different sequences in size and volumes, the effect of the created piece lengths as well as the effect of multiple processors on each dataset was evaluated indicating not only appeasing the time issue in alignment but also improving the mapping speed compared to the BLAST and SSAHA algorithms.
引用
收藏
页数:16
相关论文
共 27 条
[1]  
Albrecht F., 2015, ARXIV PREPRINT ARXIV
[2]   BASIC LOCAL ALIGNMENT SEARCH TOOL [J].
ALTSCHUL, SF ;
GISH, W ;
MILLER, W ;
MYERS, EW ;
LIPMAN, DJ .
JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) :403-410
[3]   Short Read Mapping: An Algorithmic Tour [J].
Canzar, Stefan ;
Salzberg, Steven L. .
PROCEEDINGS OF THE IEEE, 2017, 105 (03) :436-458
[4]   HIA: a genome mapper using hybrid index-based sequence alignment [J].
Choi, Jongpill ;
Park, Kiejung ;
Cho, Seong Beom ;
Chung, Myungguen .
ALGORITHMS FOR MOLECULAR BIOLOGY, 2015, 10
[5]   Hardware acceleration of sequence alignment algorithms - An overview [J].
Hasan, Laiq ;
Al-Ars, Zaid ;
Vassiliadis, Stamatis .
2007 INTERNATIONAL CONFERENCE ON DESIGN & TECHNOLOGY OF INTEGRATED SYSTEMS IN NANOSCALE ERA, 2007, :92-+
[6]  
Herve D., 2014, ARXIV PREPRINT ARXIV
[7]  
Langmead B, 2012, NAT METHODS, V9, P357, DOI [10.1038/NMETH.1923, 10.1038/nmeth.1923]
[8]   MOSAIK: A Hash-Based Algorithm for Accurate Next-Generation Sequencing Short-Read Mapping [J].
Lee, Wan-Ping ;
Stromberg, Michael P. ;
Ward, Alistair ;
Stewart, Chip ;
Garrison, Erik P. ;
Marth, Gabor T. .
PLOS ONE, 2014, 9 (03)
[9]   Mapping short DNA sequencing reads and calling variants using mapping quality scores [J].
Li, Heng ;
Ruan, Jue ;
Durbin, Richard .
GENOME RESEARCH, 2008, 18 (11) :1851-1858
[10]   A survey of sequence alignment algorithms for next-generation sequencing [J].
Li, Heng ;
Homer, Nils .
BRIEFINGS IN BIOINFORMATICS, 2010, 11 (05) :473-483