An Efficient Algorithm for Local Sequence Alignment

被引:3
作者
Haque, Waqar [1 ]
Aravind, Alex [1 ]
Reddy, Bharath [1 ]
机构
[1] Univ No British Columbia, Comp Sci Program, Prince George, BC V2N 4Z9, Canada
来源
2008 30TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-8 | 2008年
关键词
suffix tree; pairwise sequence alignment; longest common substring; Rosetta dataset; SEARCH; BLAST; TOOL;
D O I
10.1109/IEMBS.2008.4649419
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
DNA pairwise sequence alignment has been a subject of great interest in the past and still evokes large interest. Recent algorithms have either been slow and sensitive or fast and less sensitive. Here, we present a new algorithm which is fast and at the same time relatively sensitive. To increase the speed, we first build a suffix free for both sequences and the alignment is triggered by the maximum matching substring. The algorithm employs in mismatch seeds to increase both sensitivity and speed in the later stages. We tested our algorithm on randomly generated sequences of length up to 500 thousand and used Rosetta dataset to test the sensitivity of the algorithm.
引用
收藏
页码:1367 / 1372
页数:6
相关论文
共 17 条
[1]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[2]  
ALTSCHUL SF, 1990, J MOL BIOL, V215, P403, DOI 10.1006/jmbi.1990.9999
[3]  
ALTSCHUL SF, 1997, NUCLEIC ACIDS RES, V25, P3392
[4]   Human and mouse gene structure: Comparative analysis and application to exon prediction [J].
Batzoglou, S ;
Pachter, L ;
Mesirov, JP ;
Berger, B ;
Lander, ES .
GENOME RESEARCH, 2000, 10 (07) :950-958
[5]  
BRUDNO M, 2002, FAST SENSITIVE ALIGN, P51903
[6]   Alignment of whole genomes [J].
Delcher, AL ;
Kasif, S ;
Fleischmann, RD ;
Peterson, J ;
White, O ;
Salzberg, SL .
NUCLEIC ACIDS RESEARCH, 1999, 27 (11) :2369-2376
[7]  
Kent WJ, 2002, GENOME RES, V12, P656, DOI 10.1101/gr.229202. Article published online before March 2002
[8]   Conservation, regulation, synteny, and introns in a large-scale C-briggsae-C-elegans genomic alignment [J].
Kent, WJ ;
Zahler, AM .
GENOME RESEARCH, 2000, 10 (08) :1115-1125
[9]  
LI M, 2004, J BIOINFORM COMPUT B, P51903
[10]  
LIPMAN D, 1988, P NATL ACAD SCI US, P51903