A new heuristic for multiple sequence alignment

被引:3
作者
Agrawal, Ankit [1 ]
Khaitan, Siddhartha Kumar [2 ]
机构
[1] Iowa State Univ, Dept Comp Sci, Ames, IA 50011 USA
[2] Iowa State Univ, Dept Elect Engn, Ames, IA 50011 USA
来源
2008 IEEE INTERNATIONAL CONFERENCE ON ELECTRO/INFORMATION TECHNOLOGY | 2008年
关键词
D O I
10.1109/EIT.2008.4554299
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Multiple sequence alignment is an important task in bioinformatics which forms the basis of many other tasks like protein structure prediction, protein function prediction and phylogenetic analysis. An optimal multiple sequence alignment for a given set of sequences is difficult to determine by standard dynamic programming algorithms because of impractically high computational complexity. Therefore, heuristics are commonly used to reduce the alignment construction time, even though the resulting alignment may not be optimal. ClustalW is one of the most popular progressive multiple sequence alignment algorithm where the pairwise sequence alignments are done according to a static guide-tree based on the sequences alone. For a multiple sequence alignment of N sequences, each of length O(n), the ClustalW algorithm takes O(N(2)n(2)) time. In this paper, a modification to the algorithm is proposed which reduces the time complexity of the algorithm to O(N log(2) Nn(2)). The proposed heuristic also makes the alignment process more dynamic as the order of sequences added to the multiple sequence alignment also depends on the already computed multiple sequence alignments.
引用
收藏
页码:215 / +
页数:2
相关论文
共 15 条
[1]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[2]   MULTIPLE SEQUENCE ALIGNMENT WITH HIERARCHICAL-CLUSTERING [J].
CORPET, F .
NUCLEIC ACIDS RESEARCH, 1988, 16 (22) :10881-10890
[3]   Multiple sequence alignment [J].
Edgar, Robert C. ;
Batzoglou, Serafim .
CURRENT OPINION IN STRUCTURAL BIOLOGY, 2006, 16 (03) :368-373
[4]   PROGRESSIVE SEQUENCE ALIGNMENT AS A PREREQUISITE TO CORRECT PHYLOGENETIC TREES [J].
FENG, DF ;
DOOLITTLE, RF .
JOURNAL OF MOLECULAR EVOLUTION, 1987, 25 (04) :351-360
[5]   THE ALIGNMENT OF SETS OF SEQUENCES AND THE CONSTRUCTION OF PHYLETIC TREES - AN INTEGRATED METHOD [J].
HOGEWEG, P ;
HESPER, B .
JOURNAL OF MOLECULAR EVOLUTION, 1984, 20 (02) :175-186
[6]  
MIKHAILOV D, 2001, PERFORMANCE OPTIMIZA
[7]   Recent progress in multiple sequence alignment: a survey [J].
Notredame, C .
PHARMACOGENOMICS, 2002, 3 (01) :131-144
[8]  
Pearson W R, 2000, Methods Mol Biol, V132, P185
[9]  
REZAEI S, 2006, CCECE, P717
[10]   PATTERN-RECOGNITION IN GENETIC SEQUENCES BY MISMATCH DENSITY [J].
SELLERS, PH .
BULLETIN OF MATHEMATICAL BIOLOGY, 1984, 46 (04) :501-514