Dynamic programming based approximation algorithms for sequence alignment with constraints

被引:5
作者
Arslan, AN [1 ]
Egecioglu, Ö
机构
[1] Univ Vermont, Dept Comp Sci, Burlington, VT 05405 USA
[2] Univ Calif Santa Barbara, Dept Comp Sci, Santa Barbara, CA 93106 USA
关键词
local alignment; cyclic sequence comparison; normalized local alignment; length-restricted local alignment; approximation algorithm; dynamic programming; ratio maximization; fractional programming;
D O I
10.1287/ijoc.1040.0097
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Given two sequences X and Y, the classical dynamic programming solution to the local alignment problem searches for two subsequences I subset of or equal to X and J subset of or equal to Y with maximum similarity score under a given scoring scheme. In several applications, variants of this problem arise with different objectives and with length constraints on the subsequences I and J. This constraint can be explicit, such as requiring \I\ + \J\ greater than or equal to t, or \J\ < T, or may be implicit such as in cyclic sequence comparison, or as in the maximization of length-normalized scores, and driven by practical considerations. We present a survey of approximation algorithms for various alignment problems with constraints, and several new approximation algorithms. These approximations are in two distinct senses: In one the constraints are satisfied but the score computed is within a prescribed tolerance of the optimum instead of the exact optimum. In another, the alignment returned is assured to have at least the Optimum score with respect to the given constraints, but the length constraints are satisfied to within a prescribed tolerance from the required values. The algorithms proposed involve applications of techniques from fractional programming and dynamic programming.
引用
收藏
页码:441 / 458
页数:18
相关论文
共 25 条
[1]  
Altschul SF, 1996, METHOD ENZYMOL, V266, P460
[2]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[3]   BASIC LOCAL ALIGNMENT SEARCH TOOL [J].
ALTSCHUL, SF ;
GISH, W ;
MILLER, W ;
MYERS, EW ;
LIPMAN, DJ .
JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) :403-410
[4]  
Arslan A. N., 2002, International Journal of Foundations of Computer Science, V13, P751, DOI 10.1142/S0129054102001436
[5]  
Arslan AN, 2001, LECT NOTES COMPUT SC, V2108, P111
[6]   A new approach to sequence comparison:: normalired sequence alignment [J].
Arslan, AN ;
Egecioglu, Ö ;
Pevzner, PA .
BIOINFORMATICS, 2001, 17 (04) :327-337
[7]   APPLICATIONS OF APPROXIMATE STRING-MATCHING TO 2D SHAPE-RECOGNITION [J].
BUNKE, H ;
BUHLER, U .
PATTERN RECOGNITION, 1993, 26 (12) :1797-1812
[8]  
Cormen T. H., 2001, Introduction to Algorithms, V2nd
[9]  
Craven B.D., 1988, Fractional Programming, VVolume 4
[10]   OPTIMAL SEQUENCE ALIGNMENTS [J].
FITCH, WM ;
SMITH, TF .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA-BIOLOGICAL SCIENCES, 1983, 80 (05) :1382-1386