Reusable dynamic programming: Updating sequence alignment

被引:0
作者
Hong, Changjin [1 ]
Tewfik, Ahmed H. [1 ]
机构
[1] Univ Minnesota, Dept Elect & Comp Engn, 200 Union St SE, Minneapolis, MN 55455 USA
来源
2006 IEEE INTERNATIONAL WORKSHOP ON GENOMIC SIGNAL PROCESSING AND STATISTICS | 2006年
关键词
D O I
10.1109/GENSIPS.2006.353154
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Sequence alignment in genomics and proteomics is mostly done via dynamic programming (DP) based approaches. In this work, we show how computational results from DP can be reused to update alignments when analyzing new versions of a sequence. We derive relative tolerance bounds on node distances from a root node that guarantee that partial shortest path distances remain optimal. We then propose an algorithm that uses these bounds to skip all unperturbed parts of a sequence when recomputing an alignment. We also discuss techniques to reduce the memory requirements of the algorithm by focusing on the highly conserved segments of the sequence. Experimental results establish that our proposed alignment procedure can update alignment decisions of modified sequence with 4.6% to 18% of the number of computations required by the normal Needleman-Wunsch algorithm, depending on sequence length. Higher computational savings are achieved with longer sequences.
引用
收藏
页码:57 / +
页数:2
相关论文
共 6 条
[1]   BASIC LOCAL ALIGNMENT SEARCH TOOL [J].
ALTSCHUL, SF ;
GISH, W ;
MILLER, W ;
MYERS, EW ;
LIPMAN, DJ .
JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) :403-410
[2]  
CORMEN TH, 1992, INTRO ALGORITHMS
[3]  
EPPSTEIN D, 1990, LECT NOTES COMPUT SC, V447, P38
[4]   ALGORITHMS FOR LONGEST COMMON SUBSEQUENCE PROBLEM [J].
HIRSCHBERG, DS .
JOURNAL OF THE ACM, 1977, 24 (04) :664-675
[5]  
HONG C, 2005, EUSIPCO
[6]   ARC TOLERANCES IN SHORTEST-PATH AND NETWORK FLOW PROBLEMS [J].
SHIER, DR ;
WITZGALL, C .
NETWORKS, 1980, 10 (04) :277-291