Sequence alignment using FastLSA

被引:0
作者
Charter, K [1 ]
Schaeffer, J [1 ]
Szafron, D [1 ]
机构
[1] Univ Alberta, Dept Comp Sci, Edmonton, AB T6G 2H1, Canada
来源
METMBS'00: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON MATHEMATICS AND ENGINEERING TECHNIQUES IN MEDICINE AND BIOLOGICAL SCIENCES, VOLS I AND II | 2000年
关键词
sequence alignment; DNA; protein; Hirschberg's algorithm; dynamic programming; linear space;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
For two strings of length m and n (m less than or equal to n), optimal sequence alignment (as a function of the alignment scoring function) tales time and space proportional to m X n to compute. The time actually consists of two parts: computing the score of the best alignment (calculating (m+1)x(n+1) values), and then extracting the alignment (by reading the computed values). The space requirement is usually prohibitive. Hirschberg's algorithm reduces the space needs to roughly 2 x m, but doubles the cost of computing and extracting the alignment. This paper introduces the FastLSA algorithm that is adaptive to the amount of space available. At one extreme, it uses linear space, while at the other it uses quadratic space. Based on the memory resources available, the algorithm saves the maximum amount of information to achieve the lowest extraction cost. The algorithm is shown to be analytically and experimentally superior to Hirschberg's algorithm.
引用
收藏
页码:239 / 245
页数:7
相关论文
共 8 条
  • [1] ALTSCHUL SF, 1990, J MOL BIOL, V215, P403, DOI 10.1006/jmbi.1990.9999
  • [2] Chao K M, 1994, J Comput Biol, V1, P271, DOI 10.1089/cmb.1994.1.271
  • [3] LINEAR SPACE ALGORITHM FOR COMPUTING MAXIMAL COMMON SUBSEQUENCES
    HIRSCHBERG, DS
    [J]. COMMUNICATIONS OF THE ACM, 1975, 18 (06) : 341 - 343
  • [4] KORF R, 1999, COMMUNICATION
  • [5] Korf RE, 1999, IJCAI-99: PROCEEDINGS OF THE SIXTEENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOLS 1 & 2, P1184
  • [6] OPTIMAL ALIGNMENTS IN LINEAR-SPACE
    MYERS, EW
    MILLER, W
    [J]. COMPUTER APPLICATIONS IN THE BIOSCIENCES, 1988, 4 (01): : 11 - 17
  • [7] A GENERAL METHOD APPLICABLE TO SEARCH FOR SIMILARITIES IN AMINO ACID SEQUENCE OF 2 PROTEINS
    NEEDLEMAN, SB
    WUNSCH, CD
    [J]. JOURNAL OF MOLECULAR BIOLOGY, 1970, 48 (03) : 443 - +
  • [8] SMITH TF, 1981, J MOL BIOL, V197, P723