Faster Approximate String Matching for Short Patterns

被引:2
作者
Bille, Philip [1 ]
机构
[1] Tech Univ Denmark, DK-2800 Lyngby, Denmark
关键词
Algorithms; Approximate string matching; Word level parallelism; ALGORITHM; PARALLEL;
D O I
10.1007/s00224-011-9322-y
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
We study the classical approximate string matching problem, that is, given strings P and Q and an error threshold k, find all ending positions of substrings of Q whose edit distance to P is at most k. Let P and Q have lengths m and n, respectively. On a standard unit-cost word RAM with word size w >= log n we present an algorithm using time O(nk . min(log(2) m/log n, log(2) m log w/w) + n) When P is short, namely, m = 2(o(root logn)) or m = 2(o(root w/logw) this improves the previously best known time bounds for the problem. The result is achieved using a novel implementation of the Landau-Vishkin algorithm based on tabulation and word-level parallelism.
引用
收藏
页码:492 / 515
页数:24
相关论文
共 32 条
[31]   APPROXIMATE STRING-MATCHING USING WITHIN-WORD PARALLELISM [J].
WRIGHT, AH .
SOFTWARE-PRACTICE & EXPERIENCE, 1994, 24 (04) :337-362
[32]   FAST TEXT SEARCHING ALLOWING ERRORS [J].
WU, S ;
MANBER, U .
COMMUNICATIONS OF THE ACM, 1992, 35 (10) :83-91