APPROXIMATE BOYER-MOORE STRING MATCHING

被引:48
|
作者
TARHIO, J
UKKONEN, E
机构
[1] Univ of Helsinki, Helsinki
关键词
STRING MATCHING; EDIT DISTANCE; BOYER-MOORE ALGORITHM; K-MISMATCHES PROBLEM; K-DIFFERENCES PROBLEM;
D O I
10.1137/0222018
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The Boyer-Moore idea applied in exact string matching is generalized to approximate string matching. Two versions of the problem are considered. The k mismatches problem is to find all approximate occurrences of a pattern string (length m) in a text string (length n) with at most k mismatches. The generalized Boyer-Moore algorithm is shown (under a mild independence assumption) to solve the problem in expected time O(kn(1/(m -k) + (k/c))), where c is the size of the alphabet. A related algorithm is developed for the k differences problem, where the task is to find all approximate occurrences of a pattern in a text with less-than-or-equal-to k differences (insertions, deletions, changes). Experimental evaluation of the algorithms is reported, showing that the new algorithms are often significantly faster than the old ones. Both algorithms are functionally equivalent with the Horspool version of the Boyer-Moore algorithm when k = 0.
引用
收藏
页码:243 / 260
页数:18
相关论文
共 50 条
  • [21] A Boyer-Moore Type Algorithm for Timed Pattern Matching
    Waga, Masaki
    Akazaki, Takumi
    Hasuo, Ichiro
    FORMAL MODELING AND ANALYSIS OF TIMED SYSTEMS, FORMATS 2016, 2016, 9884 : 121 - 139
  • [22] A NEW PROOF OF THE LINEARITY OF THE BOYER-MOORE STRING SEARCHING ALGORITHM
    GUIBAS, LJ
    ODLYZKO, AM
    SIAM JOURNAL ON COMPUTING, 1980, 9 (04) : 672 - 682
  • [23] A New Method to Obtain the shift-table in Boyer-Moore's String Matching Algorithm
    Wang, Yang
    19TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOLS 1-6, 2008, : 3282 - 3285
  • [24] A CORRECT PREPROCESSING ALGORITHM FOR BOYER-MOORE STRING-SEARCHING
    RYTTER, W
    SIAM JOURNAL ON COMPUTING, 1980, 9 (03) : 509 - 512
  • [25] Research on intrusion detection based on Boyer-Moore pattern matching algorithm
    Li, Yulong
    Li, Chenhao
    Jiao, Yang
    Zhao, Guogang
    Liu, Yang
    Zhang, Tian
    PROCEEDINGS OF 2023 7TH INTERNATIONAL CONFERENCE ON ELECTRONIC INFORMATION TECHNOLOGY AND COMPUTER ENGINEERING, EITCE 2023, 2023, : 1490 - 1494
  • [26] On the size of Boyer-Moore automata
    Baeza-Yates, Ricardo
    Bruyere, Veronique
    Delgrange, Olivier
    Scheihing, Rodrigo
    THEORETICAL COMPUTER SCIENCE, 2009, 410 (43) : 4432 - 4443
  • [27] A VARIATION ON THE BOYER-MOORE ALGORITHM
    LECROQ, T
    THEORETICAL COMPUTER SCIENCE, 1992, 92 (01) : 119 - 144
  • [28] ON THE PREPROCESSING ALGORITHM USED IN THE BOYER-MOORE ALGORITHM FOR STRING SEARCHING.
    Semba, Ichiro
    Journal of information processing, 1986, 9 (04) : 228 - 231
  • [29] FASTER STRING SEARCHES - BOYER-MOORE MAY BE THE ALGORITHM YOU NEED
    MENICO, C
    DR DOBBS JOURNAL, 1989, 14 (07): : 74 - 75
  • [30] A RAMSEY THEOREM IN BOYER-MOORE LOGIC
    KUNEN, K
    JOURNAL OF AUTOMATED REASONING, 1995, 15 (02) : 217 - 235