USEFULNESS OF THE KARP-MILLER-ROSENBERG ALGORITHM IN PARALLEL COMPUTATIONS ON STRINGS AND ARRAYS

被引：35

作者：

CROCHEMORE, M

RYTTER, W

机构：

[1] WARSAW UNIV,INST INFORMAT,PL-00913 WARSAW 59,POLAND

[2] UNIV PARIS 13,DEPT MATH INFORMAT,F-93430 VILLETANEUSE,FRANCE

来源：

THEORETICAL COMPUTER SCIENCE | 1991年 / 88卷 / 01期

关键词：

D O I：

10.1016/0304-3975(91)90073-B

中图分类号：

TP301 [理论、方法];

学科分类号：

081202 ;

摘要：

The Karp-Miller-Rosenberg (1972) algorithm was one of the first efficient (almost linear) sequential algorithms for finding repeated patterns and for string matching. In the area of efficient sequential computations on strings it was soon superseded by more efficient (and more sophisticated) algorithms. We show that the Karp-Miller-Rosenberg algorithm (KMR) must be considered as a basic technique in parallel computations. For many problems, variations of KMR must be considered as a basic technique in parallel computations. For many problems, variations of KMR give the (known) most efficient parallel algorithms. The representation of the set of basic factors (subarrays) of a string (array) produced by the algorithm is an extremely useful data structure in parallel algorithms on strings and arrays. This gives also a general unifying framework for a large variety of problems. We show that the following problems for strings and arrays can be solved by almost optimal parallel algorithms: pattern-matching, longest repeated factor (subarray), longest common factor (subarray), maximal symmetric factor (subarray). Also the following problems for strings can be solved within the same complexity bounds: finding squares, testing even palstars and compositions of k palindromes for k = 2, 3, 4, computing Lyndon factorization and building minimal pattern-matching automata. In the model without concurrent writes the parallel time is O(log(n)2) (with n processors) and in the model with concurrent writes the time, for most of the problems, is O(log(n)) (with n processors). For two problems related to the one-dimensional case (longest repeated factor and longest common factor) there were designed parallel algorithms using suffix trees (Apostolico et al. 1988). However, our data structure is simpler and, furthermore, for the two-dimensional case suffix trees do not work. The complexity of our algorithms does not depend on the size of the alphabet, except for the computation of pattern-matching automata.

引用

页码：59 / 82

页数：24

共 28 条

[1] Aho A. V., 1974, DESIGN ANAL COMPUTER
[2] EFFICIENT STRING MATCHING - AID TO BIBLIOGRAPHIC SEARCH
AHO, AV
CORASICK, MJ
[J]. COMMUNICATIONS OF THE ACM, 1975, 18 (06) : 333 - 340
[3] PARALLEL CONSTRUCTION OF A SUFFIX TREE WITH APPLICATIONS
APOSTOLICO, A
ILIOPOULOS, C
LANDAU, GM
SCHIEBER, B
VISHKIN, U
[J]. ALGORITHMICA, 1988, 3 (03) : 347 - 365
[4] APOSTOLICO A, 1984, RAIRO-INF THEOR APPL, V18, P147
[5] TECHNIQUE FOR EXTENDING RAPID EXACT-MATCH STRING MATCHING TO ARRAYS OF MORE THAN ONE DIMENSION
BAKER, TP
[J]. SIAM JOURNAL ON COMPUTING, 1978, 7 (04) : 533 - 541
[6] Bird R. S., 1977, Information Processing Letters, V6, P168, DOI 10.1016/0020-0190(77)90017-5
[7] BOYER RS, 1977, COMM ACM, V20
[8] COLE R, 1987, F COMPUT SCI
[9] TRANSDUCERS AND REPETITIONS
CROCHEMORE, M
[J]. THEORETICAL COMPUTER SCIENCE, 1986, 45 (01) : 63 - 86
[10] CROCHEMORE M, IN PRESS J ACM

← 1 2 3 →