String search in coarse-grained parallel computers

被引:6
作者
Ferragina, P [1 ]
Luccio, F [1 ]
机构
[1] Univ Pisa, Dipartimento Informat, Pisa, Italy
关键词
string matching; distributed data structures; BSP model; parallel computing; computational complexity;
D O I
10.1007/PL00008259
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Given a text string T[1, n], the multistring search problem consists of determining which of k pattern strings {X-t [1, m], X-2[1, m],..., X-k[1, m]}, provided on-line, occur in T. We study this problem in the BSP model [27], and then extend our analysis to other coarse-grained parallel computational models. We refer to the relevant and difficult case of long patterns, that is m greater than or equal to p, where p is the number of available processors, and provide an optimal result with respect to both computation and communication times, attaining a constant number of supersteps. We then study single string search (i.e., k = 1), and show how the multistring search algorithm can be employed to speed up the process and balance the communication cost. The proposed solution takes a constant number of supersteps, and achieves optimal communication time if the string to be searched is longer than p(2). Our results are based on the distribution of a proper data structure among the p processors, to reduce and balance the communication cost. We also indicate how short patterns can be efficiently dealt with, through a completely different algorithmic approach.
引用
收藏
页码:177 / 194
页数:18
相关论文
共 28 条
[1]  
AMIR A, 1991, PROCEEDINGS - 32ND ANNUAL SYMPOSIUM ON FOUNDATIONS OF COMPUTER SCIENCE, P760, DOI 10.1109/SFCS.1991.185445
[2]   PARALLEL CONSTRUCTION OF A SUFFIX TREE WITH APPLICATIONS [J].
APOSTOLICO, A ;
ILIOPOULOS, C ;
LANDAU, GM ;
SCHIEBER, B ;
VISHKIN, U .
ALGORITHMICA, 1988, 3 (03) :347-365
[3]  
Apostolico A., 1985, NATO ASI Series, V12, P85, DOI [DOI 10.1007/978-3-642-82456-2_6, 10.1007/978-3-642-82456-26, DOI 10.1007/978-3-642-82456-26]
[4]  
Arge L., 1997, P 29 STOC, P540, DOI [10.1145/258533.258647, DOI 10.1145/258533.258647]
[5]   Hierarchies of indices for text searching [J].
BaezaYates, R ;
Barbosa, EF ;
Ziviani, N .
INFORMATION SYSTEMS, 1996, 21 (06) :497-514
[6]  
BAUMKER A, 1996, LECT NOTES COMPUTER, V1097, P404
[7]  
Baumker A., 1995, LECT NOTES COMPUTER, V979, P17
[8]  
Clark DR, 1996, PROCEEDINGS OF THE SEVENTH ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS, P383
[9]   Scalable parallel computational geometry for coarse grained multicomputers [J].
Dehne, F ;
Fabri, A ;
RauChaplin, A .
INTERNATIONAL JOURNAL OF COMPUTATIONAL GEOMETRY & APPLICATIONS, 1996, 6 (03) :379-400
[10]  
DEHNE F, 1997, P 9 ACM S PAR ALG AR, P106