A programmable array processor architecture for flexible approximate string matching algorithms

被引:10
作者
Michailidis, Panagiotis D. [1 ]
Margaritis, Konstantinos G. [1 ]
机构
[1] Univ Macedonia, Dept Appl Informat, Parallel & Distributed Proc Lab, Thessaloniki 54006, Greece
关键词
approximate string matching; array processors architectures; parallel architectures; parallel implementations; VLSI; FPGA;
D O I
10.1016/j.jpdc.2006.10.002
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Approximate string matching problem is a common and often repeated task in information retrieval and bioinformatics. This paper proposes a generic design of a programmable array processor architecture for a wide variety of approximate string matching algorithms to gain high performance at low cost. Further, we describe the architecture of the array and the architecture of the cell in detail in order to efficiently implement for both the preprocessing and searching phases of most string matching algorithms. Further, the architecture performs approximate string matching for complex patterns that contain don't care, complement and classes symbols. We also simulate and evaluate the proposed architecture on a field programmable gate array (FPGA) device using the JHDL tool for synthesis and the Xilinx Foundation tools for mapping, placement, and routing. Finally, our programmable implementation achieves about 8-340 times faster execution than a desktop computer with a Pentium 4 3.5 GHz for all algorithms when the length of the pattern is 1024. (C) 2006 Elsevier Inc. All rights reserved.
引用
收藏
页码:131 / 141
页数:11
相关论文
共 69 条
[1]  
[Anonymous], 2006, RECONFIGURABLE COMPU
[2]   Faster approximate string matching [J].
BaezaYates, R ;
Navarro, G .
ALGORITHMICA, 1999, 23 (02) :127-158
[3]   A NEW APPROACH TO TEXT SEARCHING [J].
BAEZAYATES, R ;
GONNET, GH .
COMMUNICATIONS OF THE ACM, 1992, 35 (10) :74-82
[4]   JHDL - An HDL for reconfigurable systems [J].
Bellows, P ;
Hutchings, B .
IEEE SYMPOSIUM ON FPGAS FOR CUSTOM COMPUTING MACHINES, PROCEEDINGS, 1998, :175-184
[5]   A survey of longest common subsequence algorithms [J].
Bergroth, L ;
Hakonen, H ;
Raita, T .
SPIRE 2000: SEVENTH INTERNATIONAL SYMPOSIUM ON STRING PROCESSING AND INFORMATION RETRIEVAL - PROCEEDINGS, 2000, :39-48
[6]   PARALLEL STRING-MATCHING WITH VARIABLE-LENGTH DONT CARES [J].
BERTOSSI, AA ;
LOGI, F .
JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 1994, 22 (02) :229-234
[7]  
Chen CF, 2001, NEUROUROL URODYNAM, V20, P125, DOI 10.1002/1520-6777(2001)20:1<125::AID-NAU14>3.0.CO
[8]  
2-Y
[9]  
*COMP LTD, 1994, BIOCC INF PACK
[10]   A fast and practical bit-vector algorithm for the Longest Common Subsequence problem [J].
Crochemore, M ;
Iliopoulos, CS ;
Pinzon, YJ ;
Reid, JF .
INFORMATION PROCESSING LETTERS, 2001, 80 (06) :279-285