Engineering order-preserving pattern matching with SIMD parallelism

被引:10
作者
Chhabra, Tamanna [1 ]
Faro, Simone [2 ]
Kulekci, M. Oguzhan [3 ]
Tarhio, Jorma [1 ]
机构
[1] Aalto Univ, Dept Comp Sci, Espoo, Finland
[2] Univ Catania, Dept Math & Comp Sci, Catania, Italy
[3] Istanbul Tech Univ, Inst Informat, Istanbul, Turkey
关键词
SIMD; SSE; AVX/AVX2; order-preserving pattern matching;
D O I
10.1002/spe.2433
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
The order-preserving pattern matching problem has gained attention in recent years. It consists in finding all substrings in the text, which have the same length and relative order as the input pattern. Typically, the text and the pattern consist of numbers. Since recent times, there has been a tendency to utilize the ability of the word RAM model to increase the efficiency of string matching algorithms. This model works on computer words, reading and processing blocks of characters at once, so that usual arithmetic and logic operations on words can be performed in one unit of time. In this paper, we present a fast order-preserving pattern matching algorithm, which uses specialized word-size packed string matching instructions, grounded on the single instruction multiple data instruction set architecture. We show with experimental results that the new proposed algorithm is more efficient than the previous solutions. (C) 2016 The Authors. Software: Practice and Experience Published by John Wiley & Sons Ltd.
引用
收藏
页码:731 / 739
页数:9
相关论文
共 19 条
  • [1] [Anonymous], 2013, Proceedings of the 15th Meeting on Algorithm Engineering and Experiments, ALENEX 2013, New Orleans, Louisiana
  • [2] ARNDT J., 2009, MATTERS COMPUTATIONA
  • [3] Belazzougui D, 2013, LECT NOTES COMPUT SC, V8283, P66, DOI 10.1007/978-3-642-45030-3_7
  • [4] FAST STRING SEARCHING ALGORITHM
    BOYER, RS
    MOORE, JS
    [J]. COMMUNICATIONS OF THE ACM, 1977, 20 (10) : 762 - 772
  • [5] Cantone D, 2015, PROCEEDINGS OF THE PRAGUE STRINGOLOGY CONFERENCE 2015, P22
  • [6] Charras C, 1998, LECT NOTES COMPUT SC, V1448, P55, DOI 10.1007/BFb0030780
  • [7] Chhabra T, 2015, PROCEEDINGS OF THE PRAGUE STRINGOLOGY CONFERENCE 2015, P36
  • [8] Chhabra T, 2014, LECT NOTES COMPUT SC, V8504, P307, DOI 10.1007/978-3-319-07959-2_26
  • [9] Cho S, 2013, P 7 INT C COMB OPT A, P84, DOI 10.1007/978-3-319-02432-513
  • [10] Improving practical exact string matching
    Durian, Branislav
    Holub, Jan
    Peltola, Hannu
    Tarhio, Jorma
    [J]. INFORMATION PROCESSING LETTERS, 2010, 110 (04) : 148 - 152