Parameterized dictionary matching and recognition with one gap

被引:2
|
作者
Shalom, B. Riva [1 ]
机构
[1] Shenkar Coll, Dept Software Engn, IL-52526 Ramat Gan, Israel
关键词
Parameterized matching; Dictionary matching; Gapped pattern matching; ALGORITHMS; STRINGS; PATTERN; SET;
D O I
10.1016/j.tcs.2020.11.017
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Dictionary Matching is a variant of the Pattern Matching problem where multiple patterns are simultaneously matched to a single text. In case where the patterns contain sequences of don't care symbols, the problem is called Dictionary Matching with Gaps. The problem is related to cyber security, where the patterns represent the malware sequences we want to detect in the text, which may appear in several packets. Another famous variant of Pattern matching is the Parameterized Matching, where two equal-length strings are a parameterized match if there exists a bijection on the alphabets, such that one string matches the other under the bijection. In this paper the problem of Parameterized Dictionary Matching with One Gap is described, which is an extension of the Dictionary Matching with Gaps, where the parameterized match serves as encryption system of the malware sequences. The paper presents two algorithms solving the Parameterized Dictionary Matching with One Gap, for dictionaries with non-uniformly bounded gaps. The first solves the problem with a query time of O (vertical bar T vertical bar delta(max)log(2) d + occ), while the second solution has a query time of O (vertical bar T vertical bar delta(max) + occ), where vertical bar T vertical bar is the size of the text, d is the number of gapped patterns in the dictionary, delta(max) is the difference between the highest upper bound and the lowest lower bound of the gaps and occ is the number of the gapped patterns reported as output. We also suggest the related problem of Parameterized Dictionary Recognition with One Gap, which requires reporting a single parameterized appearance of each gapped pattern. This is of interest in case we want only to know which malware sequences where detected in the text, and not the details of all their appearances in the text, that may be numerous. We present similar algorithms for this problem as well. (C) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页码:1 / 16
页数:16
相关论文
共 50 条
  • [31] Dictionary-based matching graph network for biomedical named entity recognition
    Lou, Yinxia
    Zhu, Xun
    Tan, Kai
    SCIENTIFIC REPORTS, 2023, 13 (01):
  • [32] Dictionary-based matching graph network for biomedical named entity recognition
    Yinxia Lou
    Xun Zhu
    Kai Tan
    Scientific Reports, 13 (1)
  • [33] Internal Dictionary Matching
    Charalampopoulos, Panagiotis
    Kociumaka, Tomasz
    Mohamed, Manal
    Radoszewski, Jakub
    Rytter, Wojciech
    Walen, Tomasz
    ALGORITHMICA, 2021, 83 (07) : 2142 - 2169
  • [34] DYNAMIC DICTIONARY MATCHING
    AMIR, A
    FARACH, M
    GALIL, Z
    GIANCARLO, R
    PARK, K
    JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 1994, 49 (02) : 208 - 222
  • [35] Dictionary Matching in a Stream
    Clifford, Raphael
    Fontaine, Allyx
    Porat, Ely
    Sach, Benjamin
    Starikovskaya, Tatiana
    ALGORITHMS - ESA 2015, 2015, 9294 : 361 - 372
  • [36] Internal Dictionary Matching
    Panagiotis Charalampopoulos
    Tomasz Kociumaka
    Manal Mohamed
    Jakub Radoszewski
    Wojciech Rytter
    Tomasz Waleń
    Algorithmica, 2021, 83 : 2142 - 2169
  • [37] The parameterized complexity of the induced matching problem
    Moser, Hannes
    Sikdar, Somnath
    DISCRETE APPLIED MATHEMATICS, 2009, 157 (04) : 715 - 727
  • [38] A brief history of parameterized matching problems
    Mendivelso, Juan
    Thankachan, Sharma, V
    Pinzon, Yoan
    DISCRETE APPLIED MATHEMATICS, 2020, 274 : 103 - 115
  • [39] Parameterized Cost Volume for Stereo Matching
    Zeng, Jiaxi
    Yao, Chengtang
    Yu, Lidong
    Wu, Yuwei
    Jia, Yunde
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2023), 2023, : 18301 - 18311
  • [40] Two-Dimensional Parameterized Matching
    Cole, Richard
    Hazay, Carmit
    Lewenstein, Moshe
    Tsur, Dekel
    ACM TRANSACTIONS ON ALGORITHMS, 2014, 11 (02) : 1 - 30