Parameterized dictionary matching and recognition with one gap

被引:2
|
作者
Shalom, B. Riva [1 ]
机构
[1] Shenkar Coll, Dept Software Engn, IL-52526 Ramat Gan, Israel
关键词
Parameterized matching; Dictionary matching; Gapped pattern matching; ALGORITHMS; STRINGS; PATTERN; SET;
D O I
10.1016/j.tcs.2020.11.017
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Dictionary Matching is a variant of the Pattern Matching problem where multiple patterns are simultaneously matched to a single text. In case where the patterns contain sequences of don't care symbols, the problem is called Dictionary Matching with Gaps. The problem is related to cyber security, where the patterns represent the malware sequences we want to detect in the text, which may appear in several packets. Another famous variant of Pattern matching is the Parameterized Matching, where two equal-length strings are a parameterized match if there exists a bijection on the alphabets, such that one string matches the other under the bijection. In this paper the problem of Parameterized Dictionary Matching with One Gap is described, which is an extension of the Dictionary Matching with Gaps, where the parameterized match serves as encryption system of the malware sequences. The paper presents two algorithms solving the Parameterized Dictionary Matching with One Gap, for dictionaries with non-uniformly bounded gaps. The first solves the problem with a query time of O (vertical bar T vertical bar delta(max)log(2) d + occ), while the second solution has a query time of O (vertical bar T vertical bar delta(max) + occ), where vertical bar T vertical bar is the size of the text, d is the number of gapped patterns in the dictionary, delta(max) is the difference between the highest upper bound and the lowest lower bound of the gaps and occ is the number of the gapped patterns reported as output. We also suggest the related problem of Parameterized Dictionary Recognition with One Gap, which requires reporting a single parameterized appearance of each gapped pattern. This is of interest in case we want only to know which malware sequences where detected in the text, and not the details of all their appearances in the text, that may be numerous. We present similar algorithms for this problem as well. (C) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页码:1 / 16
页数:16
相关论文
共 50 条
  • [21] On Counting Parameterized Matching and Packing
    Liu, Yunlong
    Wang, Jianxin
    FRONTIERS IN ALGORITHMICS, FAW 2016, 2016, 9711 : 125 - 134
  • [22] ALPHABET DEPENDENCE IN PARAMETERIZED MATCHING
    AMIR, A
    FARACH, M
    MUTHUKRISHNAN, S
    INFORMATION PROCESSING LETTERS, 1994, 49 (03) : 111 - 115
  • [23] Compressed Parameterized Pattern Matching
    Beal, Richard
    Adjeroh, Donald A.
    2013 DATA COMPRESSION CONFERENCE (DCC), 2013, : 461 - 470
  • [24] Multiple matching of parameterized patterns
    Idury, RM
    Schaffer, AA
    THEORETICAL COMPUTER SCIENCE, 1996, 154 (02) : 203 - 224
  • [25] Sublinear algorithms for parameterized matching
    Salmela, Leena
    Tarhio, Jorma
    COMBINATORIAL PATTERN MATCHING, PROCEEDINGS, 2006, 4009 : 354 - 364
  • [26] Two dimensional parameterized matching
    Hazay, C
    Lewenstein, M
    Tsur, D
    COMBINATORIAL PATTERN MATCHING, PROCEEDINGS, 2005, 3537 : 266 - 279
  • [27] Compressed parameterized pattern matching
    Beal, Richard
    Adjeroh, Donald
    THEORETICAL COMPUTER SCIENCE, 2016, 609 : 129 - 142
  • [28] Efficient parameterized string matching
    Fredriksson, Kimmo
    Mozgovoy, Maxim
    INFORMATION PROCESSING LETTERS, 2006, 100 (03) : 91 - 96
  • [29] UNDERSAMPLED FACE RECOGNITION WITH ONE-PASS DICTIONARY LEARNING
    Wei, Chia-Po
    Wang, Yu-Chiang Frank
    2015 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA & EXPO (ICME), 2015,
  • [30] Mitigating Effect of Dictionary Matching Errors in Distantly Supervised Named Entity Recognition
    Kobayashi, Koga
    Wakabayashi, Kei
    22ND INTERNATIONAL CONFERENCE ON INFORMATION INTEGRATION AND WEB-BASED APPLICATIONS & SERVICES (IIWAS2020), 2020, : 111 - 114