A survey on improving pattern matching algorithms for biological sequences

被引:8
|
作者
Hamed, Belal A. [1 ]
Ibrahim, Osman Ali Sadek [1 ]
Abd El-Hafeez, Tarek [1 ,2 ]
机构
[1] Minia Univ, Fac Sci, Dept Comp Sci, Al Minya, Egypt
[2] Deraya Univ, Comp Sci Unit, Al Minya, Egypt
来源
关键词
bioinformatics; character comparison; DNA sequences; pattern matching algorithms; string matching;
D O I
10.1002/cpe.7292
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Pattern matching is a highly useful procedure in several stages of the computational pipelines. Furthermore, some research trends in this research domain contributed to growing biological databases and updated them throughout time. This article proposes an comparison and analysis of different algorithms for match equivalent pattern matching like complexity, efficiency, and techniques. Which algorithm is best for which DNA sequence and why? This describes the different algorithms for various activities that include pattern matching as an important aspect of functionality. This article shows that BM, Horspool, ZT, QS, FS, Smith, and SSABS methods employ the bad character preprocessing function. In addition, BM, SSABS, TVSBS, and BRFS methods are using two approaches in the preprocessing stage, which decreases the preprocessing time. Furthermore, KR, QS, SSABS, BRFS, and Shift-Or are not recommended for the long pattern, whereas ZT, FS, d-BM, Raita, and Smith are not recommended for the short pattern. This is because they are time-consuming and certain algorithms, such as ZT and DCPM, use a lot of time and space during the matching and search process, while others, such as d-BM and TSW, save space and time. Although DCPM, BRFS, and QS are quicker than other algorithms, FLPM, PAPM, and LFPM rank highest in terms of complexity time.
引用
收藏
页数:17
相关论文
共 50 条
  • [31] A STUDY OF PATTERN-MATCHING ALGORITHMS
    PIRKLBAUER, K
    STRUCTURED PROGRAMMING, 1992, 13 (02): : 89 - 98
  • [32] Parallel Algorithms for Combinatorial Pattern Matching
    Brimkov, Valentin E.
    COMBINATORIAL IMAGE ANALYSIS, IWCIA 2014, 2014, 8466 : 8 - 16
  • [33] Complexity of sequential pattern matching algorithms
    Régnier, M
    Szpankowski, W
    RANDOMIZATION AND APPROXIMATION TECHNIQUES IN COMPUTER SCIENCE, 1998, 1518 : 187 - 199
  • [34] EFFICIENT PATTERN MATCHING ALGORITHMS IN IDS
    Salve, Vandana B.
    Savalkar, Vishwayogita
    Mhatre, Sonali
    PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON INVENTIVE SYSTEMS AND CONTROL (ICISC 2018), 2018, : 1083 - 1089
  • [35] ALGORITHMS FOR JUMBLED PATTERN MATCHING IN STRINGS
    Burcsi, Peter
    Cicalese, Ferdinando
    Fici, Gabriele
    Liptak, Zsuzsanna
    INTERNATIONAL JOURNAL OF FOUNDATIONS OF COMPUTER SCIENCE, 2012, 23 (02) : 357 - 374
  • [36] Parameterized pattern matching: Algorithms and applications
    Baker, BS
    JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 1996, 52 (01) : 28 - 42
  • [37] Experimenting with pattern-matching algorithms
    Manolopoulos, Y
    Faloutsos, C
    INFORMATION SCIENCES, 1996, 90 (1-4) : 75 - 89
  • [38] A New Approach to Pattern Matching in Degenerate DNA/RNA Sequences and Distributed Pattern Matching
    Iliopoulos, Costas S.
    Mouchard, Laurent
    Rahman, M. Sohel
    MATHEMATICS IN COMPUTER SCIENCE, 2008, 1 (04) : 557 - 569
  • [39] Improving an algorithm for approximate pattern matching
    Navarro, G
    BaezaYates, R
    ALGORITHMICA, 2001, 30 (04) : 473 - 502
  • [40] Improving an Algorithm for Approximate Pattern Matching
    G. Navarro
    R. Baeza-Yates
    Algorithmica, 2001, 30 : 473 - 502