A survey on improving pattern matching algorithms for biological sequences

被引:8
|
作者
Hamed, Belal A. [1 ]
Ibrahim, Osman Ali Sadek [1 ]
Abd El-Hafeez, Tarek [1 ,2 ]
机构
[1] Minia Univ, Fac Sci, Dept Comp Sci, Al Minya, Egypt
[2] Deraya Univ, Comp Sci Unit, Al Minya, Egypt
来源
关键词
bioinformatics; character comparison; DNA sequences; pattern matching algorithms; string matching;
D O I
10.1002/cpe.7292
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Pattern matching is a highly useful procedure in several stages of the computational pipelines. Furthermore, some research trends in this research domain contributed to growing biological databases and updated them throughout time. This article proposes an comparison and analysis of different algorithms for match equivalent pattern matching like complexity, efficiency, and techniques. Which algorithm is best for which DNA sequence and why? This describes the different algorithms for various activities that include pattern matching as an important aspect of functionality. This article shows that BM, Horspool, ZT, QS, FS, Smith, and SSABS methods employ the bad character preprocessing function. In addition, BM, SSABS, TVSBS, and BRFS methods are using two approaches in the preprocessing stage, which decreases the preprocessing time. Furthermore, KR, QS, SSABS, BRFS, and Shift-Or are not recommended for the long pattern, whereas ZT, FS, d-BM, Raita, and Smith are not recommended for the short pattern. This is because they are time-consuming and certain algorithms, such as ZT and DCPM, use a lot of time and space during the matching and search process, while others, such as d-BM and TSW, save space and time. Although DCPM, BRFS, and QS are quicker than other algorithms, FLPM, PAPM, and LFPM rank highest in terms of complexity time.
引用
收藏
页数:17
相关论文
共 50 条
  • [21] Improving the Efficiency of Exact Two Dimensional On-line Pattern Matching Algorithms
    Kouzinopoulos, Charalarnpos S.
    Margaritis, Konstantincis G.
    PCI 2008: 12TH PAN-HELLENIC CONFERENCE ON INFORMATICS, PROCEEDINGS, 2008, : 232 - 236
  • [22] Optimal pattern matching algorithms
    Didier, Gilles
    JOURNAL OF COMPLEXITY, 2019, 51 : 79 - 109
  • [23] Algorithms for Forest Pattern Matching
    Zhang, Kaizhong
    Zhu, Yunkun
    COMBINATORIAL PATTERN MATCHING, PROCEEDINGS, 2010, 6129 : 1 - 12
  • [24] ALGORITHMS FOR PATTERN-MATCHING
    DAVIES, G
    BOWSHER, S
    SOFTWARE-PRACTICE & EXPERIENCE, 1986, 16 (06): : 575 - 601
  • [25] Practical fast on-line exact pattern matching algorithms for highly similar sequences
    Ben Nsira, Nadia
    Lecroq, Thierry
    Prieur-Gaston, Elise
    PROCEEDINGS 2018 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE (BIBM), 2018, : 1772 - 1777
  • [26] Efficient pattern matching of multidimensional sequences
    Lee, S
    Oh, K
    Kwon, D
    Choi, W
    Hong, J
    Choi, J
    Lee, D
    ROUGH SETS, FUZZY SETS, DATA MINING, AND GRANULAR COMPUTING, PT 2, PROCEEDINGS, 2005, 3642 : 202 - 210
  • [27] Compressed Pattern Matching in Dna Sequences
    Kanchana, N.
    Sarala, S.
    PROCEEDINGS OF 2010 3RD IEEE INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND INFORMATION TECHNOLOGY, VOL 9 (ICCSIT 2010), 2010, : 157 - 160
  • [28] Pattern Matching in Polyphonic Musical Sequences
    Alzamel, Mai
    Iliopoulos, Costas
    Lim, Zara
    Moser-Schwaiger, Marie
    ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS, PT IV, AIAI 2024, 2024, 714 : 139 - 151
  • [29] Fast practical online exact single and multiple pattern matching algorithms in highly similar sequences
    Ben Nsira, Nadia
    Lecroq, Thierry
    Prieur-Gaston, Elise
    INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2019, 22 (01) : 1 - 18
  • [30] AN IMPROVED ALGORITHM FOR MATCHING BIOLOGICAL SEQUENCES
    GOTOH, O
    JOURNAL OF MOLECULAR BIOLOGY, 1982, 162 (03) : 705 - 708