A survey on improving pattern matching algorithms for biological sequences

Cited by: 8
Authors
Hamed, Belal A. [1]
Ibrahim, Osman Ali Sadek [1]
Abd El-Hafeez, Tarek [1, 2]
Affiliations
[1] Minia Univ, Fac Sci, Dept Comp Sci, Al Minya, Egypt
[2] Deraya Univ, Comp Sci Unit, Al Minya, Egypt
Source
Keywords
bioinformatics; character comparison; DNA sequences; pattern matching algorithms; string matching;
DOI
10.1002/cpe.7292
Chinese Library Classification (CLC) number
TP31 [Computer software];
Discipline classification code
081202; 0835;
Abstract
Pattern matching is a highly useful procedure in several stages of computational pipelines. Furthermore, research trends in this domain have contributed to the growth of biological databases and to their updating over time. This article presents a comparison and analysis of different pattern matching algorithms in terms of complexity, efficiency, and techniques. It asks which algorithm is best suited to which DNA sequence, and why. It describes the algorithms used in various activities for which pattern matching is an important aspect of functionality. This article shows that the BM, Horspool, ZT, QS, FS, Smith, and SSABS methods employ the bad-character preprocessing function. In addition, the BM, SSABS, TVSBS, and BRFS methods use two approaches in the preprocessing stage, which decreases the preprocessing time. Furthermore, KR, QS, SSABS, BRFS, and Shift-Or are not recommended for long patterns, whereas ZT, FS, d-BM, Raita, and Smith are not recommended for short patterns, because they are time-consuming in those cases. Certain algorithms, such as ZT and DCPM, use a lot of time and space during the matching and search process, while others, such as d-BM and TSW, save space and time. Although DCPM, BRFS, and QS are quicker than other algorithms, FLPM, PAPM, and LFPM rank highest in terms of time complexity.
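The abstract lists Horspool among the methods that employ bad-character preprocessing. As a rough illustration of that idea, the following is a minimal Python sketch of a textbook Horspool search over a DNA string. It is not taken from the surveyed article; the function names `bad_character_table` and `horspool_search` are hypothetical and chosen here for illustration only.

```python
def bad_character_table(pattern):
    """Build a Horspool-style shift table (illustrative helper, not from the article).

    For every character in the pattern except the last position, store the
    distance from its rightmost occurrence to the end of the pattern.
    Characters absent from the table shift by the full pattern length.
    """
    m = len(pattern)
    # Later (rightmost) occurrences overwrite earlier ones, giving the smallest safe shift.
    return {ch: m - 1 - i for i, ch in enumerate(pattern[:-1])}


def horspool_search(text, pattern):
    """Return the start index of every occurrence of pattern in text."""
    m, n = len(pattern), len(text)
    if m == 0 or m > n:
        return []
    shift = bad_character_table(pattern)
    matches = []
    pos = 0
    while pos <= n - m:
        # Compare the current window with the pattern.
        if text[pos:pos + m] == pattern:
            matches.append(pos)
        # Shift by the table entry for the text character aligned with the pattern's last position.
        pos += shift.get(text[pos + m - 1], m)
    return matches


print(bad_character_table("GATTACA"))          # {'G': 6, 'A': 2, 'T': 3, 'C': 1}
print(horspool_search("ACGTACGTTACG", "ACG"))  # [0, 4, 9]
```

On a four-letter DNA alphabet nearly every character occurs in the pattern, so the table rarely yields the full-length shift. The sketch only shows the mechanism and makes no claim about the performance rankings reported in the abstract.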
Pages: 17
Related papers
50 items in total
  • [1] Simple and Efficient Pattern Matching Algorithms for Biological Sequences
    Neamatollahi, Peyman
    Hadi, Montassir
    Naghibzadeh, Mahmoud
    IEEE ACCESS, 2020, 8 (08): : 23838 - 23846
  • [2] EXPERIMENTAL RESULTS ON MULTIPLE PATTERN MATCHING ALGORITHMS FOR BIOLOGICAL SEQUENCES
    Kouzinopoulos, Charalampos S.
    Michailidis, Panagiotis D.
    Margaritis, Konstantinos G.
    BIOINFORMATICS 2011, 2011, : 274 - 277
  • [3] Quantum-based exact pattern matching algorithms for biological sequences
    Soni, Kapil Kumar
    Rasool, Akhtar
    ETRI JOURNAL, 2021, 43 (03) : 483 - 510
  • [4] PERFORMANCE STUDY OF PARALLEL HYBRID MULTIPLE PATTERN MATCHING ALGORITHMS FOR BIOLOGICAL SEQUENCES
    Kouzinopoulos, Charalampos S.
    Michailidis, Panagiotis D.
    Margaritis, Konstantinos G.
    BIOINFORMATICS: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON BIOINFORMATICS MODELS, METHODS AND ALGORITHMS, 2012, : 182 - 187
  • [5] A Comparative Study of Pattern Matching Algorithms on Sequences
    Min, Fan
    Wu, Xindong
    ROUGH SETS, FUZZY SETS, DATA MINING AND GRANULAR COMPUTING, PROCEEDINGS, 2009, 5908 : 510 - +
  • [6] Comparison of exact string matching algorithms for biological sequences
    Kalsi, Petri
    Peltola, Hannu
    Tarhio, Jorma
    BIOINFORMATICS RESEARCH AND DEVELOPMENT, PROCEEDINGS, 2008, 13 : 417 - 426
  • [7] A fast exact pattern matching algorithm for biological sequences
    Huang, Yong
    Ping, Lingdi
    Pan, Xuezeng
    Cai, Guoyong
    BMEI 2008: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON BIOMEDICAL ENGINEERING AND INFORMATICS, VOL 1, 2008, : 8 - +
  • [8] A new fast technique for pattern matching in biological sequences
    Osman Ali Sadek Ibrahim
    Belal A. Hamed
    Tarek Abd El-Hafeez
    The Journal of Supercomputing, 2023, 79 : 367 - 388
  • [9] A Fast Hybrid Pattern Matching Algorithm for Biological Sequences
    Cai, Guoyong
    Nie, Xining
    Huang, Yong
    PROCEEDINGS OF THE 2009 2ND INTERNATIONAL CONFERENCE ON BIOMEDICAL ENGINEERING AND INFORMATICS, VOLS 1-4, 2009, : 468 - +
  • [10] Kangaroo - A pattern-matching program for biological sequences
    Betel, D
    Hogue, CWV
    BMC BIOINFORMATICS, 2002, 3 (1)