A survey on improving pattern matching algorithms for biological sequences

被引:8
|
作者
Hamed, Belal A. [1 ]
Ibrahim, Osman Ali Sadek [1 ]
Abd El-Hafeez, Tarek [1 ,2 ]
机构
[1] Minia Univ, Fac Sci, Dept Comp Sci, Al Minya, Egypt
[2] Deraya Univ, Comp Sci Unit, Al Minya, Egypt
来源
关键词
bioinformatics; character comparison; DNA sequences; pattern matching algorithms; string matching;
D O I
10.1002/cpe.7292
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Pattern matching is a highly useful procedure in several stages of the computational pipelines. Furthermore, some research trends in this research domain contributed to growing biological databases and updated them throughout time. This article proposes an comparison and analysis of different algorithms for match equivalent pattern matching like complexity, efficiency, and techniques. Which algorithm is best for which DNA sequence and why? This describes the different algorithms for various activities that include pattern matching as an important aspect of functionality. This article shows that BM, Horspool, ZT, QS, FS, Smith, and SSABS methods employ the bad character preprocessing function. In addition, BM, SSABS, TVSBS, and BRFS methods are using two approaches in the preprocessing stage, which decreases the preprocessing time. Furthermore, KR, QS, SSABS, BRFS, and Shift-Or are not recommended for the long pattern, whereas ZT, FS, d-BM, Raita, and Smith are not recommended for the short pattern. This is because they are time-consuming and certain algorithms, such as ZT and DCPM, use a lot of time and space during the matching and search process, while others, such as d-BM and TSW, save space and time. Although DCPM, BRFS, and QS are quicker than other algorithms, FLPM, PAPM, and LFPM rank highest in terms of complexity time.
引用
收藏
页数:17
相关论文
共 50 条
  • [41] A Survey on Stereo Vision Matching Algorithms
    Zhang, Xiaoxue
    Liu, Zhigang
    2014 11TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA), 2014, : 2026 - 2031
  • [42] Combinatorial Algorithms for Subsequence Matching: A Survey
    Kosche, Maria
    Koss, Tore
    Manea, Florin
    Siemer, Stefan
    ELECTRONIC PROCEEDINGS IN THEORETICAL COMPUTER SCIENCE, 2022, (367): : 11 - 27
  • [43] Parallel algorithms for the analysis of biological sequences
    Mauri, G
    Pavesi, G
    PARALLEL COMPUTING TECHNOLOGIES, 2001, 2127 : 456 - 468
  • [44] A Survey on Map-Matching Algorithms
    Chao, Pingfu
    Xu, Yehong
    Hua, Wen
    Zhou, Xiaofang
    DATABASES THEORY AND APPLICATIONS, ADC 2020, 2020, 12008 : 121 - 133
  • [45] Fast profile matching algorithms - A survey
    Pizzi, Cinzia
    Ukkonen, Esko
    THEORETICAL COMPUTER SCIENCE, 2008, 395 (2-3) : 137 - 157
  • [46] Pattern matching for arc-annotated sequences
    Gramm, J
    Guo, J
    Niedermeier, R
    FST TCS 2002: FOUNDATIONS OF SOFTWARE TECHNOLOGY AND THEOETICAL COMPUTER SCIENCE, PROCEEDINGS, 2002, 2556 : 182 - 193
  • [47] Pattern Matching for Arc-Annotated Sequences
    Gramm, Jens
    Guo, Jiong
    Niedermeier, Rolf
    ACM TRANSACTIONS ON ALGORITHMS, 2006, 2 (01) : 44 - 65
  • [48] RSMA Matching Algorithm for Searching Biological Sequences
    Klaib, Ahmad Fadel
    Osborne, Hugh
    2009 INTERNATIONAL CONFERENCE ON INNOVATIONS IN INFORMATION TECHNOLOGY, 2009, : 190 - 194
  • [49] Pattern Matching Based Algorithms for Graph Compression
    Chatterjee, Amlan
    Shah, Rushabh Jitendrakumar
    Sen, Soumya
    2018 FOURTH IEEE INTERNATIONAL CONFERENCE ON RESEARCH IN COMPUTATIONAL INTELLIGENCE AND COMMUNICATION NETWORKS (ICRCICN), 2018, : 93 - 97
  • [50] DISTRIBUTED ALGORITHMS FOR TREE PATTERN-MATCHING
    SINGH, G
    SMOLKA, SA
    RAMAKRISHNAN, IV
    LECTURE NOTES IN COMPUTER SCIENCE, 1988, 312 : 92 - 107