High Performance Pattern Matching on Heterogeneous Platform

被引:5
|
作者
Soroushnia, Shima [1 ]
Daneshtalab, Masoud [1 ]
Plosila, Juha [1 ]
Pahikkala, Tapio [1 ]
Liljeberg, Pasi [1 ]
机构
[1] Univ Turku, Dept Informat Technol, Turku, Finland
来源
JOURNAL OF INTEGRATIVE BIOINFORMATICS | 2014年 / 11卷 / 03期
关键词
D O I
10.2390/biecoll-jib-2014-253
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Pattern discovery is one of the fundamental tasks in bioinformatics and pattern recognition is a powerful technique for searching sequence patterns in the biological sequence databases. Fast and high performance algorithms are highly demanded in many applications in bioinformatics and computational molecular biology since the significant increase in the number of DNA and protein sequences expand the need for raising the performance of pattern matching algorithms. For this purpose, heterogeneous architectures can be a good choice due to their potential for high performance and energy efficiency. In this paper we present an efficient implementation of Aho-Corasick (AC) which is a well known exact pattern matching algorithm with linear complexity, and Parallel Failureless Aho-Corasick (PFAC) algorithm which is the massively parallelized version of AC algorithm without failure transitions, on a heterogeneous CPU/GPU architecture. We progressively redesigned the algorithms and data structures to fit on the GPU architecture. Our results on different protein sequence data sets show that the new implementation runs 15 times faster compared to the original implementation of the PFAC algorithm.
引用
收藏
页数:11
相关论文
共 50 条
  • [21] ANALYZING THE PERFORMANCE DIFFERENCES BETWEEN PATTERN MATCHING AND COMPRESSED PATTERN MATCHING ON TEXTS
    Erdogan, Cihat
    Bulus, H. Nusret
    Diri, Banu
    2013 INTERNATIONAL CONFERENCE ON ELECTRONICS, COMPUTER AND COMPUTATION (ICECCO), 2013, : 135 - 138
  • [22] Tree pattern matching in heterogeneous fuzzy XML databases
    Liu, Jian
    Zhang, X. X.
    Zhang, Lei
    KNOWLEDGE-BASED SYSTEMS, 2017, 122 : 119 - 130
  • [23] High-performance spatiotemporal trajectory matching across heterogeneous data sources
    Gong, Xuri
    Huang, Zhou
    Wang, Yaoli
    Wu, Lun
    Liu, Yu
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2020, 105 : 148 - 161
  • [24] Highly Parallel Sequential Pattern Mining on a Heterogeneous Platform
    Hsieh, Yu-Heng
    Chen, Chun-Chieh
    Shuai, Hong-Han
    Chen, Ming-Syan
    2018 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2018, : 1037 - 1042
  • [25] GraphPi: High Performance Graph Pattern Matching through Effective Redundancy Elimination
    Shi, Tianhui
    Zhai, Mingshu
    Xu, Yi
    Zhai, Jidong
    PROCEEDINGS OF SC20: THE INTERNATIONAL CONFERENCE FOR HIGH PERFORMANCE COMPUTING, NETWORKING, STORAGE AND ANALYSIS (SC20), 2020,
  • [26] A Pattern-Matching Scheme With High Throughput Performance and Low Memory Requirement
    Lee, Tsern-Huei
    Huang, Nai-Lun
    IEEE-ACM TRANSACTIONS ON NETWORKING, 2013, 21 (04) : 1104 - 1116
  • [27] High-performance multi-pattern matching structure in hardware networkfirewall
    Wang, Jie
    Ji, Zhenzhou
    Hu, Mingzeng
    ICIC Express Letters, 2010, 4 (01): : 137 - 142
  • [28] Survey on Feasibility of Pattern Matching Techniques In Heterogeneous Architectures for Bioinformatics
    Pungila, Ciprian
    Galis, Darius
    Negru, Viorel
    2018 20TH INTERNATIONAL SYMPOSIUM ON SYMBOLIC AND NUMERIC ALGORITHMS FOR SCIENTIFIC COMPUTING (SYNASC 2018), 2019, : 368 - 375
  • [29] Pattern Matching Based Sensor Identification Layer for an Android Platform
    Min, Hong
    Kim, Taesik
    Heo, Junyoung
    Cerny, Tomas
    Sankaran, Sriram
    Ahmed, Bestoun S.
    Jung, Jinman
    WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2018,
  • [30] Efficient Pattern Matching on CPU-GPU Heterogeneous Systems
    Sanz, Victoria
    Pousa, Adrian
    Naiouf, Marcelo
    De Giusti, Armando
    ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING (ICA3PP 2019), PT I, 2020, 11944 : 391 - 403