CICERO: A Domain-Specific Architecture for Efficient Regular Expression Matching

被引:13
作者
Parravicini, Daniele [1 ]
Conficconi, Davide [1 ]
Del Sozzo, Emanuele [1 ]
Pilato, Christian [1 ]
Santambrogio, Marco D. [1 ]
机构
[1] Politecn Milan, Milan, Italy
关键词
Domain-specific architecture; regular expressions; non-deterministic automata; energy efficiency; PARALLEL;
D O I
10.1145/3476982
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Regular Expression (RE) matching is a computational kernel used in several applications. Since RE complexity and data volumes are steadily increasing, hardware acceleration is gaining attention also for this problem. Existing approaches have limited flexibility as they require a different implementation for each RE. On the other hand, it is complex to map efficient RE representations like non-deterministic finite-state automata onto software-programmable engines or parallel architectures. In this work, we present CICERO, an end-to-end framework composed of a domain-specific architecture and a companion compilation framework for RE matching. Our solution is suitable for many applications, such as genomics/proteomics and natural language processing. CICERO aims at exploiting the intrinsic parallelism of non-deterministic representations of the REs. CICERO can trade-off accelerators' efficiency and processors' flexibility thanks to its programmable architecture and the compilation framework. We implemented CICERO prototypes on embedded FPGA achieving up to 28.6x and 20.8x more energy efficiency than embedded and mainstream processors, respectively. Since it is a programmable architecture, it can be implemented as a custom ASIC that is orders of magnitude more energy-efficient thanmainstream processors.
引用
收藏
页数:24
相关论文
共 50 条
  • [31] iFPNA: A Flexible and Efficient Deep Learning Processor in 28-nm CMOS Using a Domain-Specific Instruction Set and Reconfigurable Fabric
    Chen, Chixiao
    Liu, Xindi
    Peng, Huwan
    Ding, Hongwei
    Shi, C-j Richard
    IEEE JOURNAL ON EMERGING AND SELECTED TOPICS IN CIRCUITS AND SYSTEMS, 2019, 9 (02) : 346 - 357
  • [32] APPROXIMATE REGULAR EXPRESSION PATTERN-MATCHING WITH CONCAVE GAP PENALTIES
    KNIGHT, JR
    MYERS, EW
    ALGORITHMICA, 1995, 14 (01) : 85 - 121
  • [33] Feature-rich Regular Expression Matching Accelerator for Text Analytics
    Atasu, Kubilay
    JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2016, 85 (03): : 355 - 371
  • [34] A Boyer-Moore-style algorithm for regular expression pattern matching
    Watson, BW
    Watson, RE
    SCIENCE OF COMPUTER PROGRAMMING, 2003, 48 (2-3) : 99 - 117
  • [35] Feature-rich Regular Expression Matching Accelerator for Text Analytics
    Kubilay Atasu
    Journal of Signal Processing Systems, 2016, 85 : 355 - 371
  • [36] Generation of high-performance code based on a domain-specific language for algorithmic skeletons
    Wrede, Fabian
    Rieger, Christoph
    Kuchen, Herbert
    JOURNAL OF SUPERCOMPUTING, 2020, 76 (07) : 5098 - 5116
  • [37] A novel JSON based regular expression language for pattern matching in the internet of things
    Raihan ur Rasool
    Maleeha Najam
    Hafiz Farooq Ahmad
    Hua Wang
    Zahid Anwar
    Journal of Ambient Intelligence and Humanized Computing, 2019, 10 : 1463 - 1481
  • [38] Regular Expression Pattern Matching with Sliding Windows over Probabilistic Event Streams
    Sugiura, Kento
    Ishikawa, Yoshiharu
    2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP), 2019, : 103 - 110
  • [39] Scaling Regular Expression Matching Performance in Parallel Systems through Sampling Techniques
    Ficara, Domenico
    Antichi, Gianni
    Bonelli, Nicola
    Di Pietro, Andrea
    Giordano, Stefano
    Procissi, Gregorio
    Vitucci, Fabio
    2011 IEEE GLOBAL TELECOMMUNICATIONS CONFERENCE (GLOBECOM 2011), 2011,
  • [40] An efficient algorithm for updating regular expression indexes in RDF databases
    Lee, Jinsoo
    Kasperovics, Romans
    Han, Wook-Shin
    Lee, Jeong-Hoon
    Kim, Min Soo
    Cho, Hune
    INTERNATIONAL JOURNAL OF DATA MINING AND BIOINFORMATICS, 2015, 11 (02) : 205 - 222