Dependability Improvement for PPM Compressed Data by Using Compression Pattern Matching

被引:0
|
作者
Kitakami, Masato [1 ]
Okura, Toshihiro [2 ]
机构
[1] Chiba Univ, Grad Sch Adv Integrat Sci, Chiba 2638522, Japan
[2] Chiba Univ, Grad Sch Sci & Technol, Chiba 2638522, Japan
来源
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS | 2008年 / E91D卷 / 10期
关键词
dependability; string match; CPM (compression pattern matching); computer virus scan; PPM; context;
D O I
10.1093/ietisy/e91-d.10.2435
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Data compression is popularly applied to computer systems and communication systems in order to reduce storage size and communication time, respectively. Since large data are used frequently, string matching for such data takes a long time. If the data are compressed, the time vets much longer because decompression is necessary. Long string matching time makes computer virus scan time longer and gives serious influence to the security of data. From this. CPM (Compression Pattern Matching) methods for several compression methods have been proposed. This paper proposes CPM method for PPM which achieves fast virus scan and improves dependability of the compressed data. where PPM is based on a Markov model. uses a context information. and achieves a better compression ratio than BW transform and Ziv-Lempel coding, The proposed method encodes the context information, which is generated in the compression process. and appends the encoded data at the beginning of the compressed data as a header. The proposed method use., only the header information. Computer simulation says that augmentation of the compression ratio is less than 5 percent if the order of the PPM is less than 5 and the source file size is more than 1 M bytes. where order is the maximum length of the context used in PPM compression. String matching time is independent of the source file size and is very, short, less than 0.3 micro seconds in the PC used for the simulation.
引用
收藏
页码:2435 / 2439
页数:5
相关论文
共 50 条
  • [1] Fast Pattern Matching in Compressed Data Packages
    Berger, Michael S.
    Mortensen, Brian B.
    2010 IEEE GLOBECOM WORKSHOPS, 2010, : 1591 - 1595
  • [2] Effect of data compression on pattern matching in historical data
    Singhal, A
    Seborg, DE
    INDUSTRIAL & ENGINEERING CHEMISTRY RESEARCH, 2005, 44 (09) : 3203 - 3212
  • [3] Data compression issues with pattern matching in historical data
    Singhal, A
    Seborg, DE
    PROCEEDINGS OF THE 2003 AMERICAN CONTROL CONFERENCE, VOLS 1-6, 2003, : 3696 - 3701
  • [4] A Data Compression Scheme Using Adaptive Word-Based Pattern Matching
    Mowafi, Moad
    Kasasbeh, Dima
    Al-Omari, Faruq
    2017 8TH INTERNATIONAL CONFERENCE ON INFORMATION, INTELLIGENCE, SYSTEMS & APPLICATIONS (IISA), 2017, : 350 - 353
  • [5] A new word-based compression model allowing compressed pattern matching
    Bulus, Halil Nusret
    Carus, Aydin
    Mesut, Altan
    TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2017, 25 (05) : 3607 - 3622
  • [6] Compressed Parameterized Pattern Matching
    Beal, Richard
    Adjeroh, Donald A.
    2013 DATA COMPRESSION CONFERENCE (DCC), 2013, : 461 - 470
  • [7] Compressed pattern matching for SEQUITUR
    Mitarai, S
    Hirao, M
    Matsumoto, T
    Shinohara, A
    Takeda, M
    Arikawa, S
    DCC 2001: DATA COMPRESSION CONFERENCE, PROCEEDINGS, 2001, : 469 - 478
  • [8] Pattern matching and lossy data compression on random fields
    Kontoyiannis, I
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2003, 49 (04) : 1047 - 1051
  • [9] Compressed Consecutive Pattern Matching
    Gawrychowski, Pawel
    Gourdel, Garance
    Starikovskaya, Tatiana
    Steiner, Teresa Anna
    2024 DATA COMPRESSION CONFERENCE, DCC, 2024, : 163 - 172
  • [10] Compressed parameterized pattern matching
    Beal, Richard
    Adjeroh, Donald
    THEORETICAL COMPUTER SCIENCE, 2016, 609 : 129 - 142