Partial Gated Feedback Recurrent Neural Network for Data Compression Type Classification

被引:3
作者
Song, Hyewon [1 ]
Kwon, Beom [1 ]
Yoo, Hoon [2 ]
Lee, Sanghoon [1 ]
机构
[1] Yonsei Univ, Dept Elect & Elect Engn, Seoul 03722, South Korea
[2] Sangmyung Univ, Dept Elect, Seoul 03016, South Korea
来源
IEEE ACCESS | 2020年 / 8卷 / 08期
关键词
Feature extraction; Compression algorithms; Logic gates; Recurrent neural networks; Encoding; Data mining; Image coding; Compression type classification; deep learning; gated recurrent unit; lossless compression; partial gated feedback recurrent neural network; recurrent neural network; ALGORITHM;
D O I
10.1109/ACCESS.2020.3015493
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Owing to the widespread use of digital devices such as mobile phones and tablet PCs that are capable of easily viewing contents, the number of digital crimes committed using these digital devices has increased. One of the most common digital crimes is to hide the header information of the compressed data, which makes the user's data unusable. It is difficult to restore original data without the header because header contains the compression type. In this paper, we propose a Partial Gated Feedback Recurrent Neural Network (PGF-RNN) for the identification of lossless compression algorithms. We modify the gated recurrent units to improve the correlation of layers by grouping the fully-connected layers to effectively determine the characteristics of the compressed data. We emphasize on the temporal features, which consider a wide range of data, and spatial features from fully-connected layers to extract the feature vectors of each compression type. To improve the performance of the proposed PGF-RNN, we apply post-processing that considers the frequency of bit sequences on some compression types with similar compressed data. The proposed method is evaluated on 31 well-known lossless compression algorithms of the Association for Computational Linguistics dataset. The average top 1 accuracy of the proposed method is 92.63%.
引用
收藏
页码:151426 / 151436
页数:11
相关论文
共 25 条
  • [1] [Anonymous], 2019, ASIAPAC SIGN INFO PR
  • [2] [Anonymous], 2005, INT J FOOD ENG, DOI DOI 10.2202/1556-3758.1031
  • [3] [Anonymous], 2013, PREPRINT ARXIV 1308
  • [4] [Anonymous], 2018, ASIAPAC SIGN INFO PR
  • [5] [Anonymous], 1997, Neural Computation
  • [6] When Deep Learning Meets Metric Learning: Remote Sensing Image Scene Classification via Learning Discriminative CNNs
    Cheng, Gong
    Yang, Ceyuan
    Yao, Xiwen
    Guo, Lei
    Han, Junwei
    [J]. IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2018, 56 (05): : 2811 - 2821
  • [7] Cho K., 2014, ARXIV14061078, V1406, P1078
  • [8] Chung J., 2014, ARXIV
  • [9] Chung JY, 2015, PR MACH LEARN RES, V37, P2067
  • [10] OPTIMAL SOURCE CODES FOR GEOMETRICALLY DISTRIBUTED INTEGER ALPHABETS
    GALLAGER, RG
    VANVOORHIS, DC
    [J]. IEEE TRANSACTIONS ON INFORMATION THEORY, 1975, 21 (02) : 228 - 230