On the Randomness of Compressed Data

Cited by: 4
Authors
Klein, Shmuel T. [1]
Shapira, Dana [2]
Affiliations
[1] Bar Ilan Univ, Comp Sci Dept, IL-5290002 Ramat Gan, Israel
[2] Ariel Univ, Data Sci & Artificial Intelligence Ctr, Comp Sci Dept, IL-40700 Ariel, Israel
Keywords
data compression; Huffman coding; arithmetic coding; Ziv-Lempel coding; HUFFMAN; ALGORITHM; ACCESS
DOI
10.3390/info11040196
Chinese Library Classification
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
It seems reasonable to expect from a good compression method that its output should not be further compressible, because it should behave essentially like random data. We investigate this premise for a variety of known lossless compression techniques, and find that, surprisingly, there is much variability in the randomness, depending on the chosen method. Arithmetic coding seems to produce perfectly random output, whereas that of Huffman or Ziv-Lempel coding still contains many dependencies. In particular, the output of Huffman coding has already been proven to be random under certain conditions, and we present evidence here that arithmetic coding may produce an output that is identical to that of Huffman.
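The premise in the abstract, that good compressed output should not be further compressible, can be probed informally with a few lines of Python. The sketch below is only an illustration and not the authors' experimental method: it assumes zlib (DEFLATE, which combines Ziv-Lempel matching with Huffman coding) as a stand-in compressor, and the helper names `byte_entropy` and `recompression_ratio` are hypothetical. Note that the zeroth-order byte entropy used here cannot detect dependencies between bytes, so a value near 8 bits per byte is a necessary but not a sufficient sign of randomness.

```python
import math
import zlib
from collections import Counter


def byte_entropy(data: bytes) -> float:
    """Empirical zeroth-order entropy in bits per byte (maximum is 8.0)."""
    if not data:
        return 0.0
    n = len(data)
    counts = Counter(data)
    return -sum(c / n * math.log2(c / n) for c in counts.values())


def recompression_ratio(data: bytes) -> float:
    """Size of the twice-compressed data divided by the once-compressed size.

    A ratio near or above 1.0 suggests the first pass left little
    redundancy that zlib itself can find; a ratio well below 1.0 means
    the compressed output was still compressible.
    """
    once = zlib.compress(data, 9)   # assumption: zlib as the stand-in compressor
    twice = zlib.compress(once, 9)
    return len(twice) / len(once)


if __name__ == "__main__":
    # Illustrative input only; real experiments would use a text corpus.
    sample = b"It seems reasonable to expect from a good compression method. " * 200
    compressed = zlib.compress(sample, 9)
    print("entropy of input (bits/byte):     ", byte_entropy(sample))
    print("entropy of compressed (bits/byte):", byte_entropy(compressed))
    print("recompression ratio:              ", recompression_ratio(sample))
```

Running the same two measurements on the output of different coders is one simple way to observe the kind of variability the abstract describes, although a proper study would need stronger randomness tests than these.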
Pages: 11