On the Randomness of Compressed Data

Cited by: 4
Authors
Klein, Shmuel T. [1]
Shapira, Dana [2]
Affiliations
[1] Bar Ilan Univ, Comp Sci Dept, IL-5290002 Ramat Gan, Israel
[2] Ariel Univ, Data Sci & Artificial Intelligence Ctr, Comp Sci Dept, IL-40700 Ariel, Israel
Keywords
data compression; Huffman coding; arithmetic coding; Ziv-Lempel coding; HUFFMAN; ALGORITHM; ACCESS;
DOI
10.3390/info11040196
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
It seems reasonable to expect from a good compression method that its output should not be further compressible, because it should behave essentially like random data. We investigate this premise for a variety of known lossless compression techniques, and find that, surprisingly, there is much variability in the randomness, depending on the chosen method. Arithmetic coding seems to produce perfectly random output, whereas that of Huffman or Ziv-Lempel coding still contains many dependencies. In particular, the output of Huffman coding has already been proven to be random under certain conditions, and we present evidence here that arithmetic coding may produce an output that is identical to that of Huffman.
Pages: 11