Lightweight Huffman Coding for Efficient GPU Compression

被引:2
|
作者
Shah, Milan [1 ]
Yu, Xiaodong [2 ]
Di, Sheng [2 ]
Becchi, Michela [1 ]
Cappello, Franck [2 ]
机构
[1] North Carolina State Univ, Raleigh, NC USA
[2] Argonne Natl Lab, Argonne, IL 60439 USA
来源
PROCEEDINGS OF THE 37TH INTERNATIONAL CONFERENCE ON SUPERCOMPUTING, ACM ICS 2023 | 2023年
基金
美国国家科学基金会;
关键词
compression; Huffman coding; GPU;
D O I
10.1145/3577193.3593736
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Lossy compression is often deployed in scientific applications to reduce data footprint and improve data transfers and I/O performance. Especially for applications requiring on-the-flight compression, it is essential to minimize compression's runtime. In this paper, we design a scheme to improve the performance of cuSZ, a GPU-based lossy compressor. We observe that Huffman coding - used by cuSZ to compress metadata generated during compression - incurs a performance overhead that can be significant, especially for smaller datasets. Our work seeks to reduce the Huffman coding runtime with minimal-to-no impact on cuSZ's compression efficiency. Our contributions are as follows. First, we examine a variety of probability distributions to determine which distributions closely model the input to cuSZ's Huffman coding stage. From these distributions, we create a dictionary of pre-computed codebooks such that during compression, a codebook is selected from the dictionary instead of computing a custom codebook. Second, we explore three codebook selection criteria to be applied at runtime. Finally, we evaluate our scheme on real-world datasets and in the context of two important application use cases, HDF5 and MPI, using an NVIDIA A100 GPU. Our evaluation shows that our method can reduce the Huffman coding penalty by a factor of 78-92x, translating to a total speedup of up to 5x over baseline cuSZ. Smaller HDF5 chunk sizes enjoy over an 8x speedup in compression and MPI messages on the scale of tens of MB have a 1.4-30.5x speedup in communication time.
引用
收藏
页码:99 / 110
页数:12
相关论文
共 50 条
  • [1] Lightweight Data Compression in Wireless Sensor Networks Using Huffman Coding
    Medeiros, Henry Ponti
    Maciel, Marcos Costa
    Souza, Richard Demo
    Pellenz, Marcelo Eduardo
    INTERNATIONAL JOURNAL OF DISTRIBUTED SENSOR NETWORKS, 2014,
  • [2] Compression Using Huffman Coding
    Sharma, Mamta
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2010, 10 (05): : 133 - 141
  • [3] Efficient Test Pattern Compression Techniques Based on Complementary Huffman Coding
    Lu, Shyue-Kung
    Chuang, Hei-Ming
    Lai, Guan-Ying
    Lai, Bi-Ting
    Huang, Ya-Chen
    IEEE CIRCUITS AND SYSTEMS INTERNATIONAL CONFERENCE ON TESTING AND DIAGNOSIS, 2009, : 521 - 524
  • [4] Efficient Coding of Information: Huffman Coding
    Sridhara, Deepak
    RESONANCE-JOURNAL OF SCIENCE EDUCATION, 2006, 11 (02): : 51 - 73
  • [5] Efficient Compression of Secured Images using Subservient Data and Huffman Coding
    Kasmeera, K. S.
    James, Shine P.
    Sreekumar, K.
    1ST GLOBAL COLLOQUIUM ON RECENT ADVANCEMENTS AND EFFECTUAL RESEARCHES IN ENGINEERING, SCIENCE AND TECHNOLOGY - RAEREST 2016, 2016, 25 : 60 - 67
  • [6] Huffman Coding with Gap Arrays for GPU Acceleration
    Yamamoto, Naoya
    Nakano, Koji
    Ito, Yasuaki
    Takafuji, Daisuke
    Kasagi, Akihiko
    Tabaru, Tsuguchika
    PROCEEDINGS OF THE 49TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING, ICPP 2020, 2020,
  • [7] An efficient test vector compression scheme using selective huffman coding
    Jas, A
    Ghosh-Dastidar, J
    Ng, ME
    Touba, NA
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2003, 22 (06) : 797 - 806
  • [8] Efficient coding of information: Huffman coding
    Deepak Sridhara
    Resonance, 2006, 11 (2) : 51 - 73
  • [9] Canonical Huffman Coding for Image Compression
    Khaitu, Shree Ram
    Panday, Sanjeeb Prasad
    PROCEEDINGS ON 2018 IEEE 3RD INTERNATIONAL CONFERENCE ON COMPUTING, COMMUNICATION AND SECURITY (ICCCS), 2018, : 184 - 190
  • [10] VQ Compression Enhancer with Huffman Coding
    Lee, Chin-Feng
    Chang, Chin-Chen
    Zeng, Qun-Feng
    GENETIC AND EVOLUTIONARY COMPUTING, 2018, 579 : 101 - 108