High-Throughput FPGA-Based Hardware Accelerators for Deflate Compression and Decompression Using High-Level Synthesis

被引:20
|
作者
Ledwon, Morgan [1 ]
Cockburn, Bruce F. [1 ]
Han, Jie [1 ]
机构
[1] Univ Alberta, Dept Elect & Comp Engn, Edmonton, AB T6G 2R3, Canada
来源
IEEE ACCESS | 2020年 / 8卷
基金
加拿大自然科学与工程研究理事会;
关键词
Deflate algorithm; lossless compression; LZ77; compression; hardware accelerator; FPGAbased accelerator; high-level synthesis; ALGORITHM;
D O I
10.1109/ACCESS.2020.2984191
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The Deflate compression algorithm provides one of the most widely used solutions for lossless data compression. Field-programmable gate arrays (FPGAs) are commonly used to implement hardware accelerators that speed up computation-intensive applications. In this article, FPGA-based accelerators for Deflate compression and decompression are described. These accelerators were specified in C++ and synthesized using Vivado High-Level Synthesis (HLS) for a Xilinx Virtex UltraScale&+ series FPGA and a system clock frequency of 250 MHz. The proposed compressor processes data at a fixed input throughput of 4.0 GB/s and achieves a geometric mean compression ratio of 1.92 on the Calgary corpus benchmark files using static Huffman encoding. While not the first compressor synthesized using high-level synthesis, our design achieves a 25% greater throughput and an 11% greater compression ratio than the only other published design that uses Vivado HLS. The proposed decompressor design achieves average input throughputs of 196.61 MB/s and 97.40 MB/s, for statically and dynamically encoded Calgary corpus files, respectively. This is the first published decompressor design that is synthesized using high-level synthesis and provides performance that is comparable to that of the best published designs, having static throughputs 11% higher and dynamic throughputs only 10% lower than the expertly-optimized design sold by Xilinx.
引用
收藏
页码:62207 / 62217
页数:11
相关论文
共 50 条
  • [1] High-Level Synthesis Hardware Design for FPGA-Based Accelerators: Models, Methodologies, and Frameworks
    Molina, Romina Soledad
    Gil-Costa, Veronica
    Crespo, Maria Liz
    Ramponi, Giovanni
    IEEE ACCESS, 2022, 10 : 90429 - 90455
  • [2] High-level design tools for FPGA-based combinatorial accelerators
    Sklyarov, V
    Skliarova, I
    Almeida, P
    Almeida, M
    FIELD-PROGRAMMABLE LOGIC AND APPLICATIONS, PROCEEDINGS, 2003, 2778 : 976 - 979
  • [3] High-level synthesis of nonprogrammable hardware accelerators
    Schreiber, Robert
    Aditya, Shail
    Rau, B. Ramakrishna
    Kathail, Vinod
    Mahlke, Scott
    Abraham, Santosh
    Snider, Greg
    HP Laboratories Technical Report, 2000, (31):
  • [4] High-level synthesis of nonprogrammable hardware accelerators
    Schreiber, R
    Aditya, S
    Rau, BR
    Kathail, V
    Mahlke, S
    Abraham, S
    Snider, G
    IEEE INTERNATIONAL CONFERENCE ON APPLICATION-SPECIFIC SYSTEMS, ARCHITECTURES, AND PROCESSORS, PROCEEDINGS, 2000, : 113 - 124
  • [5] Optimization of FPGA-based LDPC decoder using high-level synthesis
    Choi, Geon
    Park, Kyeong-Bin
    Chung, Ki-Seok
    PROCEEDINGS OF THE 4TH INTERNATIONAL CONFERENCE ON COMMUNICATION AND INFORMATION PROCESSING (ICCIP 2018), 2018, : 256 - 259
  • [6] High-level modelling, analysis, and verification on FPGA-based hardware design
    Matousek, P
    Smrcka, A
    Vojnar, T
    CORRECT HARDWARE DESIGN AND VERIFICATION METHODS, PROCEEDINGS, 2005, 3725 : 371 - 375
  • [7] Generating Efficient FPGA-based CNN Accelerators from High-Level Descriptions
    Ali, Nermine
    Philippe, Jean-Marc
    Tain, Benoit
    Coussy, Philippe
    JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2022, 94 (10): : 945 - 960
  • [8] Generating Efficient FPGA-based CNN Accelerators from High-Level Descriptions
    Nermine Ali
    Jean-Marc Philippe
    Benoit Tain
    Philippe Coussy
    Journal of Signal Processing Systems, 2022, 94 : 945 - 960
  • [9] High-throughput parallel DWT hardware architecture implemented on an FPGA-based platform
    Mohammed Shaaban Ibraheem
    Khalil Hachicha
    Syed Zahid Ahmed
    Laurent Lambert
    Patrick Garda
    Journal of Real-Time Image Processing, 2019, 16 : 2043 - 2057
  • [10] High-throughput parallel DWT hardware architecture implemented on an FPGA-based platform
    Ibraheem, Mohammed Shaaban
    Hachicha, Khalil
    Ahmed, Syed Zahid
    Lambert, Laurent
    Garda, Patrick
    JOURNAL OF REAL-TIME IMAGE PROCESSING, 2019, 16 (06) : 2043 - 2057