TECO: A Unified Feature Map Compression Framework Based on Transform and Entropy

被引:0
|
作者
Shi, Yubo [1 ]
Wang, Meiqi [2 ]
Cao, Tianyu [3 ]
Lin, Jun [1 ]
Wang, Zhongfeng [1 ]
机构
[1] Nanjing Univ, Sch Elect Sci & Engn, Nanjing 210023, Peoples R China
[2] Sun Yat Sen Univ, Sch Integrated Circuits, Shenzhen 518107, Peoples R China
[3] Univ Toronto, Dept Comp Sci, Toronto, ON M5S 1A1, Canada
基金
中国国家自然科学基金;
关键词
Deep neural networks; discrete cosine transform; entropy coding; feature map (FM) compression; COMPUTATIONS; ACCELERATOR; SPARSE;
D O I
10.1109/TNNLS.2023.3309667
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The massive memory accesses of feature maps (FMs) in deep neural network (DNN) processors lead to huge power consumption, which becomes a major energy bottleneck of DNN accelerators. In this article, we propose a unified framework named Transform and Entropy-based COmpression (TECO) scheme to efficiently compress FMs with various attributes in DNN inference. We explore, for the first time, the intrinsic unimodal distribution characteristic that widely exists in the frequency domain of various FMs. In addition, a well-optimized hardware-friendly coding scheme is designed, which fully utilizes this remarkable data distribution characteristic to encode and compress the frequency spectrum of different FMs. Furthermore, the information entropy theory is leveraged to develop a novel loss function for improving the compression ratio and to make a fast comparison among different compressors. Extensive experiments are performed on multiple tasks and demonstrate that the proposed TECO achieves compression ratios of 2.31x in ResNet-50 on image classification, 3.47x in UNet on dark image enhancement, and 3.18x in Yolo-v4 on object detection while keeping the accuracy of these models. Compared with the upper limit of the compression ratio for original FMs, the proposed framework achieves the compression ratio improvement of 21%, 157%, and 152% on the above models.
引用
收藏
页码:17856 / 17866
页数:11
相关论文
共 50 条
  • [1] Entropy Optimized Deep Feature Compression
    Niu, Benben
    Cao, Xiaoran
    Wei, Ziwei
    He, Yun
    IEEE SIGNAL PROCESSING LETTERS, 2021, 28 : 324 - 328
  • [2] Memory-Efficient CNN Accelerator Based on Interlayer Feature Map Compression
    Shao, Zhuang
    Chen, Xiaoliang
    Du, Li
    Chen, Lei
    Du, Yuan
    Zhuang, Wei
    Wei, Huadong
    Xie, Chenjia
    Wang, Zhongfeng
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-REGULAR PAPERS, 2022, 69 (02) : 668 - 681
  • [3] A UNIFIED AND COMPLEXITY SCALABLE ENTROPY CODING SCHEME FOR VIDEO COMPRESSION
    Preiss, Matthias
    Marpe, Detlev
    Bross, Benjamin
    George, Valeri
    Kirchhoffer, Heiner
    Tung Nguyen
    Siekmann, Mischa
    Stegemann, Jan
    Wiegand, Thomas
    2012 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP 2012), 2012, : 729 - 732
  • [4] ECG Compression Based on Wavelet Transform and Inter-beat Context Modeling Entropy Coding
    Niu, Zhenyang
    Chen, Jianhua
    ELECTRONIC INFORMATION AND ELECTRICAL ENGINEERING, 2012, 19 : 653 - 656
  • [5] Discrete cosine transform based processing framework for indexing, decomposition and compression of biospeckle data
    Singh, Puneet
    Chatterjee, Amit
    Bhatia, Vimal
    Prakash, Shashi
    LASER PHYSICS, 2020, 30 (07)
  • [6] Color image compression using self organizing feature map
    Diri, B
    Albayrak, S
    PROCEEDINGS OF THE IASTED INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND APPLICATIONS, 2006, : 162 - +
  • [7] Point Cloud Compression Based on Joint Optimization of Graph Transform and Entropy Coding for Efficient Data Broadcasting
    Gao, Pan
    Zhang, Lijuan
    Lei, Lei
    Xiang, Wei
    IEEE TRANSACTIONS ON BROADCASTING, 2023, 69 (03) : 727 - 739
  • [8] Entropy-based transform learning algorithms
    Parthasarathy, Gayatri
    Abhilash, G.
    IET SIGNAL PROCESSING, 2018, 12 (04) : 439 - 446
  • [9] Towards Automatic Model Compression via a Unified Two-Stage Framework
    Chen, Weihan
    Wang, Peisong
    Cheng, Jian
    PATTERN RECOGNITION, 2023, 140
  • [10] Image Compression Based on a Partially Rotated Discrete Cosine Transform With a Principal Orientation
    Lee, Gihwan
    Choe, Yoonsik
    IEEE ACCESS, 2021, 9 : 101773 - 101786