Fcd-cnn: FPGA-based CU depth decision for HEVC intra encoder using CNN

被引:2
作者
Dehnavi, Hossein [1 ]
Dehnavi, Mohammad [1 ]
Klidbary, Sajad Haghzad [2 ]
机构
[1] Kermanshah Univ Technol, Energy Fac, Dept Elect Engn, Kermanshah, Iran
[2] Univ Zanjan, Dept Elect & Comp Engn, Zanjan, Iran
关键词
FPGA; Video compression; Hardware architecture; HEVC;
D O I
10.1007/s11554-024-01487-9
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Video compression for storage and transmission has always been a focal point for researchers in the field of image processing. Their efforts aim to reduce the data volume required for video representation while maintaining its quality. HEVC is one of the efficient standards for video compression, receiving special attention due to the increasing demand for high-resolution videos. The main step in video compression involves dividing the coding unit (CU) blocks into smaller blocks that have a uniform texture. In traditional methods, The Discrete Cosine Transform (DCT) is applied, followed by the use of RDO for decision-making on partitioning. This paper presents a novel convolutional neural network (CNN) and its hardware implementation as an alternative to DCT, aimed at speeding up partitioning and reducing the hardware resources required. The proposed hardware utilizes an efficient and lightweight CNN to partition CUs with low hardware resources in real-time applications. This CNN is trained for different Quantization Parameters (QPs) and block sizes to prevent overfitting. Furthermore, the system's input size is fixed at 16x16\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$16\times 16$$\end{document}, and other input sizes are scaled to this dimension. Loop unrolling, data reuse, and resource sharing are applied in hardware implementation to save resources. The hardware architecture is fixed for all block sizes and QPs, and only the coefficients of the CNN are changed. In terms of compression quality, the proposed hardware achieves a 4.42%\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$4.42\%$$\end{document} BD-BR and -0.19\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$-\,0.19$$\end{document} BD-PSNR compared to HM16.5. The proposed system can process 64x64\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$64\times 64$$\end{document} CU at 150 MHz and in 4914 clock cycles. The hardware resources utilized by the proposed system include 13,141 LUTs, 15,885 Flip-flops, 51 BRAMs, and 74 DSPs.
引用
收藏
页数:10
相关论文
共 50 条
  • [41] CNN-BASED FAST CU PARTITIONING ALGORITHM FOR VVC INTRA CODING
    Xu, Jun
    Wu, Guoqing
    Zhu, Chen
    Huang, Yan
    Song, Li
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 2706 - 2710
  • [42] Fast CU Size Decision Based on Texture Complexity for HEVC Intra Coding
    Hou, Jiangpeng
    Li, Dongmei
    Li, Zhaohui
    Jiang, Xiuhua
    PROCEEDINGS 2013 INTERNATIONAL CONFERENCE ON MECHATRONIC SCIENCES, ELECTRIC ENGINEERING AND COMPUTER (MEC), 2013, : 1096 - 1099
  • [43] Fast CU size and prediction mode decision method for HEVC encoder based on spatial features
    Ramezanpour, Mohammadreza
    Zargari, Farzad
    SIGNAL IMAGE AND VIDEO PROCESSING, 2016, 10 (07) : 1233 - 1240
  • [44] Fast CU size and prediction mode decision method for HEVC encoder based on spatial features
    Mohammadreza Ramezanpour
    Farzad Zargari
    Signal, Image and Video Processing, 2016, 10 : 1233 - 1240
  • [45] Fast CU size decision algorithm using machine learning for HEVC intra coding
    Lee, Dokyung
    Jeong, Jechang
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2018, 62 : 33 - 41
  • [46] The fast intra CU size decision algorithm using gray value range in HEVC
    Ding, Hanqing
    Huang, Xinpeng
    Zhang, Qiuwen
    OPTIK, 2016, 127 (18): : 7155 - 7161
  • [47] Fast Mode and Depth Decision HEVC Intra Prediction Based on Edge Detection and Partitioning Reconfiguration
    Chen, Gaoxing
    Sun, Lei
    Liu, Zhenyu
    Ikenaga, Takeshi
    2013 INTERNATIONAL SYMPOSIUM ON INTELLIGENT SIGNAL PROCESSING AND COMMUNICATIONS SYSTEMS (ISPACS), 2013, : 38 - 41
  • [48] Texture Characteristic based Fast Algorithm for CU Size Decision in HEVC Intra Coding
    Dang Le Dinh Trang
    Kim, KyungRae
    Chang, Ik Joon
    Kim, Jinsang
    PROCEEDINGS OF 2017 7TH INTERNATIONAL CONFERENCE ON INTEGRATED CIRCUITS, DESIGN, AND VERIFICATION (ICDV), 2017, : 82 - 87
  • [49] Fast Mode and Depth Decision Algorithm for HEVC Intra Coding Based on Characteristics of Coding Bits
    Yao, Fangjie
    Zhang, Xiaoyun
    Gao, Zhiyong
    Yang, Bing
    2016 IEEE INTERNATIONAL SYMPOSIUM ON BROADBAND MULTIMEDIA SYSTEMS AND BROADCASTING (BMSB), 2016,
  • [50] Gradient Based Fast CU Splitting and Mode Decision Algorithm for HEVC Intra Prediction
    Guo, Jiefeng
    Chen, Guozhong
    Zheng, Jianwei
    Guo, Donghui
    PROCEEDINGS OF 2014 IEEE INTERNATIONAL CONFERENCE ON ANTI-COUNTERFEITING, SECURITY AND IDENTIFICATION (ASID), 2014, : 122 - 126