A Fast Coding Unit Partitioning Decision Algorithm for Versatile Video Coding Based on Gradient Feedback Hierarchical Convolutional Neural Network and Light Gradient Boosting Machine Decision Tree

被引:0
|
作者
Liu, Fangmei [1 ]
Wang, Jiyuan [1 ]
Zhang, Qiuwen [1 ]
机构
[1] Zhengzhou Univ Light Ind, Coll Comp Sci & Technol, Zhengzhou 450002, Peoples R China
来源
ELECTRONICS | 2024年 / 13卷 / 24期
基金
中国国家自然科学基金;
关键词
CNN; VVC; DT; CU SIZE DECISION;
D O I
10.3390/electronics13244908
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Video encoding technology is a foundational component in the advancement of modern technological applications. The latest standard in universal video coding, H.266/VVC, features a quad-tree with nested multi-type tree (QTMT) partitioning structure, which represents an improvement over its predecessor, High-Efficiency Video Coding (H.265/HEVC). This configuration facilitates adaptable block segmentation, albeit at the cost of heightened encoding complexity. In view of the aforementioned considerations, this paper puts forth a deep learning-based approach to facilitate CU partitioning, with the aim of supplanting the intricate CU partitioning process observed in the Versatile Video Coding Test Model (VTM). We begin by presenting the Gradient Feedback Hierarchical CNN (GFH-CNN) model, an advanced convolutional neural network derived from the ResNet architecture, enabling the extraction of features from 64 x 64 coding unit (CU) blocks. Following this, a hierarchical network diagram (HND) is crafted to depict the delineation of partition boundaries corresponding to the various levels of the CU block's layered structure. This diagram maps the features extracted by the GFH-CNN model to the partitioning at each level and boundary. In conclusion, a LightGBM-based decision tree classification model (L-DT) is constructed to predict the corresponding partition structure based on the prediction vector output from the GFH-CNN model. Subsequently, any errors in the partitioning results are corrected in accordance with the encoding constraints specified by the VTM, which ultimately determines the final CU block partitioning. The experimental results demonstrate that, in comparison with VTM-10.0, the proposed algorithm achieves a 48.14% reduction in complexity with only a 0.83% increase in bitrate under the top-three configuration, which is negligible. In comparison, the top-two configuration resulted in a higher complexity reduction of 63.78%, although this was accompanied by a 2.08% increase in bitrate. These results demonstrate that, in comparison to existing solutions, our approach provides an optimal balance between encoding efficiency and computational complexity.
引用
收藏
页数:16
相关论文
共 50 条
  • [1] Fast QTMT decision tree for Versatile Video Coding based on deep neural network
    Bouthaina Abdallah
    Fatma Belghith
    Mohamed Ali Ben Ayed
    Nouri Masmoudi
    Multimedia Tools and Applications, 2022, 81 : 42731 - 42747
  • [2] Fast QTMT decision tree for Versatile Video Coding based on deep neural network
    Abdallah, Bouthaina
    Belghith, Fatma
    Ben Ayed, Mohamed Ali
    Masmoudi, Nouri
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (29) : 42731 - 42747
  • [3] An ensemble learning algorithm for machinery fault diagnosis based on convolutional neural network and gradient boosting decision tree
    Zhou, Jing
    Gao, Yang
    Lu, Jianping
    Yin, Chun
    Han, Huan
    Journal of Physics: Conference Series, 2021, 2025 (01):
  • [4] Fast Coding Unit Partition Decision for Intra Prediction in Versatile Video Coding
    Zhang, Menglu
    Chen, Yushi
    Lu, Xin
    Chen, Hao
    Zhang, Ye
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2021, 12888 LNCS : 700 - 711
  • [5] Intrusion Detection Algorithm Based on Convolutional Neural Network and Light Gradient Boosting Machine
    Wang, Qian
    Zhao, Wenfang
    Wei, Xiaoyu
    Ren, Jiadong
    Gao, Yuying
    Zhang, Bing
    INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING, 2022, 32 (08) : 1229 - 1245
  • [6] Fast coding unit size decision based on deep reinforcement learning for versatile video coding
    Zhao, Jinchao
    Wang, Yihan
    Li, Mingying
    Zhang, Qiuwen
    MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (12) : 16371 - 16387
  • [7] Fast coding unit size decision based on deep reinforcement learning for versatile video coding
    Jinchao Zhao
    Yihan Wang
    Mingying Li
    Qiuwen Zhang
    Multimedia Tools and Applications, 2022, 81 : 16371 - 16387
  • [8] Fast coding unit decision method based on coding tree pruning for high efficiency video coding
    Choi, Kiho
    Jang, Euee S.
    OPTICAL ENGINEERING, 2012, 51 (03)
  • [9] Gradient-Based Fast Intra Coding Decision Algorithm for HEVC
    Wang, Yuting
    Cao, Jian
    Wang, Jun
    Liang, Fan
    2019 IEEE 4TH INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING (ICSIP 2019), 2019, : 870 - 874
  • [10] FAST CODING TREE UNIT DEPTH DECISION FOR HIGH EFFICIENCY VIDEO CODING
    Pan, Zhaoqing
    Kwong, Sam
    Zhang, Yun
    Lei, Jianjun
    Yuan, Hui
    2014 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2014, : 3214 - 3218