VVC/H.266 Intra Mode QTMT Based CU Partition Using CNN

被引:3
|
作者
Javaid, Sameena [1 ]
Rizvi, Safdar [1 ]
Ubaid, Muhammad Talha [2 ]
Tariq, Abdullah [2 ]
机构
[1] Bahria Univ, Sch Engn & Appl Sci, Dept Comp Sci, Karachi Campus, Karachi 75290, Pakistan
[2] Univ Engn & Technol, Natl Ctr Artificial Intelligence, KICS, Lahore 39161, Pakistan
关键词
Encoding; Standards; Random forests; Computational complexity; Streaming media; Convolutional neural networks; Feature extraction; Intra mode decision; VVC; H266; fast coding unit partition; complexity reduction; convolutional neural network; SIZE DECISION;
D O I
10.1109/ACCESS.2022.3164421
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The latest standard for video coding is versatile video coding (VVC) / H.266 which is developed by the joint video exploration team (JVET). Its coding structure is a multi-type tree (MTT) structure, which consists of two types of trees that are Ternary Tree (TT) and Binary Tree (BT). Due to the use of brute force quest for residual rate-distortion the quad-tree and multi-type tree (QTMT) structure of the coding unit (CU) split and contributes over 98% of the encoding time. This structure is efficient in coding, however, increases computational complexity. The current paper proposes a deep learning technique to predict the QTMT based CU split rather than just the brute-force QTMT method to substantially speed up the time of the encoding process for VVC/H.266 intra mode. In the first phase, we developed an extensive database containing ample CU splitting patterns and various streaming videos. In the second phase, we suggest a multi-level exit CNN (MLE-CNN) model with a redundancy removal mechanism at different levels to determine the CU partition. In the third phase, for the training of MLECNN model we have established the adaptive loss function and analyzing the both unknown number of partition modes and the focus on RD cost minimization. Finally, a variable threshold decision system is established to achieve the targeted low complexity and RD performance. Ultimately experimental findings show that VVC/H.266 encoding time has reduced to 69.11% from 47.91% with insignificant bjontegaard delta bit rate (BDBR) to 2.919% from 1.023% which performs better than the existing futuristic and modern approaches.
引用
收藏
页码:37246 / 37256
页数:11
相关论文
共 50 条
  • [31] An Overview of Dedicated Hardware Designs for State-of-the-Art AV1 and H.266/VVC Video Codecs
    Saldanha, Mario
    Correa, Marcel
    Correa, Guilherme
    Palomino, Daniel
    Porto, Marcelo
    Zatt, Bruno
    Agostini, Luciano
    2020 27TH IEEE INTERNATIONAL CONFERENCE ON ELECTRONICS, CIRCUITS AND SYSTEMS (ICECS), 2020,
  • [32] A Deep Transformer-Based Fast CU Partition Approach for Inter-Mode VVC
    Li, Tianyi
    Xu, Mai
    Liu, Zheng
    Chen, Ying
    Li, Kai
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2025, 34 : 1133 - 1148
  • [33] Multi-scale and Bi-path Method Based on Image Entropy and CNN for Fast CU Partition in VVC
    Zhai, Yifan
    Yan, Xiao
    Fan, Yibo
    Ikenaga, Takeshi
    2022 6TH INTERNATIONAL CONFERENCE ON IMAGING, SIGNAL PROCESSING AND COMMUNICATIONS, ICISPC, 2022, : 22 - 26
  • [34] Cross-Block Difference Guided Fast CU Partition for VVC Intra Coding
    Liu, Hewei
    Zhu, Shuyuan
    Xiong, Ruiqin
    Liu, Guanghui
    Zeng, Bing
    2021 INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2021,
  • [35] Texture-Based Fast CU Size Decision and Intra Mode Decision Algorithm for VVC
    Cao, Jian
    Tang, Na
    Wang, Jun
    Liang, Fan
    MULTIMEDIA MODELING (MMM 2020), PT I, 2020, 11961 : 739 - 751
  • [36] A Complexity Reduction Method for VVC Intra Prediction Based on Statistical Analysis and SAE-CNN
    Zhao, Jinchao
    Dai, Pu
    Zhang, Qiuwen
    ELECTRONICS, 2021, 10 (24)
  • [37] Fast prediction mode selection and CU partition for HEVC intra coding
    Fu, Bin
    Zhang, Qiangqing
    Hu, Jianling
    IET IMAGE PROCESSING, 2020, 14 (09) : 1892 - 1900
  • [38] An early CU partition mode decision algorithm in VVC based on variogram for virtual reality 360 degree videos
    Zhang, Mengmeng
    Hou, Yan
    Liu, Zhi
    EURASIP JOURNAL ON IMAGE AND VIDEO PROCESSING, 2023, 2023 (01)
  • [39] An early CU partition mode decision algorithm in VVC based on variogram for virtual reality 360 degree videos
    Mengmeng Zhang
    Yan Hou
    Zhi Liu
    EURASIP Journal on Image and Video Processing, 2023
  • [40] LIGHTWEIGHT CNN-BASED IN-LOOP FILTER FOR VVC INTRA CODING
    Zhang, Hao
    Jung, Cheolkon
    Liu, Yang
    Li, Ming
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 1635 - 1639