VVC/H.266 Intra Mode QTMT Based CU Partition Using CNN

被引:3
|
作者
Javaid, Sameena [1 ]
Rizvi, Safdar [1 ]
Ubaid, Muhammad Talha [2 ]
Tariq, Abdullah [2 ]
机构
[1] Bahria Univ, Sch Engn & Appl Sci, Dept Comp Sci, Karachi Campus, Karachi 75290, Pakistan
[2] Univ Engn & Technol, Natl Ctr Artificial Intelligence, KICS, Lahore 39161, Pakistan
关键词
Encoding; Standards; Random forests; Computational complexity; Streaming media; Convolutional neural networks; Feature extraction; Intra mode decision; VVC; H266; fast coding unit partition; complexity reduction; convolutional neural network; SIZE DECISION;
D O I
10.1109/ACCESS.2022.3164421
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The latest standard for video coding is versatile video coding (VVC) / H.266 which is developed by the joint video exploration team (JVET). Its coding structure is a multi-type tree (MTT) structure, which consists of two types of trees that are Ternary Tree (TT) and Binary Tree (BT). Due to the use of brute force quest for residual rate-distortion the quad-tree and multi-type tree (QTMT) structure of the coding unit (CU) split and contributes over 98% of the encoding time. This structure is efficient in coding, however, increases computational complexity. The current paper proposes a deep learning technique to predict the QTMT based CU split rather than just the brute-force QTMT method to substantially speed up the time of the encoding process for VVC/H.266 intra mode. In the first phase, we developed an extensive database containing ample CU splitting patterns and various streaming videos. In the second phase, we suggest a multi-level exit CNN (MLE-CNN) model with a redundancy removal mechanism at different levels to determine the CU partition. In the third phase, for the training of MLECNN model we have established the adaptive loss function and analyzing the both unknown number of partition modes and the focus on RD cost minimization. Finally, a variable threshold decision system is established to achieve the targeted low complexity and RD performance. Ultimately experimental findings show that VVC/H.266 encoding time has reduced to 69.11% from 47.91% with insignificant bjontegaard delta bit rate (BDBR) to 2.919% from 1.023% which performs better than the existing futuristic and modern approaches.
引用
收藏
页码:37246 / 37256
页数:11
相关论文
共 50 条
  • [1] Fast CU Partition and Intra Mode Decision Method for H.266/VVC
    Zhang, Qiuwen
    Wang, Yihan
    Huang, Lixun
    Jiang, Bin
    IEEE ACCESS, 2020, 8 : 117539 - 117550
  • [2] Fast QTMT for H.266/VVC Intra Prediction using Early-Terminated Hierarchical CNN model
    Xiem HoangVan
    Sang NguyenQuang
    Minh DinhBao
    Minh DoNgoc
    Dinh Trieu Duong
    2021 INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES FOR COMMUNICATIONS (ATC 2021), 2021, : 195 - 200
  • [3] Fast CU Partition Decision Method Based on Texture Characteristics for H.266/VVC
    Zhang, Qiuwen
    Zhao, Yongbo
    Jiang, Bin
    Huang, Lixun
    Wei, Tao
    IEEE ACCESS, 2020, 8 : 203516 - 203524
  • [4] Fast CU Partition Decision Based on Texture for H.266/VVC
    Zhang, Qiuwen
    Cui, Tengyao
    Su, Rijian
    SCIENTIFIC PROGRAMMING, 2021, 2021
  • [5] Fast CU Partition Decision Method Based on Bayes and Improved De-Blocking Filter for H.266/VVC
    Zhang, Qiuwen
    Zhao, Yongbo
    Jiang, Bin
    Wu, Qinggang
    IEEE ACCESS, 2021, 9 (09): : 70382 - 70391
  • [6] Fast CU partition decision for H.266/VVC based on the improved DAG-SVM classifier model
    Zhang, Qiuwen
    Wang, Yihan
    Huang, Lixun
    Jiang, Bin
    Wang, Xiao
    MULTIMEDIA SYSTEMS, 2021, 27 (01) : 1 - 14
  • [7] A Fast QTMT Partition Decision Strategy for VVC Intra Prediction
    Fan, Yibo
    Chen, Jun'An
    Sun, Heming
    Katto, Jiro
    Jing, Ming'E
    IEEE ACCESS, 2020, 8 : 107900 - 107911
  • [8] DeepQTMT: A Deep Learning Approach for Fast QTMT-Based CU Partition of Intra-Mode VVC
    Li, Tianyi
    Xu, Mai
    Tang, Runzhi
    Chen, Ying
    Xing, Qunliang
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 5377 - 5390
  • [9] A fast H.266/QTMT intra coding scheme based on predictions of learned models
    Chen, Jiann-Jone
    Huang, Yu-Huan
    Yu, Han-Yen
    Tsai, Yao-Hong
    JOURNAL OF THE CHINESE INSTITUTE OF ENGINEERS, 2024, 47 (06) : 703 - 718
  • [10] CNN-based ternary tree partition approach for VVC intra-QTMT coding
    Fatma Belghith
    Bouthaina Abdallah
    Sonda Ben Jdidia
    Mohamed Ali Ben Ayed
    Nouri Masmoudi
    Signal, Image and Video Processing, 2024, 18 : 3587 - 3594