VVC/H.266 Intra Mode QTMT Based CU Partition Using CNN

Cited by: 3

Authors:

Javaid, Sameena [1]

Rizvi, Safdar [1]

Ubaid, Muhammad Talha [2]

Tariq, Abdullah [2]

Affiliations:

[1] Bahria Univ, Sch Engn & Appl Sci, Dept Comp Sci, Karachi Campus, Karachi 75290, Pakistan

[2] Univ Engn & Technol, Natl Ctr Artificial Intelligence, KICS, Lahore 39161, Pakistan

Source:

IEEE ACCESS | 2022 / Vol. 10

Keywords:

Encoding; Standards; Random forests; Computational complexity; Streaming media; Convolutional neural networks; Feature extraction; Intra mode decision; VVC; H266; fast coding unit partition; complexity reduction; convolutional neural network; SIZE DECISION

DOI:

10.1109/ACCESS.2022.3164421

Chinese Library Classification (CLC) number:

TP [Automation Technology, Computer Technology]

Discipline classification code:

0812

Abstract:

The latest video coding standard is versatile video coding (VVC) / H.266, developed by the joint video exploration team (JVET). Its coding structure is a multi-type tree (MTT), which consists of two types of trees: the ternary tree (TT) and the binary tree (BT). Because the encoder runs a brute-force rate-distortion (RD) search over the quad-tree plus multi-type tree (QTMT) structure, the coding unit (CU) partition accounts for over 98% of the encoding time. The structure is efficient in coding but increases computational complexity. This paper proposes a deep learning technique that predicts the QTMT-based CU partition instead of running the brute-force QTMT search, substantially reducing the encoding time of VVC/H.266 intra mode. In the first phase, we develop an extensive database containing ample CU splitting patterns and various streaming videos. In the second phase, we propose a multi-level exit CNN (MLE-CNN) model with a redundancy removal mechanism at different levels to determine the CU partition. In the third phase, we establish an adaptive loss function for training the MLE-CNN model that accounts for both the unknown number of partition modes and the focus on RD cost minimization. Finally, a variable threshold decision scheme is established to reach the targeted trade-off between low complexity and RD performance. Experimental results show that the VVC/H.266 encoding time is reduced by 47.91% to 69.11%, with a small Bjøntegaard delta bit rate (BDBR) increase of 1.023% to 2.919%, which outperforms existing state-of-the-art approaches.
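The abstract does not give the details of the variable threshold decision, so the following is only a minimal sketch of the general idea under stated assumptions: a model outputs one probability per QTMT split mode, and the encoder runs the RD check only on modes whose probability reaches a threshold. The function name `candidate_modes`, the mode labels, and the rule of always keeping the most probable mode are illustrative assumptions, not the authors' method.

```python
# Hypothetical labels for the six QTMT split options of a CU:
# no split, quad-tree, and horizontal/vertical binary and ternary splits.
SPLIT_MODES = ("NO_SPLIT", "QT", "BT_H", "BT_V", "TT_H", "TT_V")


def candidate_modes(probs, threshold):
    """Return the split modes that would still be RD-checked.

    probs:     predicted probability per entry of SPLIT_MODES (assumed model output)
    threshold: value in [0, 1]; a higher value prunes more modes (less encoding
               time, more risk of RD loss), a lower value prunes fewer.
    """
    if len(probs) != len(SPLIT_MODES):
        raise ValueError("expected one probability per split mode")
    if not 0.0 <= threshold <= 1.0:
        raise ValueError("threshold must lie in [0, 1]")

    # Index of the most probable mode; it is always kept so that the
    # candidate list is never empty (an assumption of this sketch).
    best = max(range(len(probs)), key=lambda i: probs[i])

    return [
        mode
        for i, (mode, p) in enumerate(zip(SPLIT_MODES, probs))
        if p >= threshold or i == best
    ]


if __name__ == "__main__":
    example = [0.05, 0.60, 0.20, 0.10, 0.03, 0.02]
    print(candidate_modes(example, 0.15))
```

Raising the threshold shrinks the candidate list and so trades RD performance for speed, which is the kind of complexity/RD trade-off the abstract attributes to the threshold scheme.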


Pages: 37246 - 37256

Number of pages: 11

50 items in total

[1] Fast CU Partition and Intra Mode Decision Method for H.266/VVC
Zhang, Qiuwen
Wang, Yihan
Huang, Lixun
Jiang, Bin
IEEE ACCESS, 2020, 8 : 117539 - 117550
[2] Fast QTMT for H.266/VVC Intra Prediction using Early-Terminated Hierarchical CNN model
Xiem HoangVan
Sang NguyenQuang
Minh DinhBao
Minh DoNgoc
Dinh Trieu Duong
2021 INTERNATIONAL CONFERENCE ON ADVANCED TECHNOLOGIES FOR COMMUNICATIONS (ATC 2021), 2021 : 195 - 200
[3] Fast CU Partition Decision Method Based on Texture Characteristics for H.266/VVC
Zhang, Qiuwen
Zhao, Yongbo
Jiang, Bin
Huang, Lixun
Wei, Tao
IEEE ACCESS, 2020, 8 : 203516 - 203524
[4] Fast CU Partition Decision Based on Texture for H.266/VVC
Zhang, Qiuwen
Cui, Tengyao
Su, Rijian
SCIENTIFIC PROGRAMMING, 2021, 2021
[5] Fast CU Partition Decision Method Based on Bayes and Improved De-Blocking Filter for H.266/VVC
Zhang, Qiuwen
Zhao, Yongbo
Jiang, Bin
Wu, Qinggang
IEEE ACCESS, 2021, 9 (09) : 70382 - 70391
[6] Fast CU partition decision for H.266/VVC based on the improved DAG-SVM classifier model
Zhang, Qiuwen
Wang, Yihan
Huang, Lixun
Jiang, Bin
Wang, Xiao
MULTIMEDIA SYSTEMS, 2021, 27 (01) : 1 - 14
[7] A Fast QTMT Partition Decision Strategy for VVC Intra Prediction
Fan, Yibo
Chen, Jun'An
Sun, Heming
Katto, Jiro
Jing, Ming'E
IEEE ACCESS, 2020, 8 : 107900 - 107911
[8] DeepQTMT: A Deep Learning Approach for Fast QTMT-Based CU Partition of Intra-Mode VVC
Li, Tianyi
Xu, Mai
Tang, Runzhi
Chen, Ying
Xing, Qunliang
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 5377 - 5390
[9] A fast H.266/QTMT intra coding scheme based on predictions of learned models
Chen, Jiann-Jone
Huang, Yu-Huan
Yu, Han-Yen
Tsai, Yao-Hong
JOURNAL OF THE CHINESE INSTITUTE OF ENGINEERS, 2024, 47 (06) : 703 - 718
[10] CNN-based ternary tree partition approach for VVC intra-QTMT coding
Fatma Belghith
Bouthaina Abdallah
Sonda Ben Jdidia
Mohamed Ali Ben Ayed
Nouri Masmoudi
Signal, Image and Video Processing, 2024, 18 : 3587 - 3594
