Block-Dependent Partition Decision for Fast Intra Coding of VVC

被引:5
作者
Peng, Zixiao [1 ]
Shen, Liquan [1 ]
Ding, Qing [2 ]
Dong, Xinchao [3 ]
Zheng, Linru [3 ]
机构
[1] Shanghai Univ, Sch Commun & Informat Engn, Shanghai 200444, Peoples R China
[2] Beihang Univ, Sch Commun & Informat Engn, Beijing 100191, Peoples R China
[3] Agora Lab Inc, Video Algorithm, Santa Clara, CA 95054 USA
基金
中国国家自然科学基金;
关键词
Versatile video coding; intra coding; coding unit partition; quadtree plus multi-type tree; complexity reduction; deep learning; CU DEPTH DECISION; SIZE DECISION;
D O I
10.1109/TCE.2023.3324794
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Aiming at accelerating the intra coding process of versatile video coding (VVC), previous efforts are made to predict the quad-tree plus multi-type tree (QTMT) partition structure, in which the prediction is modeled as a classification procedure. The existing deep learning-based methods usually represent the block partition as a structural output and utilize a single convolutional neural network (CNN) for the prediction of all blocks. However, they take different blocks equally, i.e., one-for-all, which ignores that the blocks with different complexity of partition structures are unevenly distributed and have different prediction difficulties. To address this problem, we propose a novel block-dependent partition decision (BDPD) framework to adaptively process different blocks by networks with different capacities. Specifically, we design a partition homogeneity map (PHM) to represent the QTMT-based block partition, which combines different partition directions to effectively reflect the complexity of the partition structure. On this basis, we propose a class-based prediction to distinguish blocks with different complexity of the partition structure and adopt appropriate FCN models to predict PHM, incorporating block classification and PHM prediction. The blocks are classified into different classes according to their coarse texture and neighboring PHMs. Then different fully convolutional network (FCN) models are utilized to predict PHM for different classes. The FCN models with different capacities are trained in the corresponding class, respectively, which achieves higher performance with less computation on the extremely unbalanced natural video. Finally, an adaptive partition decision based on predicted PHMs is adopted to conduct partition decisions for a better trade-off between rate-distortion performance and encoding complexity. Experimental results show that our approach achieves 45.7%similar to 74.5% encoding complexity reduction with 0.78%similar to 3.38% BD-BR increase, outperforming state-of-the-art approaches.
引用
收藏
页码:277 / 289
页数:13
相关论文
共 47 条
[1]   Tunable VVC Frame Partitioning Based on Lightweight Machine Learning [J].
Amestoy, Thomas ;
Mercat, Alexandre ;
Hamidouche, Wassim ;
Menard, Daniel ;
Bergeron, Cyril .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 :1313-1328
[2]  
Bjontegaard G., 2001, CALCULATION AVERAGE
[3]  
Boyce X. L. J., 2018, JVET common testconditions and software reference configurations
[4]   Overview of the Versatile Video Coding (VVC) Standard and its Applications [J].
Bross, Benjamin ;
Wang, Ye-Kui ;
Ye, Yan ;
Liu, Shan ;
Chen, Jianle ;
Sullivan, Gary J. ;
Ohm, Jens-Rainer .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (10) :3736-3764
[5]   A fast CU size decision algorithm for VVC intra prediction based on support vector machine [J].
Chen, Fen ;
Ren, Yan ;
Peng, Zongju ;
Jiang, Gangyi ;
Cui, Xin .
MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (37-38) :27923-27939
[6]  
Chen K., 2018, PROC 14 IEEE INT C, P1
[7]   Fast HEVC Encoding Decisions Using Data Mining [J].
Correa, Guilherme ;
Assuncao, Pedro A. ;
Agostini, Luciano Volcan ;
da Silva Cruz, Luis A. .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2015, 25 (04) :660-673
[8]  
Dai JF, 2016, ADV NEUR IN, V29
[9]   Fast Intra Mode Decision Algorithm for Versatile Video Coding [J].
Dong, Xinchao ;
Shen, Liquan ;
Yu, Mei ;
Yang, Hao .
IEEE TRANSACTIONS ON MULTIMEDIA, 2022, 24 :400-414
[10]  
Feng A., 2021, PROC IEEE INT C MUL, P1