Low-complexity QTMT partition based on deep neural network for Versatile Video Coding

Cited by: 16
Authors
Abdallah, Bouthaina [1]
Belghith, Fatma [1]
Ben Ayed, Mohamed Ali [1]
Masmoudi, Nouri [1]
Affiliations
[1] Univ Sfax, Elect & Informat Technol Lab, Natl Engn Sch Sfax, Sfax, Tunisia
Keywords
Versatile Video Coding (VVC); Nested multi-type tree (QTMT); Coding complexity; Intra partition; Deep neural network
DOI
10.1007/s11760-020-01843-9
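Chinese Library Classification
TM [Electrical Engineering]; TN [Electronics and Communication Technology]
Discipline classification codes
0808; 0809
Abstract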
Versatile Video Coding (VVC), the newest video coding standard, is currently under development. It aims to improve coding performance over its predecessor, High Efficiency Video Coding (HEVC), at the cost of a large increase in coding complexity. The VVC partition structure is based mainly on the quadtree with nested multi-type tree (QTMT) block scheme. This scheme allows more flexible block partitioning and higher encoding efficiency, but it greatly increases coding complexity. To address this issue, a fast QTMT intra partition algorithm is proposed. It uses a deep neural network, named Early Terminated Hierarchical Convolution Neural Network, to predict the quadtree (QT) partition structure of each 64x64 block. The algorithm determines the QTMT partition structure by deciding whether to split or skip each coding unit (CU), which yields the partition of the 128x128 Coding Tree Unit (CTU). Compared with the reference VVC software VTM-3.0, the proposed intra partition method reduces encoding time by up to 32.96% for Ultra High Definition video sequences and by 24.49% on average over all video sequences. This speedup comes with a 4.18% increase in BDBR and a 0.18 dB decrease in BDPSNR.
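The split-or-skip decision described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the 0.5 threshold, the minimum block size of 8, and the toy predictor are all assumptions made for illustration, and the predictor merely stands in for a trained network. The sketch treats a 128x128 CTU as four 64x64 blocks and stops descending as soon as the predictor says not to split, so deeper blocks are never evaluated.

```python
from typing import Callable, List, Tuple

Block = Tuple[int, int, int]  # (x, y, size) of a square block, in pixels
Predictor = Callable[[int, int, int], float]  # returns a split probability


def partition_qt(x: int, y: int, size: int, split_prob: Predictor,
                 min_size: int = 8, threshold: float = 0.5) -> List[Block]:
    """Return the leaf blocks of the quadtree rooted at (x, y, size).

    min_size and threshold are hypothetical parameters, not values
    taken from the paper.
    """
    # Early termination: a "skip" decision ends the recursion here,
    # so no predictor call is made for any descendant block.
    if size <= min_size or split_prob(x, y, size) < threshold:
        return [(x, y, size)]
    half = size // 2
    leaves: List[Block] = []
    for dy in (0, half):
        for dx in (0, half):
            leaves.extend(partition_qt(x + dx, y + dy, half,
                                       split_prob, min_size, threshold))
    return leaves


def partition_ctu(split_prob: Predictor, ctu_size: int = 128) -> List[Block]:
    """Partition a CTU by handling each of its four 64x64 blocks separately."""
    half = ctu_size // 2
    leaves: List[Block] = []
    for y in (0, half):
        for x in (0, half):
            leaves.extend(partition_qt(x, y, half, split_prob))
    return leaves


def toy_split_prob(x: int, y: int, size: int) -> float:
    """Stand-in for a trained network: split only blocks at the CTU origin."""
    return 1.0 if (x == 0 and y == 0 and size > 16) else 0.0


if __name__ == "__main__":
    for block in partition_ctu(toy_split_prob):
        print(block)
```

With the toy predictor, only the top-left 64x64 block is refined; the other three 64x64 blocks terminate after a single decision each, which is where the time saving of an early-terminated scheme would come from.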
Pages: 1153-1160
Number of pages: 8