Saliency-Enabled Coding Unit Partitioning and Quantization Control for Versatile Video Coding

被引：4

作者：

Li, Wei ^{[1
]}

Jiang, Xiantao ^{[1
]}

Jin, Jiayuan ^{[1
]}

Song, Tian ^{[2
]}

Yu, Fei Richard ^{[3
]}

机构：

[1] Shanghai Maritime Univ, Dept Informat Engn, 1550 Haigang Ave, Shanghai 201306, Peoples R China

[2] Tokushima Univ, Dept Elect & Elect Engn, 2-24 Shinkura Cho, Tokushima 7708501, Japan

[3] Carleton Univ, Dept Syst & Comp Engn, Ottawa, ON K1S 5B6, Canada

来源：

INFORMATION | 2022年 / 13卷 / 08期

基金：

中国国家自然科学基金;

关键词：

VVC; saliency map; full convolutional network; coding unit partitioning; bitrate reduction; DECISION;

D O I：

10.3390/info13080394

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The latest video coding standard, versatile video coding (VVC), has greatly improved coding efficiency over its predecessor standard high efficiency video coding (HEVC), but at the expense of sharply increased complexity. In the context of perceptual video coding (PVC), the visual saliency model that utilizes the characteristics of the human visual system to improve coding efficiency has become a reliable method due to advances in computer performance and visual algorithms. In this paper, a novel VVC optimization scheme compliant PVC framework is proposed, which consists of fast coding unit (CU) partition algorithm and quantization control algorithm. Firstly, based on the visual saliency model, we proposed a fast CU division scheme, including the redetermination of the CU division depth by calculating Scharr operator and variance, as well as the executive decision for intra sub-partitions (ISP), to reduce the coding complexity. Secondly, a quantization control algorithm is proposed by adjusting the quantization parameter based on multi-level classification of saliency values at the CU level to reduce the bitrate. In comparison with the reference model, experimental results indicate that the proposed method can reduce about 47.19% computational complexity and achieve a bitrate saving of 3.68% on average. Meanwhile, the proposed algorithm has reasonable peak signal-to-noise ratio losses and nearly the same subjective perceptual quality.

引用

页数：24

共 40 条

[1] [Anonymous], 2001, P 13 VCEG M33 M AUST
[2] HEVC-Based Perceptually Adaptive Video Coding Using a DCT-Based Local Distortion Detection Probability Model
Bae, Sung-Ho
Kim, Jaeil
Kim, Munchurl
[J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2016, 25 (07) : 3343 - 3357
[3] Bossen F., 2019, JVETN1010
[4] Bross B., 2019, Joint Video Experts Team (JVET) ITU-T SG
[5] Brox T, 2010, LECT NOTES COMPUT SC, V6315, P282, DOI 10.1007/978-3-642-15555-0_21
[6] Chen J., 2018, JVETK1002V2
[7] Chen J., 2020, JVET T2002
[8] Global Contrast based Salient Region Detection
Cheng, Ming-Ming
Zhang, Guo-Xin
Mitra, Niloy J.
Huang, Xiaolei
Hu, Shi-Min
[J]. 2011 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2011, : 409 - 416
[9] A Fast QTMT Partition Decision Strategy for VVC Intra Prediction
Fan, Yibo
Chen, Jun'An
Sun, Heming
Katto, Jiro
Jing, Ming'E
[J]. IEEE ACCESS, 2020, 8 : 107900 - 107911
[10] Video Saliency Incorporating Spatiotemporal Cues and Uncertainty Weighting
Fang, Yuming
Wang, Zhou
Lin, Weisi
Fang, Zhijun
[J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2014, 23 (09) : 3910 - 3921

← 1 2 3 4 →