Saliency-Enabled Coding Unit Partitioning and Quantization Control for Versatile Video Coding

Cited by: 4
Authors
Li, Wei [1]
Jiang, Xiantao [1]
Jin, Jiayuan [1]
Song, Tian [2]
Yu, Fei Richard [3]
Affiliations
[1] Shanghai Maritime Univ, Dept Informat Engn, 1550 Haigang Ave, Shanghai 201306, Peoples R China
[2] Tokushima Univ, Dept Elect & Elect Engn, 2-24 Shinkura Cho, Tokushima 7708501, Japan
[3] Carleton Univ, Dept Syst & Comp Engn, Ottawa, ON K1S 5B6, Canada
Funding
National Natural Science Foundation of China
Keywords
VVC; saliency map; full convolutional network; coding unit partitioning; bitrate reduction; DECISION;
DOI
10.3390/info13080394
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
The latest video coding standard, versatile video coding (VVC), greatly improves coding efficiency over its predecessor, high efficiency video coding (HEVC), but at the expense of sharply increased complexity. In perceptual video coding (PVC), visual saliency models, which exploit the characteristics of the human visual system to improve coding efficiency, have become a reliable approach thanks to advances in computer performance and visual algorithms. This paper proposes a novel VVC optimization scheme compliant with the PVC framework, consisting of a fast coding unit (CU) partition algorithm and a quantization control algorithm. First, based on the visual saliency model, we propose a fast CU partition scheme that reduces coding complexity by redetermining the CU partition depth from the Scharr operator and variance, and by deciding whether to execute intra sub-partitions (ISP). Second, we propose a quantization control algorithm that reduces the bitrate by adjusting the quantization parameter at the CU level according to a multi-level classification of saliency values. Compared with the reference model, experimental results indicate that the proposed method reduces computational complexity by about 47.19% and achieves an average bitrate saving of 3.68%. Meanwhile, the proposed algorithm incurs reasonable peak signal-to-noise ratio losses and maintains nearly the same subjective perceptual quality.
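The two ideas named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract gives no thresholds, no number of saliency levels, and no QP offsets, so `grad_thr`, `var_thr`, `QP_OFFSETS`, and `SALIENT_OFFSET` are hypothetical values chosen only for illustration, and the function names are invented here. The Scharr kernel coefficients and the VVC QP range of 0 to 63 are standard.

```python
from statistics import pvariance

# Standard 3x3 Scharr kernels for horizontal and vertical gradients.
SCHARR_X = ((-3, 0, 3), (-10, 0, 10), (-3, 0, 3))
SCHARR_Y = ((-3, -10, -3), (0, 0, 0), (3, 10, 3))

# Hypothetical saliency levels: (upper bound on mean saliency, QP offset).
# Less salient CUs get a larger QP (coarser quantization, fewer bits).
QP_OFFSETS = ((0.25, 3), (0.5, 1), (0.75, 0))
SALIENT_OFFSET = -2  # hypothetical offset for the most salient level


def mean_scharr_magnitude(block):
    """Mean of |Gx| + |Gy| over the interior pixels of a 2-D luma block."""
    h, w = len(block), len(block[0])
    total, count = 0, 0
    for y in range(1, h - 1):
        for x in range(1, w - 1):
            gx = gy = 0
            for dy in range(3):
                for dx in range(3):
                    p = block[y + dy - 1][x + dx - 1]
                    gx += SCHARR_X[dy][dx] * p
                    gy += SCHARR_Y[dy][dx] * p
            total += abs(gx) + abs(gy)
            count += 1
    return total / count if count else 0.0


def block_variance(block):
    """Population variance of all samples in the block."""
    return pvariance([p for row in block for p in row])


def should_stop_splitting(block, grad_thr=40.0, var_thr=25.0):
    """Hypothetical rule: a flat, low-texture CU is not partitioned further."""
    return (mean_scharr_magnitude(block) < grad_thr
            and block_variance(block) < var_thr)


def adjust_qp(base_qp, mean_saliency):
    """Map a CU's mean saliency in [0, 1] to a QP, clamped to VVC's 0..63."""
    offset = SALIENT_OFFSET
    for upper, level_offset in QP_OFFSETS:
        if mean_saliency < upper:
            offset = level_offset
            break
    return max(0, min(63, base_qp + offset))


flat_cu = [[128] * 8 for _ in range(8)]
edge_cu = [[0] * 4 + [255] * 4 for _ in range(8)]
print(should_stop_splitting(flat_cu), should_stop_splitting(edge_cu))
print(adjust_qp(32, 0.1), adjust_qp(32, 0.9))
```

In this sketch a uniform CU stops splitting early while a CU containing a strong vertical edge does not, and a low-saliency CU receives a higher QP than a highly salient one. How the paper combines the Scharr response, variance, and saliency into the actual depth and ISP decisions is not stated in the abstract.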
Pages: 24