Learning-Based QP Initialization for Versatile Video Coding

Citations: 0
Authors
Zhang, Zhentao [1]
Zeng, Hongji [1]
Lin, Jielian [1, 2]
Affiliations
[1] Fuzhou Univ, Fujian Key Lab Intelligent Proc & Wireless Transmi, Fuzhou, Peoples R China
[2] Putian Univ, Sch Mech & Elect, Informat Engn, Putian, Fujian, Peoples R China
Keywords
Bit rate control; residual network; video coding
DOI
10.1561/116.20240029
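Chinese Library Classification number
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Discipline classification code
0808; 0809
Abstract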
Versatile Video Coding (VVC) is a modern video compression standard designed to efficiently encode diverse high-definition video content. It is expected to deliver better compression performance than its predecessor, High Efficiency Video Coding (HEVC). However, bit rate control for VVC still has room for improvement. To address this, the paper proposes a learning-based algorithm that predicts the Quantization Parameter (QP) of the initial frame. The algorithm extracts information from image pixels and maps it to a feature matrix to limit the additional cost it incurs. It also addresses the inaccurate determination of QPs in VVC by building a residual network that progressively represents frame complexity and learns the optimal relationship between QPs and the target bit rate. Experimental results show that the proposed method reduces the control error from 10.74% with the original encoder to 7.19%.
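The abstract reports a control error but does not define it. A common convention in rate control work is the relative deviation of the achieved bit rate from the target, averaged over test runs; the sketch below assumes that convention. The function names and the sample numbers are hypothetical and are not taken from the paper.

```python
from typing import Iterable, Tuple


def rate_control_error(target_kbps: float, actual_kbps: float) -> float:
    """Relative bit rate error in percent: 100 * |R_actual - R_target| / R_target.

    Assumed definition; the paper's exact metric may differ.
    """
    if target_kbps <= 0:
        raise ValueError("target bit rate must be positive")
    return 100.0 * abs(actual_kbps - target_kbps) / target_kbps


def mean_rate_control_error(runs: Iterable[Tuple[float, float]]) -> float:
    """Average the per-run error over (target, actual) bit rate pairs."""
    errors = [rate_control_error(target, actual) for target, actual in runs]
    if not errors:
        raise ValueError("at least one run is required")
    return sum(errors) / len(errors)


if __name__ == "__main__":
    # Hypothetical (target, actual) bit rates in kbps, for illustration only.
    sample_runs = [(1000.0, 1100.0), (2000.0, 1900.0), (500.0, 500.0)]
    print(f"mean control error: {mean_rate_control_error(sample_runs):.2f}%")
```

Under this definition, a lower value means the encoder's output bit rate tracks the target more closely, which is the sense in which a drop from 10.74% to 7.19% is an improvement.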
Pages: 19