Learning-Based QP Initialization for Versatile Video Coding

被引:0
作者
Zhang, Zhentao [1 ]
Zeng, Hongji [1 ]
Lin, Jielian [1 ,2 ]
机构
[1] Fuzhou Univ, Fujian Key Lab Intelligent Proc & Wireless Transmi, Fuzhou, Peoples R China
[2] Putian Univ, Sch Mech & Elect, Informat Engn, Putian, Fujian, Peoples R China
关键词
Bit rate control; residual network; video coding;
D O I
10.1561/116.20240029
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Versatile Video Coding (VVC) is a modern video compression standard designed to efficiently encode high definition video content, regardless of its diversity. It is expected to deliver superior compression performance compared to the previous standard, High Efficiency Video Coding (HEVC). However, the bit rate control problem for VVC can still be improved. To address this issue, a learning-based initial frame Quantization Parameter (QP) prediction algorithm has been proposed in this paper. This algorithm extracts information from image pixels and maps it to a feature matrix to reduce its additional cost. Furthermore, the problem of inaccurate determination of VVC QPs has been addressed by building a residual network to represent the frame complexity progressively and learning the optimal relationship between QPs and the target bit rate. Experimental results show that the proposed method reduces the control error from 10.74% to 7.19% compared to the original encoder.
引用
收藏
页数:19
相关论文
共 32 条
  • [11] Li YM, 2020, IEEE IMAGE PROC, P1176, DOI 10.1109/ICIP40778.2020.9191125
  • [12] Liao J., 2024, IEEE Transactions on Multimedia (TMM)
  • [13] DeepSVC: Deep Scalable Video Coding for Both Machine and Human Vision
    Lin, Hongbin
    Chen, Bolin
    Zhang, Zhichen
    Lin, Jielian
    Wang, Xu
    Zhao, Tiesong
    [J]. PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 9205 - 9214
  • [14] Lin J., 2022, IEEE Transactions on Circuits and Systems for Video Technology (TCSVT)
  • [15] Lin J., 2023, IEEE Transactions on Broadcasting
  • [16] The Future of Video Coding
    Ling, Nam
    Kuo, C. -C. Jay
    Sullivan, Gary J.
    Xu, Dong
    Liu, Shan
    Hang, Hsueh-Ming
    Peng, Wen-Hsiao
    Liu, Jiaying
    [J]. APSIPA TRANSACTIONS ON SIGNAL AND INFORMATION PROCESSING, 2022, 11 (01)
  • [17] Lirong Huang, 2021, Image and Graphics: 11th International Conference, ICIG 2021, Proceedings. Lecture Notes in Computer Science, Image Processing, Computer Vision, Pattern Recognition, and Graphics (12888), P665, DOI 10.1007/978-3-030-87355-4_55
  • [18] Multi-Objective Optimization of Quality in VVC Rate Control for Low-Delay Video Coding
    Liu, Feiyang
    Chen, Zhenzhong
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 30 : 4706 - 4718
  • [19] High Efficiency Rate Control for Versatile Video Coding Based on Composite Cauchy Distribution
    Mao, Yunhao
    Wang, Meng
    Wang, Shiqi
    Kwong, Sam
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (04) : 2371 - 2384
  • [20] McCulloch W.S., 1943, B MATH BIOPHYS, V5, P115, DOI [DOI 10.1007/BF02478259, DOI 10.1007/BF02478259/METRICS]