A convolutional neural network-based rate control algorithm for VVC intra coding

被引:3
作者
Wang, Jiafeng [1 ]
Shang, Xiwu [1 ]
Zhao, Xiaoli [1 ]
Zhang, Yuhuai [2 ]
机构
[1] Shanghai Univ Engn Sci, Sch Elect & Elect Engn, Shanghai 201620, Peoples R China
[2] Peking Univ, Inst Digital Media, Dept Elect Engn & Comp Sci, Beijing 100000, Peoples R China
基金
中国国家自然科学基金;
关键词
H.266/VVC; Intra coding; Rate control; Convolutional Neural Network (CNN); BLIND QUALITY ASSESSMENT; VIDEO;
D O I
10.1016/j.displa.2024.102652
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The Versatile Video Coding (VVC) has shown significant improvements in Rate-Distortion (R-D) performance compared to its predecessor, High Efficiency Video Coding (HEVC). However, it still encounters several challenges. One of these challenges is the efficient allocation of bits among all Coding Tree Units (CTUs). Additionally, there is a lack of prior information for intra-frame coding, particularly for the first frame. After CTU-level bit allocation, only fixed parameters can be used to determine the lambda for CTUs, which does not result in optimal ratedistortion performance. To tackle above challenges, we propose a rate control solution based on Convolutional Neural Network (CNN). This approach utilizes CNN to predict the key parameters alpha and beta in the R-D model, addressing the problem of lacking prior information in intra-frame coding. Subsequently, the predicted alpha and beta values are used to adaptively allocate bits for each CTU. Our proposed algorithm is implemented in VTM-16.0 under Common Test Conditions (CTC). Experimental results show that, compared to the default rate control algorithm in VTM-16.0, our proposed algorithm enhances R-D performance by 0.96% while maintaining rate control accuracy.
引用
收藏
页数:9
相关论文
共 50 条
  • [21] Noise Level Estimation Algorithm Using Convolutional Neural Network-Based Noise Separation Model
    Xu S.
    Liu T.
    Li C.
    Tang Y.
    Hu L.
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2019, 56 (05): : 1060 - 1070
  • [22] Intra mode prediction for H.266/FVC video coding based on convolutional neural network
    Lin, Ting-Lan
    Liang, Kai-Wen
    Huang, Jing-Ya
    Tu, Yu-Liang
    Chang, Pao-Chi
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2020, 71
  • [23] A New Convolutional Neural Network-Based System for NILM Applications
    Ciancetta, Fabrizio
    Bucci, Giovanni
    Fiorucci, Edoardo
    Mari, Simone
    Fioravanti, Andrea
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2021, 70 (70)
  • [24] Convolutional Neural Network-Based Signal Classification in Real Time
    Cetin, Ramazan
    Gecgel, Selen
    Kurt, Gunes Karabulut
    Baskaya, Faik
    IEEE EMBEDDED SYSTEMS LETTERS, 2021, 13 (04) : 186 - 189
  • [25] Inter-Frame Dependency-Based Rate Control for VVC Low-Delay Coding
    Liu, Hewei
    Zhu, Shuyuan
    Zeng, Bing
    IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 2727 - 2731
  • [26] Merging Multiple Template Matching Predictions in Intra Coding with Attentive Convolutional Neural Network
    Wang, Qijun
    Zheng, Guodong
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 1994 - 2001
  • [27] Investigation of a convolutional neural network-based approach for license plate detection
    Cao, Yong
    JOURNAL OF OPTICS-INDIA, 2024, 53 (01): : 697 - 703
  • [28] A Feature Difference Convolutional Neural Network-Based Change Detection Method
    Zhang, Min
    Shi, Wenzhong
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2020, 58 (10): : 7232 - 7246
  • [29] Investigation of a convolutional neural network-based approach for license plate detection
    Yong Cao
    Journal of Optics, 2024, 53 : 697 - 703
  • [30] A Deep Double-Convolutional Neural Network-Based Fault Detection
    Wang, Xiuli
    Li, Zhongxin
    Liang, Jing
    Li, Yang
    2023 IEEE 2ND INDUSTRIAL ELECTRONICS SOCIETY ANNUAL ON-LINE CONFERENCE, ONCON, 2023,