Learned Rate Control for Frame-Level Adaptive Neural Video Compression via Dynamic Neural Network

被引:0
作者
Zhang, Chenhao [1 ]
Gao, Wei [1 ,2 ]
机构
[1] Peking Univ, Shenzhen Grad Sch, SECE, Shenzhen, Peoples R China
[2] Peng Cheng Lab, Shenzhen, Peoples R China
来源
COMPUTER VISION - ECCV 2024, PT LXXXV | 2025年 / 15143卷
关键词
Neural Video Compression; Rate Control; Rate-Distortion-Complexity Optimization;
D O I
10.1007/978-3-031-73013-9_14
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Neural Video Compression (NVC) has achieved remarkable performance in recent years. However, precise rate control remains a challenge due to the inherent limitations of learning-based codecs. To solve this issue, we propose a dynamic video compression framework designed for variable bitrate scenarios. First, to achieve variable bitrate implementation, we propose the Dynamic-Route Autoencoder with variable coding routes, each occupying partial computational complexity of the whole network and navigating to a distinct RD trade-off. Second, to approach the target bitrate, the Rate Control Agent estimates the bitrate of each route and adjusts the coding route of DRA at run time. To encompass a broad spectrum of variable bitrates while preserving overall RD performance, we employ the Joint-Routes Optimization strategy, achieving collaborative training of various routes. Extensive experiments on the HEVC and UVG datasets show that the proposed method achieves an average BD-Rate reduction of 14.8% and BD-PSNR gain of 0.47 dB over state-of-the-art methods while maintaining an average bitrate error of 1.66%, achieving Rate-Distortion-Complexity Optimization (RDCO) for various bitrate and bitrate-constrained applications.
引用
收藏
页码:239 / 255
页数:17
相关论文
共 38 条
  • [21] Improved rate control via dynamic frame-rate controlling and Kalman filtering for low-delay video coding
    Chan, DY
    Lin, CH
    Hsieh, WS
    OPTICAL ENGINEERING, 2005, 44 (12)
  • [22] A Long-Short Term Memory Neural Network Based Rate Control Method for Video Coding
    Zhang, Zheng-Teng
    Lin, Jucai
    Fang, Ruidong
    Lu, Juan
    Chen, Yao
    PROCEEDINGS OF 2018 THE 2ND INTERNATIONAL CONFERENCE ON VIDEO AND IMAGE PROCESSING (ICVIP 2018), 2018, : 155 - 160
  • [23] Delay-Constrained Rate Control for Real-Time Video Streaming with Bounded Neural Network
    Huang, Tianchi
    Zhang, Rui-Xiao
    Zhou, Chao
    Sun, Lifeng
    PROCEEDINGS OF THE 28TH ACM WORKSHOP ON NETWORK AND OPERATING SYSTEMS SUPPORT FOR DIGITAL AUDIO AND VIDEO (NOSSDAV'18), 2018, : 13 - 18
  • [24] Efficient Rate-Quantization Model for Frame Level Rate Control in Spatially Scalable Video Coding
    Jing, Xuan
    Tham, Jo Yew
    Wang, Yu
    Goh, Kwong Huang
    Lee, Wei Siong
    2012 18TH IEEE INTERNATIONAL CONFERENCE ON NETWORKS (ICON), 2012, : 339 - 343
  • [25] Effective Frame Level Rate Control for H.264/AVC Video Coding
    Zhou, Yimin
    Sun, Yu
    Yin, Xin
    Sun, Shixin
    GLOBECOM 2008 - 2008 IEEE GLOBAL TELECOMMUNICATIONS CONFERENCE, 2008,
  • [26] Research on Parallel Rate Control Based on BP Neural Network
    Li, Guoping
    Huang, Lulu
    Wang, Guozhong
    Yao, Chen
    2018 INTERNATIONAL CONFERENCE ON AUDIO, LANGUAGE AND IMAGE PROCESSING (ICALIP), 2018, : 73 - 78
  • [27] A neural network approach to GOP-level rate control of x265 using Lookahead
    Cheng, Boya
    Zhang, Yuan
    2019 PICTURE CODING SYMPOSIUM (PCS), 2019,
  • [28] Adaptive Gradient Information and BFGS Based Inter Frame Rate Control for High Efficiency Video Coding
    Ye, Yuyun
    He, Xiaohai
    Teng, Qizhi
    Qing, Linbo
    Lin, Hongwei
    Xia, Dechun
    MULTIMEDIA TOOLS AND APPLICATIONS, 2018, 77 (12) : 14557 - 14577
  • [29] λ-Domain Rate Control via Wavelet-Based Residual Neural Network for VVC HDR Intra Coding
    Yuan, Feng
    Lei, Jianjun
    Pan, Zhaoqing
    Peng, Bo
    Xie, Haoran
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 33 : 6189 - 6203
  • [30] Adaptive Gradient Information and BFGS Based Inter Frame Rate Control for High Efficiency Video Coding
    Yuyun Ye
    Xiaohai He
    Qizhi Teng
    Linbo Qing
    Hongwei Lin
    Dechun Xia
    Multimedia Tools and Applications, 2018, 77 : 14557 - 14577