Learned Rate Control for Frame-Level Adaptive Neural Video Compression via Dynamic Neural Network

被引:0
|
作者
Zhang, Chenhao [1 ]
Gao, Wei [1 ,2 ]
机构
[1] Peking Univ, Shenzhen Grad Sch, SECE, Shenzhen, Peoples R China
[2] Peng Cheng Lab, Shenzhen, Peoples R China
来源
COMPUTER VISION - ECCV 2024, PT LXXXV | 2025年 / 15143卷
关键词
Neural Video Compression; Rate Control; Rate-Distortion-Complexity Optimization;
D O I
10.1007/978-3-031-73013-9_14
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Neural Video Compression (NVC) has achieved remarkable performance in recent years. However, precise rate control remains a challenge due to the inherent limitations of learning-based codecs. To solve this issue, we propose a dynamic video compression framework designed for variable bitrate scenarios. First, to achieve variable bitrate implementation, we propose the Dynamic-Route Autoencoder with variable coding routes, each occupying partial computational complexity of the whole network and navigating to a distinct RD trade-off. Second, to approach the target bitrate, the Rate Control Agent estimates the bitrate of each route and adjusts the coding route of DRA at run time. To encompass a broad spectrum of variable bitrates while preserving overall RD performance, we employ the Joint-Routes Optimization strategy, achieving collaborative training of various routes. Extensive experiments on the HEVC and UVG datasets show that the proposed method achieves an average BD-Rate reduction of 14.8% and BD-PSNR gain of 0.47 dB over state-of-the-art methods while maintaining an average bitrate error of 1.66%, achieving Rate-Distortion-Complexity Optimization (RDCO) for various bitrate and bitrate-constrained applications.
引用
收藏
页码:239 / 255
页数:17
相关论文
共 50 条
  • [1] Dynamic Frame Resizing with Convolutional Neural Network for Efficient Video Compression
    Kim, Jaehwan
    Park, Youngo
    Choi, Kwang Pyo
    Lee, JongSeok
    Jeon, Sunyoung
    Park, JeongHoon
    APPLICATIONS OF DIGITAL IMAGE PROCESSING XL, 2017, 10396
  • [2] Content-Adaptive Rate-Distortion Modeling for Frame-Level Rate Control in Versatile Video Coding
    Liao, Junqi
    Li, Li
    Liu, Dong
    Li, Houqiang
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 6864 - 6879
  • [3] NeXtVLAD: An Efficient Neural Network to Aggregate Frame-Level Features for Large-Scale Video Classification
    Lin, Rongcheng
    Xiao, Jing
    Fan, Jianping
    COMPUTER VISION - ECCV 2018 WORKSHOPS, PT IV, 2019, 11132 : 206 - 218
  • [4] FLDNet: Frame-Level Distilling Neural Network for EEG Emotion Recognition
    Wang, Zhe
    Gu, Tianhao
    Zhu, Yiwen
    Li, Dongdong
    Yang, Hai
    Du, Wenli
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2021, 25 (07) : 2533 - 2544
  • [5] FLEXIBLE-RATE LEARNED HIERARCHICAL BI-DIRECTIONAL VIDEO COMPRESSION WITH MOTION REFINEMENT AND FRAME-LEVEL BIT ALLOCATION
    Cetin, Eren
    Yilmaz, M. Akin
    Tekalp, A. Murat
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 1206 - 1210
  • [6] An Improved Parametric Bit Rate Model for Frame-level Rate Control in Video Coding
    Chen, Zhifeng
    Doken, Serhad
    Wu, Dapeng
    2011 DATA COMPRESSION CONFERENCE (DCC), 2011, : 451 - 451
  • [7] An Efficient Frame-Level Rate Control Algorithm for High Efficiency Video Coding
    Lin, Yubei
    Zhang, Xingming
    Xiao, Jianen
    Su, Shengkai
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2016, 10 (04): : 1877 - 1891
  • [8] A Frame-Level Rate Control Scheme for Low Delay Video Coding in HEVC
    Guo, Hongwei
    Zhu, Ce
    Gao, Yanbo
    Song, Shichang
    2017 IEEE 19TH INTERNATIONAL WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING (MMSP), 2017,
  • [9] Frame-Level Rate Control for Geometry-Based LiDAR Point Cloud Compression
    Li, Li
    Li, Zhu
    Liu, Shan
    Li, Houqiang
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 3855 - 3867
  • [10] Adaptive Frame Rate Optimization Based on Particle Swarm and Neural Network for Industrial Video Stream
    Zhang, Xiaoling
    Li, Menghao
    Mei, Ke
    Ding, Lu
    2019 24TH IEEE INTERNATIONAL CONFERENCE ON EMERGING TECHNOLOGIES AND FACTORY AUTOMATION (ETFA), 2019, : 1111 - 1118