RATE-QUALITY BASED RATE CONTROL MODEL FOR NEURAL VIDEO COMPRESSION

被引:0
作者
Liao, Shuhong [1 ,2 ]
Jia, Chuanmin [2 ]
Fan, Hongfei [3 ]
Yan, Jingwen [1 ]
Ma, Siwei [2 ]
机构
[1] Shantou Univ, Coll Engn, Shantou, Peoples R China
[2] Peking Univ, Beijing, Peoples R China
[3] Ant Grp, Hangzhou, Peoples R China
来源
2024 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING, ICASSP 2024 | 2024年
基金
中国国家自然科学基金;
关键词
Rate control; Neural video compression; ALGORITHM; HEVC;
D O I
10.1109/ICASSP48485.2024.10447777
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Rate control (RC) is crucial in achieving stable and smooth bitrate variation in video compression and transmission. Existing RC methods for neural video compression (NVC) have made strong assumptions on solving bit allocation parameters using a pre-defined model, leading to high bit-rate errors (BRE). In response, the study introduces a simple yet effective one-pass RC strategy tailored for NVC frameworks in a plug-in fashion. This strategy consists of two key components: the NVC rate-adaptive model and the associated RC approach. The former model constructs the basis of the latter approach. The proposed RC approach employs a progressive online updating technique for parameter estimation to achieve a lower BRE and maintain the original quality structure of frameworks. Experimental results demonstrate that our approach achieves an impressive RC performance on standard test sequences, outperforming the conventional optimal R-lambda RC model with lower BRE and better rate-distortion (R-D) performances. Furthermore, we extend our method to three representative NVC frameworks, consistently showcasing its effectiveness in achieving lower BRE with only moderated R-D performance degradation.
引用
收藏
页码:4215 / 4219
页数:5
相关论文
共 20 条
[1]   FLEXIBLE-RATE LEARNED HIERARCHICAL BI-DIRECTIONAL VIDEO COMPRESSION WITH MOTION REFINEMENT AND FRAME-LEVEL BIT ALLOCATION [J].
Cetin, Eren ;
Yilmaz, M. Akin ;
Tekalp, A. Murat .
2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, :1206-1210
[2]  
Cui ZH, 2021, ICCREM 2021: CHALLENGES OF THE CONSTRUCTION INDUSTRY UNDER THE PANDEMIC, P532
[3]   Neural Inter-Frame Compression for Video Coding [J].
Djelouah, Abdelaziz ;
Campos, Joaquim ;
Schaub-Meyer, Simone ;
Schroers, Christopher .
2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), 2019, :6430-6438
[4]   Coarse-to-fine Deep Video Coding with Hyperprior-guided Mode Prediction [J].
Hu, Zhihao ;
Lu, Guo ;
Guo, Jinyang ;
Liu, Shan ;
Jiang, Wei ;
Xu, Dong .
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, :5911-5920
[5]   FVC: A New Framework towards Deep Video Compression in Feature Space [J].
Hu, Zhihao ;
Lu, Guo ;
Xu, Dong .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :1502-1511
[6]   λ Domain Rate Control Algorithm for High Efficiency Video Coding [J].
Li, Bin ;
Li, Houqiang ;
Li, Li ;
Zhang, Jinlei .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2014, 23 (09) :3841-3854
[7]  
Li J., 2021, Advances in Neural Information Processing Systems (NeurIPS), P18114
[8]   Neural Video Compression with Diverse Contexts [J].
Li, Jiahao ;
Li, Bin ;
Lu, Yan .
2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, :22616-22626
[9]   Hybrid Spatial-Temporal Entropy Modelling for Neural Video Compression [J].
Li, Jiahao ;
Li, Bin ;
Lu, Yan .
PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2022, 2022, :1503-1511
[10]   λ-Domain Optimal Bit Allocation Algorithm for High Efficiency Video Coding [J].
Li, Li ;
Li, Bin ;
Li, Houqiang ;
Chen, Chang Wen .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2018, 28 (01) :130-142