Efficient Rate Control in Versatile Video Coding With Adaptive Spatial-Temporal Bit Allocation and Parameter Updating

被引:8
作者
He, Liqiang [1 ]
He, Xiaohai [1 ]
Xiong, Shuhua [1 ]
Zhao, Zeming [1 ]
Xiao, Hang [1 ]
Chen, Honggang [1 ,2 ]
机构
[1] Sichuan Univ, Coll Elect & Informat Engn, Chengdu 610065, Peoples R China
[2] Guangxi Normal Univ, Guangxi Key Lab Multisource Informat Min & Secur, Guilin 541004, Peoples R China
基金
中国国家自然科学基金;
关键词
Versatile video coding; rate control; spatial-temporal bit allocation; adaptive parameter updating; RATE CONTROL ALGORITHM; INTRA PREDICTION; RATE-DISTORTION; FRAME-LEVEL; TEXTURE; NETWORK; MODEL;
D O I
10.1109/TCSVT.2022.3224723
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Despite the fact that Versatile Video Coding (VVC) has achieved superior coding performance, two major problems remain for the rate control (RC) model in VVC. First, the regions concerned by human eyes are not clear enough in the coded video due to the deviation between the target bit allocation strategy of the coding tree unit (CTU) in RC and the human visual attention mechanism (HVAM). Second, there are significant quality fluctuations in the coded video frames due to the inappropriate updating speed. To address the above problems, we propose an efficient rate control (ERC) model. Specifically, in order to make the coded video more consistent with the attention of human eyes, we extract texture and motion-based spatial-temporal information to guide the bit allocation at the CTU level. Furthermore, based on the quasi-Newton algorithm and bit error, we propose an adaptive parameter updating (APU) method with the proper updating speed to precisely control the bits per frame. The proposed ERC outperforms the default RC model of VVC Test Model (VTM) 9.1 by saving the average Bjontegaard Delta Rate (BD-Rate) on full-frame video sequences by 3.60% and 4.94% under low delay P (LDP) and random access (RA) configurations respectively, with higher bitrate accuracy. Moreover, the Peak Signal-to-Noise Ratio (PSNR) and actual coded bits per frame in the video coded by the proposed ERC are more stable.
引用
收藏
页码:2920 / 2934
页数:15
相关论文
共 52 条
[1]   Parameter-Based Affine Intra Prediction of Screen Content in Versatile Video Coding [J].
Adhuran, Jayasingam ;
Kulupana, Gosala ;
Blasi, Saverio ;
Fernando, Anil .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (09) :3590-3602
[2]  
[Anonymous], 2012, Methodology for the Subjective Assessment of the Quality of Television Pictures, document BT.500-13
[3]  
[Anonymous], 2013, PROC VIS COMMUN IMAG
[4]  
Bjontegaard G., 2008, 35 VCEG M
[5]   Attention-Based Neural Networks for Chroma Intra Prediction in Video Coding [J].
Blanch, Marc Gorriz ;
Blasi, Saverio ;
Smeaton, Alan F. ;
O'Connor, Noel E. ;
Mrak, Marta .
IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2021, 15 (02) :366-377
[6]  
Bossen F., 2019, 16 JVET ITUT SG
[7]  
Bross B., 2018, P JOINT VIDEO EXPL T
[8]   Pixel-Wise Unified Rate-Quantization Model for Multi-Level Rate Control [J].
Choi, Hyomin ;
Yoo, Jonghun ;
Nam, Junghak ;
Sim, Donggyu ;
Bajic, Ivan V. .
IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2013, 7 (06) :1112-1123
[9]  
De-Luxán-Hernández S, 2019, IEEE IMAGE PROC, P1203, DOI [10.1109/ICIP.2019.8803777, 10.1109/icip.2019.8803777]
[10]   QUASI-NEWTON METHODS, MOTIVATION AND THEORY [J].
DENNIS, JE ;
MORE, JJ .
SIAM REVIEW, 1977, 19 (01) :46-89