Joint Optimization for SSIM-Based CTU-Level Bit Allocation and Rate Distortion Optimization

被引:29
作者
Li, Yang [1 ]
Mou, Xuanqin [1 ]
机构
[1] Xi An Jiao Tong Univ, Inst Image Proc & Pattern Recognit, Xian 710049, Peoples R China
基金
中国国家自然科学基金;
关键词
Optimal bit allocation; rate distortion optimization; SSIM; GAME-THEORY; QUANTIZATION; INDEX; IMAGE;
D O I
10.1109/TBC.2021.3068871
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Structural similarity (SSIM)-based distortion Dssim is more consistent with human perception than the traditional mean squared error D-MSE. To achieve better video encoding quality, many studies on optimal bit allocation (OBA) used Dssim as the distortion metric. However, the MSE-based rate distortion optimization (RDO) was still used in these studies. The inconsistency between the optimization goals of OBA and RDO results in a non-optimal SSIM-based encoding performance. To solve this problem, we propose an accurate coding tree unit level D-SSIM-D-MSE model, which enables performing the SSIM-based RDO with simpler R-D-MSE cost scaled by the SSIM-based Lagrangian parameter lambda(SSIM). Moreover, based on this model, the R-D-SSIM model can be accurately estimated based on the joint relationship of R-D-SSIM-lambda(SSIM) With the accurate R-D-SSIM model, the SSIM-based OBA problem is then solved. Accordingly, the SSIM-based OBA and SSIM-based RDO are unified together in our scheme, called SOSR. Compared with the HEVC reference encoder HM16.20, SOSR saves 5%, 11%, and 17% bitrate under the same SSIM in the commonly used all-intra, hierarchical and non-hierarchical low-delay-B configurations, which is superior to existing state-of-the-art SSIM-based OBA schemes.
引用
收藏
页码:500 / 511
页数:12
相关论文
共 41 条
[21]   Rate-distortion methods for image and video compression [J].
Ortega, A ;
Ramchandran, K .
IEEE SIGNAL PROCESSING MAGAZINE, 1998, 15 (06) :23-50
[22]   SSIM-Based Perceptual Rate Control for Video Coding [J].
Ou, Tao-Sheng ;
Huang, Yi-Hsin ;
Chen, Homer H. .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2011, 21 (05) :682-691
[23]  
Qi J, 2013, PICT COD SYMP, P217, DOI 10.1109/PCS.2013.6737722
[24]  
Rehman A., 2012, 2012 IEEE International Conference on Multimedia and Expo (ICME), P497, DOI 10.1109/ICME.2012.175
[25]   CONTRAST ADAPTATION AND CONTRAST MASKING IN HUMAN VISION [J].
ROSS, J ;
SPEED, HD .
PROCEEDINGS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES, 1991, 246 (1315) :61-69
[26]  
Shuai Wang, 2015, 2015 IEEE Power & Energy Society General Meeting, P1, DOI 10.1109/PESGM.2015.7286582
[27]   Overview of the High Efficiency Video Coding (HEVC) Standard [J].
Sullivan, Gary J. ;
Ohm, Jens-Rainer ;
Han, Woo-Jin ;
Wiegand, Thomas .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2012, 22 (12) :1649-1668
[28]   Rate-distortion optimization for video compression [J].
Sullivan, GJ ;
Wiegand, T .
IEEE SIGNAL PROCESSING MAGAZINE, 1998, 15 (06) :74-90
[29]   INTERPRETATION OF THE CORRELATION-COEFFICIENT - A BASIC REVIEW [J].
TAYLOR, R .
JOURNAL OF DIAGNOSTIC MEDICAL SONOGRAPHY, 1990, 6 (01) :35-39
[30]  
Wang C., 2015, OPTIMIZATION BLOCK L