Joint Rate-Distortion Optimization for Video Coding and Learning-Based In-Loop Filtering

被引:1
作者
Yang, Mingyi [1 ]
Huo, Junyan [1 ]
Zhou, Xile [1 ]
Qiao, Wenhan [1 ]
Wan, Shuai [2 ,3 ]
Wang, Hao [4 ]
Yang, Fuzheng [1 ]
机构
[1] Xidian Univ, Sch Telecommun Engn, Xian 710071, Peoples R China
[2] Northwestern Polytech Univ, Sch Elect & Informat, Xian 710072, Peoples R China
[3] RMIT Univ, Sch Engn, Melbourne, Vic 3001, Australia
[4] CXMT, Hefei 230000, Peoples R China
基金
中国国家自然科学基金;
关键词
Encoding; Filtering; Filtering theory; Video coding; Optimization; Distortion; Rate-distortion; Rate-distortion optimization (RDO); video coding; learning-based in-loop filter;
D O I
10.1109/TMM.2023.3304895
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Learning-based in-loop filters (ILFs) have recently been widely deployed in the video codec to remove compression artifacts and to obtain better-quality reconstructed videos. However, in the existing codec, the impact of the learning-based ILF is not considered in the Rate-Distortion optimization (RDO) process. With the learning-based ILF, the set of coding parameters selected by the conventional RDO process may no longer be the best one, and the best overall Rate-Distortion (R-D) performance can not be guaranteed. In this article, we propose a joint RDO (JRDO) for Video Coding and learning-based in-loop filtering, which incorporates the effect of the learning-based ILF on the reconstructed video into the RDO process, aiming to achieve the best overall R-D performance of the reconstructed video after in-loop filtering. Furthermore, to realize the proposed JRDO in a standardized video codec, we propose practical strategies to efficiently estimate the effect of learning-based ILF during the RDO process, i.e., efficiently estimate the distortion of the reconstructed block after in-loop filtering during the RDO process. Extensive experiments demonstrate that the proposed joint RDO is standard-compliant and can improve the R-D performance without increasing the decoding time. Besides, the superiority of joint RDO is achieved in various ILFs, indicating the generality of the proposed work.
引用
收藏
页码:2851 / 2865
页数:15
相关论文
共 40 条
[1]   NTIRE 2017 Challenge on Single Image Super-Resolution: Dataset and Study [J].
Agustsson, Eirikur ;
Timofte, Radu .
2017 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2017, :1122-1131
[2]  
Akossou AYJ., 2013, Int. J. Math. Comput, V20, P84
[3]   A Low Complexity Detection of Discrete Cross Differences for Fast H.264/AVC Intra Prediction [J].
Bharanitharan, K. ;
Liu, Bin-Da ;
Yang, Jar-Ferr ;
Tsai, Wen-Chih .
IEEE TRANSACTIONS ON MULTIMEDIA, 2008, 10 (07) :1250-1260
[4]  
Bjontegaard G., 2001, CALCULATION AVERAGE
[5]  
Bossen F, 2010, 3 JCT VC MEET GUANGZ
[6]  
Bross B., 2019, Joint Video Experts Team (JVET) of ITU-T SG, V16, P3
[7]   AV1 IN-LOOP FILTERING USING A WIDE-ACTIVATION STRUCTURED RESIDUAL NETWORK [J].
Chen, Guangyao ;
Ding, Dandan ;
Mukherjee, Debargha ;
Joshi, Urvang ;
Chen, Yue .
2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, :1725-1729
[8]   Motion Vector Coding and Block Merging in the Versatile Video Coding Standard [J].
Chien, Wei-Jung ;
Zhang, Li ;
Winken, Martin ;
Li, Xiang ;
Liao, Ru-Ling ;
Gao, Han ;
Hsu, Chih-Wei ;
Liu, Hongbin ;
Chen, Chun-Chi .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (10) :3848-3861
[9]   A Convolutional Neural Network Approach for Post-Processing in HEVC Intra Coding [J].
Dai, Yuanying ;
Liu, Dong ;
Wu, Feng .
MULTIMEDIA MODELING (MMM 2017), PT I, 2017, 10132 :28-39
[10]   A Switchable Deep Learning Approach for In-Loop Filtering in Video Coding [J].
Ding, Dandan ;
Kong, Lingyi ;
Chen, Guangyao ;
Liu, Zoe ;
Fang, Yong .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 30 (07) :1871-1887