Joint Rate-Distortion Optimization for Video Coding and Learning-Based In-Loop Filtering

被引：1

作者：

Yang, Mingyi ^{[1
]}

Huo, Junyan ^{[1
]}

Zhou, Xile ^{[1
]}

Qiao, Wenhan ^{[1
]}

Wan, Shuai ^{[2
,3
]}

Wang, Hao ^{[4
]}

Yang, Fuzheng ^{[1
]}

机构：

[1] Xidian Univ, Sch Telecommun Engn, Xian 710071, Peoples R China

[2] Northwestern Polytech Univ, Sch Elect & Informat, Xian 710072, Peoples R China

[3] RMIT Univ, Sch Engn, Melbourne, Vic 3001, Australia

[4] CXMT, Hefei 230000, Peoples R China

来源：

IEEE TRANSACTIONS ON MULTIMEDIA | 2024年 / 26卷

基金：

中国国家自然科学基金;

关键词：

Encoding; Filtering; Filtering theory; Video coding; Optimization; Distortion; Rate-distortion; Rate-distortion optimization (RDO); video coding; learning-based in-loop filter;

D O I：

10.1109/TMM.2023.3304895

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Learning-based in-loop filters (ILFs) have recently been widely deployed in the video codec to remove compression artifacts and to obtain better-quality reconstructed videos. However, in the existing codec, the impact of the learning-based ILF is not considered in the Rate-Distortion optimization (RDO) process. With the learning-based ILF, the set of coding parameters selected by the conventional RDO process may no longer be the best one, and the best overall Rate-Distortion (R-D) performance can not be guaranteed. In this article, we propose a joint RDO (JRDO) for Video Coding and learning-based in-loop filtering, which incorporates the effect of the learning-based ILF on the reconstructed video into the RDO process, aiming to achieve the best overall R-D performance of the reconstructed video after in-loop filtering. Furthermore, to realize the proposed JRDO in a standardized video codec, we propose practical strategies to efficiently estimate the effect of learning-based ILF during the RDO process, i.e., efficiently estimate the distortion of the reconstructed block after in-loop filtering during the RDO process. Extensive experiments demonstrate that the proposed joint RDO is standard-compliant and can improve the R-D performance without increasing the decoding time. Besides, the superiority of joint RDO is achieved in various ILFs, indicating the generality of the proposed work.

引用

页码：2851 / 2865

页数：15

共 40 条

[1] NTIRE 2017 Challenge on Single Image Super-Resolution: Dataset and Study [J].

Agustsson, Eirikur ;

Timofte, Radu .

2017 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2017, :1122-1131

[2]

Akossou AYJ., 2013, Int. J. Math. Comput, V20, P84

[3] A Low Complexity Detection of Discrete Cross Differences for Fast H.264/AVC Intra Prediction [J].

Bharanitharan, K. ;

Liu, Bin-Da ;

Yang, Jar-Ferr ;

Tsai, Wen-Chih .

IEEE TRANSACTIONS ON MULTIMEDIA, 2008, 10 (07) :1250-1260

[4]

Bjontegaard G., 2001, CALCULATION AVERAGE

[5]

Bossen F, 2010, 3 JCT VC MEET GUANGZ

[6]

Bross B., 2019, Joint Video Experts Team (JVET) of ITU-T SG, V16, P3

[7] AV1 IN-LOOP FILTERING USING A WIDE-ACTIVATION STRUCTURED RESIDUAL NETWORK [J].

Chen, Guangyao ;

Ding, Dandan ;

Mukherjee, Debargha ;

Joshi, Urvang ;

Chen, Yue .

2019 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2019, :1725-1729

[8] Motion Vector Coding and Block Merging in the Versatile Video Coding Standard [J].

Chien, Wei-Jung ;

Zhang, Li ;

Winken, Martin ;

Li, Xiang ;

Liao, Ru-Ling ;

Gao, Han ;

Hsu, Chih-Wei ;

Liu, Hongbin ;

Chen, Chun-Chi .

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (10) :3848-3861

[9] A Convolutional Neural Network Approach for Post-Processing in HEVC Intra Coding [J].

Dai, Yuanying ;

Liu, Dong ;

Wu, Feng .

MULTIMEDIA MODELING (MMM 2017), PT I, 2017, 10132 :28-39

[10] A Switchable Deep Learning Approach for In-Loop Filtering in Video Coding [J].

Ding, Dandan ;

Kong, Lingyi ;

Chen, Guangyao ;

Liu, Zoe ;

Fang, Yong .

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 30 (07) :1871-1887

← 1 2 3 4 →