Reconstruction flow recurrent network for compressed video quality enhancement

被引:2
作者
Wang, Zhengning [1 ]
Liu, Xuhang [1 ]
Wang, Chuan [2 ]
Jiang, Ting [2 ]
Zeng, Tianjiao [1 ]
Zeng, Zhenni [1 ]
Wang, Guoqing [1 ]
Liu, Shuaicheng [1 ]
机构
[1] Univ Elect Sci & Technol China, Chengdu 611731, Sichuan, Peoples R China
[2] Megvii Res Chengdu, Chengdu 610095, Sichuan, Peoples R China
基金
中国国家自然科学基金;
关键词
Video coding; Video quality enhancement; Flow reconstruction;
D O I
10.1016/j.patcog.2024.110638
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a reconstruction flow for the task of compressed video quality enhancement (VQE). Compressed videos often suffer from various coding artifacts, such as blocking and blurring, especially under low bitrate. VQE aims to suppress these artifacts to improve the visual quality. Frame similarity can be utilized to enhance low -quality frames given their neighboring high -quality frames, for which motion estimation becomes important. Previous approaches often calculate optical flow for the motion compensation. On the other hand, video coding contains a rich set of block motion vectors, forming a coding flow, which may or may not correspond to the scene motion, but to places that deliver the minimum compression error. In contrast, such a valuable coding flow has always been ignored in VQE previously. In this work, we combine these two motion sources into a new flow, namely reconstruction flow, for the purpose of high -quality VQE. Specifically, we estimate optical flows from RGB frames and extract coding flows from coding streams, which are then merged by a fusion module to generate reconstruction flow. Besides, our network is built upon a recurrent network to utilize global temporal information. The deep features are warped according to the reconstruction flow and fed into the subsequent reconstruction module with spatial -variant kernel attention. Our method is evaluated on the leading MFQE2.0 dataset, which demonstrates superior performances when compared to the existing state-of-the-art methods.
引用
收藏
页数:10
相关论文
共 43 条
[1]   ViViT: A Video Vision Transformer [J].
Arnab, Anurag ;
Dehghani, Mostafa ;
Heigold, Georg ;
Sun, Chen ;
Lucic, Mario ;
Schmid, Cordelia .
2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, :6816-6826
[2]   Study of Temporal Effects on Subjective Video Quality of Experience [J].
Bampis, Christos George ;
Li, Zhi ;
Moorthy, Anush Krishna ;
Katsavounidis, Ioannis ;
Aaron, Anne ;
Bovik, Alan Conrad .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2017, 26 (11) :5217-5231
[3]  
Cao JZ, 2023, Arxiv, DOI [arXiv:2106.06847, DOI 10.48550/ARXIV.2106.06847]
[4]   BasicVSR: The Search for Essential Components in Video Super-Resolution and Beyond [J].
Chan, Kelvin C. K. ;
Wang, Xintao ;
Yu, Ke ;
Dong, Chao ;
Loy, Chen Change .
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, :4945-4954
[5]   Reducing Artifacts in JPEG Decompression Via a Learned Dictionary [J].
Chang, Huibin ;
Ng, Michael K. ;
Zeng, Tieyong .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2014, 62 (03) :718-728
[6]  
CHARBONNIER P, 1994, IEEE IMAGE PROC, P168
[7]   Deformable Convolutional Networks [J].
Dai, Jifeng ;
Qi, Haozhi ;
Xiong, Yuwen ;
Li, Yi ;
Zhang, Guodong ;
Hu, Han ;
Wei, Yichen .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2017, :764-773
[8]   A Convolutional Neural Network Approach for Post-Processing in HEVC Intra Coding [J].
Dai, Yuanying ;
Liu, Dong ;
Wu, Feng .
MULTIMEDIA MODELING (MMM 2017), PT I, 2017, 10132 :28-39
[9]  
Deng JN, 2020, AAAI CONF ARTIF INTE, V34, P10696
[10]   Compression Artifacts Reduction by a Deep Convolutional Network [J].
Dong, Chao ;
Deng, Yubin ;
Loy, Chen Change ;
Tang, Xiaoou .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, :576-584