Recurrent Network with Enhanced Alignment and Attention-Guided Aggregation for Compressed Video Quality Enhancement

Cited by: 1

Authors:

Shi, Xiaodi ^{[1]}

Lin, Jucai ^{[1]}

Jiang, Dong ^{[1]}

Nian, Chunmei ^{[1]}

Yin, Jun ^{[1,2]}

Affiliations:

[1] Zhejiang Dahua Technol Co Ltd, Hangzhou, Peoples R China

[2] Zhejiang Prov Key Lab Harmonized Applicat Vis & T, Hangzhou, Peoples R China

Source:

2022 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP) | 2022

Keywords:

Quality enhancement; deformable alignment; compressed video; attention; recurrent network

DOI:

10.1109/VCIP56404.2022.10008807

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

Recently, various compressed video quality enhancement techniques have been proposed to reduce visual artifacts. Most existing methods rely on optical flow or deformable alignment to exploit spatiotemporal information across frames. However, inaccurate motion estimation and the training instability of deformable convolution can degrade reconstruction performance. In this paper, we design a bi-directional recurrent network equipped with enhanced deformable alignment and attention-guided aggregation to promote information flow among frames. For alignment, a pair of scale and shift parameters is learned to modulate optical flows into new offsets for deformable convolution. Furthermore, a preference-oriented attention aggregation strategy is designed for temporal information fusion; it synthesizes global information from the inputs to modulate features for effective fusion. Extensive experiments show that the proposed method achieves strong performance in both quantitative metrics and qualitative visual quality.

Pages: 5
