Recurrent Network with Enhanced Alignment and Attention-Guided Aggregation for Compressed Video Quality Enhancement

被引:1
作者
Shi, Xiaodi [1 ]
Lin, Jucai [1 ]
Jiang, Dong [1 ]
Nian, Chunmei [1 ]
Yin, Jun [1 ,2 ]
机构
[1] Zhejiang Dahua Technol Co Ltd, Hangzhou, Peoples R China
[2] Zhejiang Prov Key Lab Harmonized Applicat Vis & T, Hangzhou, Peoples R China
来源
2022 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP) | 2022年
关键词
Quality enhancement; deformable alignment; compressed video; attention; recurrent network;
D O I
10.1109/VCIP56404.2022.10008807
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recently, various compressed video quality enhancement technologies have been proposed to overcome the visual artifacts. Most existing methods are based on optical flow or deformable alignment to explore the spatiotemporal information across frames. However, inaccurate motion estimation and training instability of deformable convolution would be detrimental to the reconstruction performance. In this paper, we design a bi-directional recurrent network equipping with enhanced deformable alignment and attention-guided aggregation to promote information flows among frames. For the alignment, a pair of scale and shift parameters are learned to modulate optical flows into new offsets for deformable convolution. Furthermore, an attention aggregation strategy oriented at preference is designed for temporal information fusion. The strategy synthesizes global information of inputs to modulate features for effective fusion. Extensive experiments have proved that the proposed method achieves great performance in terms of quantitative performance and qualitative effect.
引用
收藏
页数:5
相关论文
共 24 条
  • [1] Chan KCK, 2021, Arxiv, DOI arXiv:2104.13371
  • [2] A Convolutional Neural Network Approach for Post-Processing in HEVC Intra Coding
    Dai, Yuanying
    Liu, Dong
    Wu, Feng
    [J]. MULTIMEDIA MODELING (MMM 2017), PT I, 2017, 10132 : 28 - 39
  • [3] Deng JN, 2020, AAAI CONF ARTIF INTE, V34, P10696
  • [4] Compression Artifacts Reduction by a Deep Convolutional Network
    Dong, Chao
    Deng, Yubin
    Loy, Chen Change
    Tang, Xiaoou
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 576 - 584
  • [5] MFQE 2.0: A New Approach for Multi-Frame Quality Enhancement on Compressed Video
    Guan, Zhenyu
    Xing, Qunliang
    Xu, Mai
    Yang, Ren
    Liu, Tie
    Wang, Zulin
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (03) : 949 - 963
  • [6] Isobe T, 2020, PROC CVPR IEEE, P8005, DOI 10.1109/CVPR42600.2020.00803
  • [7] Kingma D. P., 2015, P 3 INT C LEARN REPR, P1
  • [8] Li K, 2017, IEEE INT CON MULTI, P1320, DOI 10.1109/ICME.2017.8019416
  • [9] Loshchilov Ilya, 2016, arXiv
  • [10] Deep Non-Local Kalman Network for Video Compression Artifact Reduction
    Lu, Guo
    Zhang, Xiaoyun
    Ouyang, Wanli
    Xu, Dong
    Chen, Li
    Gao, Zhiyong
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 1725 - 1737