Online Video Super-resolution using Information Replenishing Unidirectional Recurrent Model

被引:4
作者
Baniya, Arbind Agrahari [1 ]
Lee, Tsz-Kwan [1 ]
Eklund, Peter W. [1 ]
Aryal, Sunil [1 ]
Robles-Kelly, Antonio [1 ]
机构
[1] Deakin Univ, Sch Informat Technol, Geelong, Vic, Australia
关键词
Video super-resolution; Recurrent network; deep learning; Advanced optimisation; Multimedia application; NETWORK;
D O I
10.1016/j.neucom.2023.126355
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recurrent Neural Networks (RNN) are widespread for Video Super-Resolution (VSR) because of their proven ability to learn spatiotemporal inter-dependencies across the temporal dimension. Despite RNN's ability to propagate memory across longer sequences of frames, vanishing gradient and error accumulation remain major obstacles to unidirectional RNNs in VSR. Several bi-directional recurrent models are suggested in the literature to alleviate this issue; however, these models are only applicable to offline use cases due to heavy demands for computational resources and the number of frames required per input. This paper proposes a novel unidirectional recurrent model for VSR, namely "Replenished Recurrency with Dual-Duct" (R2D2), that can be used in an online application setting. R2D2 incorporates a recurrent architecture with a sliding-window-based local alignment resulting in a recurrent hybrid architecture. It also uses a dual-duct residual network for concurrent and mutual refinement of local features along with global memory for full utilisation of the information available at each timestamp. With novel modelling and sophisticated optimisation, R2D2 demonstrates competitive performance and efficiency despite the lack of information available at each time-stamp compared to its offline (bidirectional) counterparts. Ablation analysis confirms the additive benefits of the proposed subcomponents of R2D2 over baseline RNN models.The PyTorch-based code for the R2D2 model will be released at R2D2 GitRepo.(c) 2023 The Author(s). Published by Elsevier B.V. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
引用
收藏
页数:10
相关论文
共 50 条
  • [31] Deeply feature fused video super-resolution network using temporal grouping
    Chen, Zhensen
    Yang, Wenyuan
    Yang, Jingmin
    JOURNAL OF SUPERCOMPUTING, 2022, 78 (07) : 8999 - 9016
  • [32] Video super-resolution with fused local and nonlocal feature
    Wang, Wenhao
    Liu, Zhenbing
    Lan, Rushi
    Lu, Haoxiang
    JOURNAL OF ELECTRONIC IMAGING, 2022, 31 (04)
  • [33] DSCVSR: A Lightweight Video Super-Resolution for Arbitrary Magnification
    Hong, Zixuan
    Cao, Weipeng
    Xu, Zhiwu
    Ming, Zhong
    Cao, Chuqing
    Zheng, Liang
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, PT I, KSEM 2024, 2024, 14884 : 112 - 123
  • [34] Deep Learning for Image/Video Restoration and Super-resolution
    Tekalp, A. Murat
    FOUNDATIONS AND TRENDS IN COMPUTER GRAPHICS AND VISION, 2022, 13 (01): : 1 - 110
  • [35] VIDEO SUPER-RESOLUTION USING LOW RANK MATRIX COMPLETION
    Chen, Jin
    Nunez-Yanez, Jose
    Achim, Alin
    2013 20TH IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP 2013), 2013, : 1376 - 1380
  • [36] Video Super-Resolution Using Wave-Shape Network
    Wu, Yanan
    Kamata, Sei-ichiro
    ICVIP 2019: PROCEEDINGS OF 2019 3RD INTERNATIONAL CONFERENCE ON VIDEO AND IMAGE PROCESSING, 2019, : 132 - 136
  • [37] Advancing Surveillance Video Clarity and Transmission: A Real-Time Video Super-Resolution Model with Background Information Awareness
    Liu, Zhifeng
    He, Zheng
    Ye, Gang
    Zhu, Wenqian
    PATTERN RECOGNITION AND COMPUTER VISION, PT IX, PRCV 2024, 2025, 15039 : 304 - 318
  • [38] High-resolution optical flow and frame-recurrent network for video super-resolution and deblurring
    Fang, Ning
    Zhan, Zongqian
    NEUROCOMPUTING, 2022, 489 : 128 - 138
  • [39] Video super-resolution using an adaptive superpixel-guided auto-regressive model
    Li, Kun
    Zhu, Yanming
    Yang, Jingyu
    Jiang, Jianmin
    PATTERN RECOGNITION, 2016, 51 : 59 - 71
  • [40] Spatio-Temporal Fusion Network for Video Super-Resolution
    Li, Huabin
    Zhang, Pingjian
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,