Online Video Super-resolution using Information Replenishing Unidirectional Recurrent Model

被引:4
作者
Baniya, Arbind Agrahari [1 ]
Lee, Tsz-Kwan [1 ]
Eklund, Peter W. [1 ]
Aryal, Sunil [1 ]
Robles-Kelly, Antonio [1 ]
机构
[1] Deakin Univ, Sch Informat Technol, Geelong, Vic, Australia
关键词
Video super-resolution; Recurrent network; deep learning; Advanced optimisation; Multimedia application; NETWORK;
D O I
10.1016/j.neucom.2023.126355
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recurrent Neural Networks (RNN) are widespread for Video Super-Resolution (VSR) because of their proven ability to learn spatiotemporal inter-dependencies across the temporal dimension. Despite RNN's ability to propagate memory across longer sequences of frames, vanishing gradient and error accumulation remain major obstacles to unidirectional RNNs in VSR. Several bi-directional recurrent models are suggested in the literature to alleviate this issue; however, these models are only applicable to offline use cases due to heavy demands for computational resources and the number of frames required per input. This paper proposes a novel unidirectional recurrent model for VSR, namely "Replenished Recurrency with Dual-Duct" (R2D2), that can be used in an online application setting. R2D2 incorporates a recurrent architecture with a sliding-window-based local alignment resulting in a recurrent hybrid architecture. It also uses a dual-duct residual network for concurrent and mutual refinement of local features along with global memory for full utilisation of the information available at each timestamp. With novel modelling and sophisticated optimisation, R2D2 demonstrates competitive performance and efficiency despite the lack of information available at each time-stamp compared to its offline (bidirectional) counterparts. Ablation analysis confirms the additive benefits of the proposed subcomponents of R2D2 over baseline RNN models.The PyTorch-based code for the R2D2 model will be released at R2D2 GitRepo.(c) 2023 The Author(s). Published by Elsevier B.V. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
引用
收藏
页数:10
相关论文
共 50 条
  • [41] FLOW-GUIDED DEFORMABLE ATTENTION NETWORK FOR FAST ONLINE VIDEO SUPER-RESOLUTION
    Yang, Xi
    Zhang, Xindong
    Zhang, Lei
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 390 - 394
  • [42] Video super-resolution based on deep learning: a comprehensive survey
    Hongying Liu
    Zhubo Ruan
    Peng Zhao
    Chao Dong
    Fanhua Shang
    Yuanyuan Liu
    Linlin Yang
    Radu Timofte
    Artificial Intelligence Review, 2022, 55 : 5981 - 6035
  • [43] Spatio-Temporal Fusion Network for Video Super-Resolution
    Li, Huabin
    Zhang, Pingjian
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [44] Local-Global Fusion Network for Video Super-Resolution
    Su, Dewei
    Wang, Hua
    Jin, Longcun
    Sun, Xianfang
    Peng, Xinyi
    IEEE ACCESS, 2020, 8 : 172443 - 172456
  • [45] Deformable Non-Local Network for Video Super-Resolution
    Wang, Hua
    Su, Dewei
    Liu, Chuangchuang
    Jin, Longcun
    Sun, Xianfang
    Peng, Xinyi
    IEEE ACCESS, 2019, 7 : 177734 - 177744
  • [46] Optical flow for video super-resolution: a survey
    Zhigang Tu
    Hongyan Li
    Wei Xie
    Yuanzhong Liu
    Shifu Zhang
    Baoxin Li
    Junsong Yuan
    Artificial Intelligence Review, 2022, 55 : 6505 - 6546
  • [47] Deformable transformer for endoscopic video super-resolution
    Song, Xiaowei
    Tang, Hui
    Yang, Chunfeng
    Zhou, Guangquan
    Wang, Yangang
    Huang, Xinjun
    Hua, Jie
    Coatrieux, Gouenou
    He, Xiaopu
    Chen, Yang
    BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2022, 77
  • [48] EFFICIENT JOINT VIDEO DENOISING AND SUPER-RESOLUTION
    Huang, Yuning
    Wang, Tianqi
    Lin, Qian
    Allebach, Jan P.
    Zhu, Fengqing
    2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2023, : 1865 - 1869
  • [49] Optical flow for video super-resolution: a survey
    Tu, Zhigang
    Li, Hongyan
    Xie, Wei
    Liu, Yuanzhong
    Zhang, Shifu
    Li, Baoxin
    Yuan, Junsong
    ARTIFICIAL INTELLIGENCE REVIEW, 2022, 55 (08) : 6505 - 6546
  • [50] Deep Blind Super-Resolution for Satellite Video
    Xiao, Yi
    Yuan, Qiangqiang
    Zhang, Qiang
    Zhang, Liangpei
    IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING, 2023, 61