Online Video Super-resolution using Information Replenishing Unidirectional Recurrent Model

Cited by: 4

Authors:

Baniya, Arbind Agrahari [1]

Lee, Tsz-Kwan [1]

Eklund, Peter W. [1]

Aryal, Sunil [1]

Robles-Kelly, Antonio [1]

Affiliations:

[1] Deakin Univ, Sch Informat Technol, Geelong, Vic, Australia

Source:

NEUROCOMPUTING | 2023 / Vol. 546

Keywords:

Video super-resolution; Recurrent network; Deep learning; Advanced optimisation; Multimedia application; NETWORK

DOI:

10.1016/j.neucom.2023.126355

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification code:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Recurrent Neural Networks (RNNs) are widely used for Video Super-Resolution (VSR) because of their proven ability to learn spatiotemporal inter-dependencies across the temporal dimension. Although RNNs can propagate memory across longer sequences of frames, vanishing gradients and error accumulation remain major obstacles for unidirectional RNNs in VSR. Several bidirectional recurrent models have been suggested in the literature to alleviate this issue; however, these models are only applicable to offline use cases because of their heavy demands for computational resources and the number of frames required per input. This paper proposes a novel unidirectional recurrent model for VSR, namely "Replenished Recurrency with Dual-Duct" (R2D2), that can be used in an online application setting. R2D2 combines a recurrent architecture with a sliding-window-based local alignment, resulting in a recurrent hybrid architecture. It also uses a dual-duct residual network for concurrent and mutual refinement of local features along with global memory, for full utilisation of the information available at each timestamp. With novel modelling and sophisticated optimisation, R2D2 demonstrates competitive performance and efficiency despite having less information available at each timestamp than its offline (bidirectional) counterparts. Ablation analysis confirms the additive benefits of the proposed subcomponents of R2D2 over baseline RNN models. The PyTorch-based code for the R2D2 model will be released at R2D2 GitRepo. © 2023 The Author(s). Published by Elsevier B.V. This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
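The online (unidirectional) setting described in the abstract can be illustrated with a minimal sketch of the data flow only. This is not the R2D2 implementation: the names `online_recurrent_vsr` and `toy_step`, the window size, and the use of plain floats in place of frames are all assumptions made for illustration. The sketch shows that each output depends on a short sliding window of recent frames plus a memory carried forward in time, and never on future frames.

```python
from collections import deque
from typing import Callable, Iterable, Iterator, List, Tuple


def online_recurrent_vsr(
    frames: Iterable[float],
    step: Callable[[List[float], float], Tuple[float, float]],
    init_memory: float,
    window: int = 2,
) -> Iterator[float]:
    """Causal loop: at time t only frames up to t are visible (hypothetical helper)."""
    local = deque(maxlen=window)  # sliding window of the most recent frames
    memory = init_memory          # global memory propagated in one direction
    for frame in frames:
        local.append(frame)
        # The step sees the local window and the carried memory, nothing else.
        output, memory = step(list(local), memory)
        yield output              # emitted before the next frame is read


def toy_step(window_frames: List[float], memory: float) -> Tuple[float, float]:
    """Stand-in for a learned network (assumption): average the window, mix with memory."""
    local_feature = sum(window_frames) / len(window_frames)
    output = local_feature + memory
    new_memory = 0.5 * memory + 0.5 * local_feature
    return output, new_memory


if __name__ == "__main__":
    for out in online_recurrent_vsr([2.0, 4.0, 6.0], toy_step, init_memory=0.0):
        print(out)
```

Because the loop is a generator, each output is produced as soon as its frame arrives, which is the property that distinguishes the online setting from bidirectional models that need the whole sequence first.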


Pages: 10

