Residual Invertible Spatio-Temporal Network for Video Super-Resolution

Cited by: 53

Authors:

Zhu, Xiaobin ^{[1,2]}

Li, Zhuangzi ^{[2]}

Zhang, Xiao-Yu ^{[3]}

Li, Changsheng ^{[4]}

Liu, Yaqi ^{[3]}

Xue, Ziyu ^{[5]}

Affiliations:

[1] Univ Sci & Technol Beijing, Sch Comp & Commun Engn, Beijing, Peoples R China

[2] Beijing Technol & Business Univ, Sch Comp & Informat Engn, Beijing, Peoples R China

[3] Chinese Acad Sci, Inst Informat Engn, Beijing, Peoples R China

[4] Univ Elect Sci & Technol China, Hefei, Anhui, Peoples R China

[5] NRTA, Acad Broadcasting Sci, Informat Technol Inst, Beijing, Peoples R China

Source:

THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE | 2019

Funding:

National Natural Science Foundation of China; National Key Research and Development Program of China;

DOI:

10.1609/aaai.v33i01.33015981

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Video super-resolution is a challenging task that has attracted great attention in the research and industry communities. In this paper, we propose a novel end-to-end architecture, called Residual Invertible Spatio-Temporal Network (RISTN), for video super-resolution. RISTN can sufficiently exploit spatial information from low resolution to high resolution and effectively model the temporal consistency of consecutive video frames. Compared with existing approaches based on recurrent convolutional networks, RISTN is much deeper but more efficient. It consists of three major components. In the spatial component, a lightweight residual invertible block is designed to reduce information loss during feature transformation and provide robust feature representations. In the temporal component, a novel recurrent convolutional model with residual dense connections is proposed to construct a deeper network and avoid feature degradation. In the reconstruction component, a new fusion method based on a sparse strategy is proposed to integrate the spatial and temporal features. Experiments on public benchmark datasets demonstrate that RISTN outperforms state-of-the-art methods.
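The abstract does not specify the internals of the residual invertible block. As a rough illustration of why an invertible transform loses no information, the sketch below uses a generic additive coupling scheme (in the style of reversible residual networks). This is an assumption made for illustration, not the block defined in the paper, and the functions `f`, `g`, `forward`, and `inverse` are hypothetical names.

```python
from typing import List, Tuple

Vector = List[float]


def f(x: Vector) -> Vector:
    """Hypothetical residual branch; any deterministic function would do."""
    return [0.5 * v * v + 1.0 for v in x]


def g(x: Vector) -> Vector:
    """Second hypothetical residual branch."""
    return [abs(v) - 0.25 * v for v in x]


def forward(x1: Vector, x2: Vector) -> Tuple[Vector, Vector]:
    """Additive coupling: each half is shifted by a function of the other half."""
    y1 = [a + b for a, b in zip(x1, f(x2))]
    y2 = [a + b for a, b in zip(x2, g(y1))]
    return y1, y2


def inverse(y1: Vector, y2: Vector) -> Tuple[Vector, Vector]:
    """Undo the coupling by subtracting the same shifts in reverse order."""
    x2 = [a - b for a, b in zip(y2, g(y1))]
    x1 = [a - b for a, b in zip(y1, f(x2))]
    return x1, x2


if __name__ == "__main__":
    x1, x2 = [1.0, -2.0, 0.5], [3.0, 0.25, -1.5]
    y1, y2 = forward(x1, x2)
    r1, r2 = inverse(y1, y2)
    # The reconstruction matches the input up to floating-point rounding.
    print(max(abs(a - b) for a, b in zip(x1 + x2, r1 + r2)))
```

Because `inverse` only subtracts what `forward` added, the input can be recovered even when `f` and `g` themselves are not invertible, which is the general sense in which such blocks avoid information loss.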

Pages: 5981-5988

Page count: 8
