Residual Invertible Spatio-Temporal Network for Video Super-Resolution

被引:53
作者
Zhu, Xiaobin [1 ,2 ]
Li, Zhuangzi [2 ]
Zhang, Xiao-Yu [3 ]
Li, Changsheng [4 ]
Liu, Yaqi [3 ]
Xue, Ziyu [5 ]
机构
[1] Univ Sci & Technol Beijing, Sch Comp & Commun Engn, Beijing, Peoples R China
[2] Beijing Technol & Business Univ, Sch Comp & Informat Engn, Beijing, Peoples R China
[3] Chinese Acad Sci, Inst Informat Engn, Beijing, Peoples R China
[4] Univ Elect Sci & Technol China, Hefei, Anhui, Peoples R China
[5] NRTA, Acad Broadcasting Sci, lnformat Technol Inst, Beijing, Peoples R China
来源
THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE | 2019年
基金
中国国家自然科学基金; 国家重点研发计划;
关键词
D O I
10.1609/aaai.v33i01.33015981
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Video super-resolution is a challenging task, which has attracted great attention in research and industry communities. In this paper, we propose a novel end-to-end architecture, called Residual Invertible Spatio-Temporal Network (RISTN) for video super-resolution. The RISTN can sufficiently exploit the spatial information from low-resolution to high-resolution, and effectively models the temporal consistency from consecutive video frames. Compared with existing recurrent convolutional network based approaches, RISTN is much deeper but more efficient. It consists of three major components: In the spatial component, a lightweight residual invertible block is designed to reduce information loss during feature transformation and provide robust feature representations. In the temporal component, a novel recurrent convolutional model with residual dense connections is proposed to construct deeper network and avoid feature degradation. In the reconstruction component, a new fusion method based on the sparse strategy is proposed to integrate the spatial and temporal features. Experiments on public benchmark datasets demonstrate that RISTN outperforms the state-of-the-art methods.
引用
收藏
页码:5981 / 5988
页数:8
相关论文
共 50 条
  • [31] Patch-based spatio-temporal super-resolution for video with non-rigid motion
    Salvador, Jordi
    Kochale, Axel
    Schweidler, Siegfried
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2013, 28 (05) : 483 - 493
  • [32] Sparse Spatio-Temporal Representation with Adaptive Regularized Dictionaries for Super-Resolution Based Video Coding
    Pan, Zhiming
    Xiong, Hongkai
    2012 DATA COMPRESSION CONFERENCE (DCC), 2012, : 139 - 148
  • [33] A Novel Zero-Shot Real World Spatio-Temporal Super-Resolution (ZS-RW-STSR) Model for Video Super-Resolution
    Shukla, Ankit
    Upadhyay, Avinash
    Sharma, Manoj
    Saini, Anil
    Fatema, Nuzhat
    Malik, Hasmat
    Afthanorhan, Asyraf
    Hossaini, Mohammad Asef
    IEEE ACCESS, 2024, 12 : 123969 - 123984
  • [34] AttGAN: attention gated generative adversarial network for spatio-temporal super-resolution of ocean phenomena
    Liu, Yanni
    Wang, Xinjie
    Yuan, Chunxin
    Xu, Jiexin
    Wei, Zhiqiang
    Nie, Jie
    INTERNATIONAL JOURNAL OF DIGITAL EARTH, 2024, 17 (01)
  • [35] Spatio-temporal Super-resolution with Photographic and Depth Data using GANs
    Lim, Steffen
    Khan, Sams
    Alessandro, Matteo
    McFall, Kevin
    PROCEEDINGS OF THE 2019 ANNUAL ACM SOUTHEAST CONFERENCE (ACMSE 2019), 2019, : 262 - 263
  • [36] Infrared Thermal Imaging Super-Resolution via Multiscale Spatio-Temporal Feature Fusion Network
    Zhang, Wenhui
    Sui, Xiubao
    Gu, Guohua
    Chen, Qian
    Cao, Heyang
    IEEE SENSORS JOURNAL, 2021, 21 (17) : 19176 - 19185
  • [37] STIFS: Spatio-Temporal Input Frame Selection for Learning-based Video Super-Resolution Models
    Baniya, Arbind Agrahari
    Lee, Tsz-Kwan
    Eklund, Peter W.
    Aryal, Sunil
    SIGMAP: PROCEEDINGS OF THE 19TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING AND MULTIMEDIA APPLICATIONS, 2022, : 48 - 58
  • [38] iSeeBetter: Spatio-temporal video super-resolution using recurrent generative back-projection networks
    Chadha, Aman
    Britto, John
    Roja, M. Mani
    COMPUTATIONAL VISUAL MEDIA, 2020, 6 (03) : 307 - 317
  • [39] iSeeBetter: Spatio-temporal video super-resolution using recurrent generative back-projection networks
    Aman Chadha
    John Britto
    M.Mani Roja
    ComputationalVisualMedia, 2020, 6 (03) : 307 - 317
  • [40] Spatio-Temporal Adaptive Super-Resolution Reconstruction Model Based on Zernike Moment for Spatial Video Sequences
    Liang Meiyu
    Du Junping
    Lee, JangMyung
    Liu Honggang
    Zhang Yun
    CHINA COMMUNICATIONS, 2012, 9 (12) : 93 - 107