Residual Invertible Spatio-Temporal Network for Video Super-Resolution

Cited by: 53
Authors
Zhu, Xiaobin [1 ,2 ]
Li, Zhuangzi [2 ]
Zhang, Xiao-Yu [3 ]
Li, Changsheng [4 ]
Liu, Yaqi [3 ]
Xue, Ziyu [5 ]
Affiliations
[1] Univ Sci & Technol Beijing, Sch Comp & Commun Engn, Beijing, Peoples R China
[2] Beijing Technol & Business Univ, Sch Comp & Informat Engn, Beijing, Peoples R China
[3] Chinese Acad Sci, Inst Informat Engn, Beijing, Peoples R China
[4] Univ Elect Sci & Technol China, Hefei, Anhui, Peoples R China
[5] NRTA, Acad Broadcasting Sci, Informat Technol Inst, Beijing, Peoples R China
Source
THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE | 2019
Funding
National Natural Science Foundation of China; National Key Research and Development Program of China;
DOI
10.1609/aaai.v33i01.33015981
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Video super-resolution is a challenging task that has attracted great attention in the research and industry communities. In this paper, we propose a novel end-to-end architecture, called the Residual Invertible Spatio-Temporal Network (RISTN), for video super-resolution. RISTN can sufficiently exploit spatial information from low resolution to high resolution and effectively model the temporal consistency across consecutive video frames. Compared with existing approaches based on recurrent convolutional networks, RISTN is much deeper but more efficient. It consists of three major components. In the spatial component, a lightweight residual invertible block is designed to reduce information loss during feature transformation and provide robust feature representations. In the temporal component, a novel recurrent convolutional model with residual dense connections is proposed to construct a deeper network and avoid feature degradation. In the reconstruction component, a new fusion method based on a sparse strategy is proposed to integrate the spatial and temporal features. Experiments on public benchmark datasets demonstrate that RISTN outperforms state-of-the-art methods.
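The idea of an invertible block that limits information loss can be illustrated with a small sketch. This record does not describe the internals of the RISTN block, so the code below assumes the additive coupling scheme used in reversible residual networks as a stand-in. The function names (`forward`, `inverse`, `toy_f`, `toy_g`) and the toy transformations are hypothetical and chosen only for illustration.

```python
from typing import Callable, List, Tuple

Vector = List[float]
Transform = Callable[[Vector], Vector]


def add(a: Vector, b: Vector) -> Vector:
    """Element-wise sum of two equally sized vectors."""
    return [x + y for x, y in zip(a, b)]


def sub(a: Vector, b: Vector) -> Vector:
    """Element-wise difference of two equally sized vectors."""
    return [x - y for x, y in zip(a, b)]


def forward(x1: Vector, x2: Vector, f: Transform, g: Transform) -> Tuple[Vector, Vector]:
    """Additive coupling: each half is updated by a residual of the other half."""
    y1 = add(x1, f(x2))
    y2 = add(x2, g(y1))
    return y1, y2


def inverse(y1: Vector, y2: Vector, f: Transform, g: Transform) -> Tuple[Vector, Vector]:
    """Recover the inputs exactly by undoing the two residual updates in reverse order."""
    x2 = sub(y2, g(y1))
    x1 = sub(y1, f(x2))
    return x1, x2


def toy_f(v: Vector) -> Vector:
    """Hypothetical residual branch: scaled ReLU (f itself is not invertible)."""
    return [max(0.0, 0.5 * x) for x in v]


def toy_g(v: Vector) -> Vector:
    """Hypothetical residual branch: scaled square (g itself is not invertible)."""
    return [0.25 * x * x for x in v]


if __name__ == "__main__":
    x1, x2 = [1.0, -2.0], [3.0, 0.5]
    y1, y2 = forward(x1, x2, toy_f, toy_g)
    r1, r2 = inverse(y1, y2, toy_f, toy_g)
    print(r1, r2)
```

In this sketch the inputs can be reconstructed from the outputs even though neither `toy_f` nor `toy_g` is invertible on its own, which is the sense in which such a block avoids discarding information during feature transformation. In a real network the two halves would be channel groups of a feature map and the branches would be small convolutional stacks.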
Pages: 5981-5988
Number of pages: 8