Multi-Memory Convolutional Neural Network for Video Super-Resolution

被引:155
作者
Wang, Zhongyuan [1 ]
Yi, Peng [1 ]
Jiang, Kui [1 ]
Jiang, Junjun [2 ,3 ]
Han, Zhen [1 ]
Lu, Tao [4 ]
Ma, Jiayi [5 ]
机构
[1] Wuhan Univ, Sch Comp, Natl Engn Res Ctr Multimedia Software, Wuhan 430072, Hubei, Peoples R China
[2] Harbin Inst Technol, Sch Comp Sci & Technol, Harbin 150001, Heilongjiang, Peoples R China
[3] Peng Cheng Lab, Shenzhen 518055, Peoples R China
[4] Wuhan Inst Technol, Sch Comp Sci & Engn, Wuhan 430205, Hubei, Peoples R China
[5] Wuhan Univ, Elect Informat Sch, Wuhan 430072, Hubei, Peoples R China
基金
中国国家自然科学基金;
关键词
Convolutional neural network; video super resolution; long short-term memory; multi-memory residual block; ALGORITHM;
D O I
10.1109/TIP.2018.2887017
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Video super-resolution (SR) is focused on reconstructing high-resolution frames from consecutive low-resolution (LR) frames. Most previous video SR methods based on convolutional neural networks (CNN) use a direct connection and single-memory module within the network, and thus, they fail to make full use of spatio-temporal complementary information from LR observed frames. To fully exploit spatio-temporal correlations between adjacent LR frames and reveal more realistic details, this paper proposes a multi-memory CNN (MMCNN) for video SR, cascading an optical flow network and an image-reconstruction network. A series of residual blocks engaged in utilizing intra-frame spatial correlations is proposed for feature extraction and reconstruction. Particularly, instead of using a single-memory module, we embed convolutional long short-term memory into the residual block, thus forming a multi-memory residual block to progressively extract and retain inter-frame temporal correlations between the consecutive LR frames. We conduct extensive experiments on numerous testing datasets with respect to different scaling factors. Our proposed MMCNN shows superiority over the state-of-the-art methods in terms of PSNR and visual quality and surpasses the best counterpart method by 1 dB at most.
引用
收藏
页码:2530 / 2544
页数:15
相关论文
共 55 条
  • [1] [Anonymous], 2013, IEEE T PATTERN ANAL, DOI DOI 10.1109/TPAMI.2012.59
  • [2] [Anonymous], PROC CVPR IEEE
  • [3] [Anonymous], 2015, NIPS 15 P 28 INT C N
  • [4] [Anonymous], P IEEE C COMP VIS PA
  • [5] [Anonymous], VIDEOSET4
  • [6] [Anonymous], MYANM
  • [7] [Anonymous], CODE OF LAPSRN
  • [8] [Anonymous], 2017, PROC CVPR IEEE, DOI DOI 10.1109/CVPR.2017.618
  • [9] [Anonymous], 2015, PROC 28 INT C NEURAL
  • [10] Contour Detection and Hierarchical Image Segmentation
    Arbelaez, Pablo
    Maire, Michael
    Fowlkes, Charless
    Malik, Jitendra
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2011, 33 (05) : 898 - 916