FI-Net: A Lightweight Video Frame Interpolation Network Using Feature-Level Flow

被引:5
作者
Li, Haopeng [2 ]
Yuan, Yuan [1 ,2 ]
Wang, Qi [2 ]
机构
[1] Northwestern Polytech Univ, Sch Comp Sci, Xian 710072, Shaanxi, Peoples R China
[2] Northwestern Polytech Univ, Ctr OPT IMagery Anal & Learning, Xian 710072, Shaanxi, Peoples R China
基金
中国国家自然科学基金;
关键词
Video frame interpolation; lightweight network; feature-level flow; Sobolev loss;
D O I
10.1109/ACCESS.2019.2936549
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Video frame interpolation is a classic computer vision task that aims to generate in-between frames given two consecutive frames. In this paper, a flow-based interpolation method (FI-Net) is proposed. FI-Net is a lightweight end-to-end neural network that takes two frames in arbitrary size as input and outputs the estimated intermediate frame. Novelly, it computes optical flow at feature level instead of image level. Such practice can increase the accuracy of estimated flow. Multi-scale technique is utilized to handle large motions. For training, a comprehensive loss function that contains a novel content loss (Sobolev loss) and a semantic loss is introduced. It forces the generated frame to be close to the ground truth one at both pixel level and semantic level. We compare FI-Net with previous methods and it achieves higher performance with less time consumption and much smaller model size.
引用
收藏
页码:118287 / 118296
页数:10
相关论文
共 43 条
  • [1] [Anonymous], 2015, IEEE I CONF COMP VIS, DOI DOI 10.1109/ICCV.2015.123
  • [2] [Anonymous], IEEE T PATTERN ANAL
  • [3] [Anonymous], 2017, ARXIV PREPRINT ARXIV
  • [4] Baker Simon, 2007, 2007 11th IEEE International Conference on Computer Vision, P1
  • [5] Bruna Joan, 2016, P 4 INT C LEARN REPR
  • [6] A Naturalistic Open Source Movie for Optical Flow Evaluation
    Butler, Daniel J.
    Wulff, Jonas
    Stanley, Garrett B.
    Black, Michael J.
    [J]. COMPUTER VISION - ECCV 2012, PT VI, 2012, 7577 : 611 - 625
  • [7] Chen XK, 2017, AER ADV ENG RES, V100, P1
  • [8] FlowNet: Learning Optical Flow with Convolutional Networks
    Dosovitskiy, Alexey
    Fischer, Philipp
    Ilg, Eddy
    Haeusser, Philip
    Hazirbas, Caner
    Golkov, Vladimir
    van der Smagt, Patrick
    Cremers, Daniel
    Brox, Thomas
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 2758 - 2766
  • [9] Learning Spatiotemporal Features with 3D Convolutional Networks
    Du Tran
    Bourdev, Lubomir
    Fergus, Rob
    Torresani, Lorenzo
    Paluri, Manohar
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 4489 - 4497
  • [10] FlowNet 2.0: Evolution of Optical Flow Estimation with Deep Networks
    Ilg, Eddy
    Mayer, Nikolaus
    Saikia, Tonmoy
    Keuper, Margret
    Dosovitskiy, Alexey
    Brox, Thomas
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 1647 - 1655