FI-Net: A Lightweight Video Frame Interpolation Network Using Feature-Level Flow

被引：5

作者：

Li, Haopeng ^{[2
]}

Yuan, Yuan ^{[1
,2
]}

Wang, Qi ^{[2
]}

机构：

[1] Northwestern Polytech Univ, Sch Comp Sci, Xian 710072, Shaanxi, Peoples R China

[2] Northwestern Polytech Univ, Ctr OPT IMagery Anal & Learning, Xian 710072, Shaanxi, Peoples R China

来源：

IEEE ACCESS | 2019年 / 7卷

基金：

中国国家自然科学基金;

关键词：

Video frame interpolation; lightweight network; feature-level flow; Sobolev loss;

D O I：

10.1109/ACCESS.2019.2936549

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Video frame interpolation is a classic computer vision task that aims to generate in-between frames given two consecutive frames. In this paper, a flow-based interpolation method (FI-Net) is proposed. FI-Net is a lightweight end-to-end neural network that takes two frames in arbitrary size as input and outputs the estimated intermediate frame. Novelly, it computes optical flow at feature level instead of image level. Such practice can increase the accuracy of estimated flow. Multi-scale technique is utilized to handle large motions. For training, a comprehensive loss function that contains a novel content loss (Sobolev loss) and a semantic loss is introduced. It forces the generated frame to be close to the ground truth one at both pixel level and semantic level. We compare FI-Net with previous methods and it achieves higher performance with less time consumption and much smaller model size.

引用

页码：118287 / 118296

页数：10

共 43 条

[1] [Anonymous], 2015, IEEE I CONF COMP VIS, DOI DOI 10.1109/ICCV.2015.123
[2] [Anonymous], IEEE T PATTERN ANAL
[3] [Anonymous], 2017, ARXIV PREPRINT ARXIV
[4] Baker Simon, 2007, 2007 11th IEEE International Conference on Computer Vision, P1
[5] Bruna Joan, 2016, P 4 INT C LEARN REPR
[6] A Naturalistic Open Source Movie for Optical Flow Evaluation
Butler, Daniel J.
Wulff, Jonas
Stanley, Garrett B.
Black, Michael J.
[J]. COMPUTER VISION - ECCV 2012, PT VI, 2012, 7577 : 611 - 625
[7] Chen XK, 2017, AER ADV ENG RES, V100, P1
[8] FlowNet: Learning Optical Flow with Convolutional Networks
Dosovitskiy, Alexey
Fischer, Philipp
Ilg, Eddy
Haeusser, Philip
Hazirbas, Caner
Golkov, Vladimir
van der Smagt, Patrick
Cremers, Daniel
Brox, Thomas
[J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 2758 - 2766
[9] Learning Spatiotemporal Features with 3D Convolutional Networks
Du Tran
Bourdev, Lubomir
Fergus, Rob
Torresani, Lorenzo
Paluri, Manohar
[J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 4489 - 4497
[10] FlowNet 2.0: Evolution of Optical Flow Estimation with Deep Networks
Ilg, Eddy
Mayer, Nikolaus
Saikia, Tonmoy
Keuper, Margret
Dosovitskiy, Alexey
Brox, Thomas
[J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 1647 - 1655

← 1 2 3 4 5 →