Optimizing Video Prediction via Video Frame Interpolation

被引:28
作者
Wu, Yue [1 ]
Wen, Qiang [1 ]
Chen, Qifeng [1 ]
机构
[1] Hong Kong Univ Sci & Technol, Hong Kong, Peoples R China
来源
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022) | 2022年
关键词
D O I
10.1109/CVPR52688.2022.01729
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Video prediction is an extrapolation task that predicts future frames given past frames, and video frame interpolation is an interpolation task that estimates intermediate frames between two frames. We have witnessed the tremendous advancement of video frame interpolation, but the general video prediction in the wild is still an open question. Inspired by the photo-realistic results of video frame interpolation, we present a new optimization framework for video prediction via video frame interpolation, in which we solve an extrapolation problem based on an interpolation model. Our video prediction framework is based on optimization with a pretrained differentiable video frame interpolation module without the need for a training dataset, and thus there is no domain gap issue between training and test data. Also, our approach does not need any additional information such as semantic or instance maps, which makes our framework applicable to any video. Extensive experiments on the Cityscapes, KITTI, DAVIS, Middlebury, and Vimeo9OK datasets show that our video prediction results are robust in general scenarios, and our approach outperforms other video prediction methods that require a large amount of training data or extra semantic information.
引用
收藏
页码:17793 / 17802
页数:10
相关论文
共 49 条
[1]  
[Anonymous], 2017, INT C LEARN REPR ICL
[2]  
[Anonymous], 2018, NeurIPS
[3]  
[Anonymous], 2018, CVPR, DOI DOI 10.1109/CVPR.2018.00059
[4]  
[Anonymous], 2019, CVPR, DOI DOI 10.1109/CVPR.2019.00786
[5]  
[Anonymous], 2020, CVPR, DOI DOI 10.1109/CVPR42600.2020.00548
[6]  
[Anonymous], 2016, NeurIPS
[7]  
[Anonymous], 2017, CVPR, DOI DOI 10.1109/CVPR.2017.244
[8]   MEMC-Net: Motion Estimation and Motion Compensation Driven Neural Network for Video Interpolation and Enhancement [J].
Bao, Wenbo ;
Lai, Wei-Sheng ;
Zhang, Xiaoyun ;
Gao, Zhiyong ;
Yang, Ming-Hsuan .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (03) :933-948
[9]  
Bei Xinzhu, 2021, CVPR
[10]   NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis [J].
Ben Mildenhall ;
Srinivasan, Pratul P. ;
Tancik, Matthew ;
Barron, Jonathan T. ;
Ramamoorthi, Ravi ;
Ng, Ren .
COMPUTER VISION - ECCV 2020, PT I, 2020, 12346 :405-421