Capturing Small, Fast-Moving Objects: Frame Interpolation via Recurrent Motion Enhancement

被引:19
作者
Hu, Mengshun [1 ]
Xiao, Jing [1 ]
Liao, Liang [2 ]
Wang, Zheng [1 ]
Lin, Chia-Wen [3 ,4 ]
Wang, Mi [5 ]
Satoh, Shin'ichi [2 ]
机构
[1] Wuhan Univ, Sch Comp Sci, Wuhan 430072, Peoples R China
[2] Natl Inst Informat, Digital Content & Media Sci Res Div, Tokyo 1018430, Japan
[3] Natl Tsing Hua Univ, Dept Elect Engn, Hsinchu 30013, Taiwan
[4] Natl Tsing Hua Univ, Inst Commun Engn, Hsinchu 30013, Taiwan
[5] Wuhan Univ, State Key Lab Informat Engn Surveying Mapping & R, Wuhan 430072, Peoples R China
基金
中国国家自然科学基金;
关键词
Interpolation; Optical feedback; Adaptive optics; Optical imaging; Kernel; Motion estimation; Estimation; Video frame interpolation; recurrent feedback; motion enhancement; large motions;
D O I
10.1109/TCSVT.2021.3110796
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Interpolating video frames involving large motions remains an elusive challenge. In case that frames involve small and fast-moving objects, conventional feed-forward neural network-based approaches that estimate optical flow and synthesize in-between frames sequentially often result in loss of motion features and thus blurred boundaries. To address the problem, we propose a novel Recurrent Motion-Enhanced Interpolation Network (ReMEI-Net) by assigning attention to the motion features of small objects from both the intra-scale and inter-scale perspectives. Specifically, we add recurrent feedback blocks in the existing multi-scale autoencoder pipeline, aiming to iteratively enhance the motion information of small objects across different scales. Second, to further refine the motion features of the highly moving objects, we propose a Multi-Directional ConvLSTM (MD-ConvLSTM) block to capture the global spatial contextual information of motion from multiple directions. In this way, the coarse-scale features can be utilized to correct and enhance the fine-scale features through the feedback mechanism. Extensive experiments on various datasets demonstrate the superiority of our proposed method over state-of-the-art approaches in terms of clear locations and complete shape.
引用
收藏
页码:3390 / 3406
页数:17
相关论文
共 59 条
  • [11] Triple-Frame-Based Bi-Directional Motion Estimation for Motion-Compensated Frame Interpolation
    Choi, Giyong
    Heo, PyeongGang
    Park, HyunWook
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2019, 29 (05) : 1251 - 1258
  • [12] Choi M, 2020, AAAI CONF ARTIF INTE, V34, P10663
  • [13] FlowNet: Learning Optical Flow with Convolutional Networks
    Dosovitskiy, Alexey
    Fischer, Philipp
    Ilg, Eddy
    Haeusser, Philip
    Hazirbas, Caner
    Golkov, Vladimir
    van der Smagt, Patrick
    Cremers, Daniel
    Brox, Thomas
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015, : 2758 - 2766
  • [14] DeepStereo: Learning to Predict New Views from the World's Imagery
    Flynn, John
    Neulander, Ivan
    Philbin, James
    Snavely, Noah
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 5515 - 5524
  • [15] Feature-based motion compensated interpolation for frame rate up-conversion
    Guo, Dabo
    Shao, Ling
    Han, Jungong
    [J]. NEUROCOMPUTING, 2014, 123 : 390 - 397
  • [16] Motion-compensated frame interpolation with weighted motion estimation and hierarchical vector refinement
    Guo, Dan
    Lu, Zhihong
    [J]. NEUROCOMPUTING, 2016, 181 : 76 - 85
  • [17] Haoxian Zhang, 2020, Computer Vision - ECCV 2020. 16th European Conference. Proceedings. Lecture Notes in Computer Science (LNCS 12370), P474, DOI 10.1007/978-3-030-58595-2_29
  • [18] Hu MS, 2020, INT CONF ACOUST SPEE, P4347, DOI [10.1109/ICASSP40776.2020.9053223, 10.1109/icassp40776.2020.9053223]
  • [19] Direction-aware Spatial Context Features for Shadow Detection
    Hu, Xiaowei
    Zhu, Lei
    Fu, Chi-Wing
    Qin, Jing
    Heng, Pheng-Ann
    [J]. 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 7454 - 7462
  • [20] Densely Connected Convolutional Networks
    Huang, Gao
    Liu, Zhuang
    van der Maaten, Laurens
    Weinberger, Kilian Q.
    [J]. 30TH IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2017), 2017, : 2261 - 2269