Capturing Small, Fast-Moving Objects: Frame Interpolation via Recurrent Motion Enhancement

被引:19
作者
Hu, Mengshun [1 ]
Xiao, Jing [1 ]
Liao, Liang [2 ]
Wang, Zheng [1 ]
Lin, Chia-Wen [3 ,4 ]
Wang, Mi [5 ]
Satoh, Shin'ichi [2 ]
机构
[1] Wuhan Univ, Sch Comp Sci, Wuhan 430072, Peoples R China
[2] Natl Inst Informat, Digital Content & Media Sci Res Div, Tokyo 1018430, Japan
[3] Natl Tsing Hua Univ, Dept Elect Engn, Hsinchu 30013, Taiwan
[4] Natl Tsing Hua Univ, Inst Commun Engn, Hsinchu 30013, Taiwan
[5] Wuhan Univ, State Key Lab Informat Engn Surveying Mapping & R, Wuhan 430072, Peoples R China
基金
中国国家自然科学基金;
关键词
Interpolation; Optical feedback; Adaptive optics; Optical imaging; Kernel; Motion estimation; Estimation; Video frame interpolation; recurrent feedback; motion enhancement; large motions;
D O I
10.1109/TCSVT.2021.3110796
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Interpolating video frames involving large motions remains an elusive challenge. In case that frames involve small and fast-moving objects, conventional feed-forward neural network-based approaches that estimate optical flow and synthesize in-between frames sequentially often result in loss of motion features and thus blurred boundaries. To address the problem, we propose a novel Recurrent Motion-Enhanced Interpolation Network (ReMEI-Net) by assigning attention to the motion features of small objects from both the intra-scale and inter-scale perspectives. Specifically, we add recurrent feedback blocks in the existing multi-scale autoencoder pipeline, aiming to iteratively enhance the motion information of small objects across different scales. Second, to further refine the motion features of the highly moving objects, we propose a Multi-Directional ConvLSTM (MD-ConvLSTM) block to capture the global spatial contextual information of motion from multiple directions. In this way, the coarse-scale features can be utilized to correct and enhance the fine-scale features through the feedback mechanism. Extensive experiments on various datasets demonstrate the superiority of our proposed method over state-of-the-art approaches in terms of clear locations and complete shape.
引用
收藏
页码:3390 / 3406
页数:17
相关论文
共 59 条
  • [1] A Database and Evaluation Methodology for Optical Flow
    Baker, Simon
    Scharstein, Daniel
    Lewis, J. P.
    Roth, Stefan
    Black, Michael J.
    Szeliski, Richard
    [J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2011, 92 (01) : 1 - 31
  • [2] Depth-Aware Video Frame Interpolation
    Bao, Wenbo
    Lai, Wei-Sheng
    Ma, Chao
    Zhang, Xiaoyun
    Gao, Zhiyong
    Yang, Ming-Hsuan
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 3698 - 3707
  • [3] MEMC-Net: Motion Estimation and Motion Compensation Driven Neural Network for Video Interpolation and Enhancement
    Bao, Wenbo
    Lai, Wei-Sheng
    Zhang, Xiaoyun
    Gao, Zhiyong
    Yang, Ming-Hsuan
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (03) : 933 - 948
  • [4] Inside-Outside Net: Detecting Objects in Context with Skip Pooling and Recurrent Neural Networks
    Bell, Sean
    Zitnick, C. Lawrence
    Bala, Kavita
    Girshick, Ross
    [J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 2874 - 2883
  • [5] ContextVP: Fully Context-Aware Video Prediction
    Byeon, Wonmin
    Wang, Qin
    Srivastava, Rupesh Kumar
    Koumoutsakos, Petros
    [J]. COMPUTER VISION - ECCV 2018, PT XVI, 2018, 11220 : 781 - 797
  • [6] A method for motion adaptive frame rate up-conversion
    Castagno, R
    Haavisto, P
    Ramponi, G
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 1996, 6 (05) : 436 - 446
  • [7] Multiple Video Frame Interpolation via Enhanced Deformable Separable Convolution
    Cheng, Xianhang
    Chen, Zhenzhong
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (10) : 7029 - 7045
  • [8] Cheng XH, 2020, AAAI CONF ARTIF INTE, V34, P10607
  • [9] A Multi-Scale Position Feature Transform Network for Video Frame Interpolation
    Cheng, Xianhang
    Chen, Zhenzhong
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 30 (11) : 3968 - 3981
  • [10] Motion-compensated frame interpolation using bilateral motion estimation and adaptive overlapped block motion compensation
    Choi, Byeong-Doo
    Han, Jong-Woo
    Kim, Chang-Su
    Ko, Sung-Jea
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2007, 17 (04) : 407 - 416