Capturing Small, Fast-Moving Objects: Frame Interpolation via Recurrent Motion Enhancement

被引：19

作者：

Hu, Mengshun ^{[1
]}

Xiao, Jing ^{[1
]}

Liao, Liang ^{[2
]}

Wang, Zheng ^{[1
]}

Lin, Chia-Wen ^{[3
,4
]}

Wang, Mi ^{[5
]}

Satoh, Shin'ichi ^{[2
]}

机构：

[1] Wuhan Univ, Sch Comp Sci, Wuhan 430072, Peoples R China

[2] Natl Inst Informat, Digital Content & Media Sci Res Div, Tokyo 1018430, Japan

[3] Natl Tsing Hua Univ, Dept Elect Engn, Hsinchu 30013, Taiwan

[4] Natl Tsing Hua Univ, Inst Commun Engn, Hsinchu 30013, Taiwan

[5] Wuhan Univ, State Key Lab Informat Engn Surveying Mapping & R, Wuhan 430072, Peoples R China

来源：

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY | 2022年 / 32卷 / 06期

基金：

中国国家自然科学基金;

关键词：

Interpolation; Optical feedback; Adaptive optics; Optical imaging; Kernel; Motion estimation; Estimation; Video frame interpolation; recurrent feedback; motion enhancement; large motions;

D O I：

10.1109/TCSVT.2021.3110796

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Interpolating video frames involving large motions remains an elusive challenge. In case that frames involve small and fast-moving objects, conventional feed-forward neural network-based approaches that estimate optical flow and synthesize in-between frames sequentially often result in loss of motion features and thus blurred boundaries. To address the problem, we propose a novel Recurrent Motion-Enhanced Interpolation Network (ReMEI-Net) by assigning attention to the motion features of small objects from both the intra-scale and inter-scale perspectives. Specifically, we add recurrent feedback blocks in the existing multi-scale autoencoder pipeline, aiming to iteratively enhance the motion information of small objects across different scales. Second, to further refine the motion features of the highly moving objects, we propose a Multi-Directional ConvLSTM (MD-ConvLSTM) block to capture the global spatial contextual information of motion from multiple directions. In this way, the coarse-scale features can be utilized to correct and enhance the fine-scale features through the feedback mechanism. Extensive experiments on various datasets demonstrate the superiority of our proposed method over state-of-the-art approaches in terms of clear locations and complete shape.

引用

页码：3390 / 3406

页数：17

共 59 条

[1] A Database and Evaluation Methodology for Optical Flow
Baker, Simon
Scharstein, Daniel
Lewis, J. P.
Roth, Stefan
Black, Michael J.
Szeliski, Richard
[J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2011, 92 (01) : 1 - 31
[2] Depth-Aware Video Frame Interpolation
Bao, Wenbo
Lai, Wei-Sheng
Ma, Chao
Zhang, Xiaoyun
Gao, Zhiyong
Yang, Ming-Hsuan
[J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 3698 - 3707
[3] MEMC-Net: Motion Estimation and Motion Compensation Driven Neural Network for Video Interpolation and Enhancement
Bao, Wenbo
Lai, Wei-Sheng
Zhang, Xiaoyun
Gao, Zhiyong
Yang, Ming-Hsuan
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2021, 43 (03) : 933 - 948
[4] Inside-Outside Net: Detecting Objects in Context with Skip Pooling and Recurrent Neural Networks
Bell, Sean
Zitnick, C. Lawrence
Bala, Kavita
Girshick, Ross
[J]. 2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, : 2874 - 2883
[5] ContextVP: Fully Context-Aware Video Prediction
Byeon, Wonmin
Wang, Qin
Srivastava, Rupesh Kumar
Koumoutsakos, Petros
[J]. COMPUTER VISION - ECCV 2018, PT XVI, 2018, 11220 : 781 - 797
[6] A method for motion adaptive frame rate up-conversion
Castagno, R
Haavisto, P
Ramponi, G
[J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 1996, 6 (05) : 436 - 446
[7] Multiple Video Frame Interpolation via Enhanced Deformable Separable Convolution
Cheng, Xianhang
Chen, Zhenzhong
[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (10) : 7029 - 7045
[8] Cheng XH, 2020, AAAI CONF ARTIF INTE, V34, P10607
[9] A Multi-Scale Position Feature Transform Network for Video Frame Interpolation
Cheng, Xianhang
Chen, Zhenzhong
[J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2020, 30 (11) : 3968 - 3981
[10] Motion-compensated frame interpolation using bilateral motion estimation and adaptive overlapped block motion compensation
Choi, Byeong-Doo
Han, Jong-Woo
Kim, Chang-Su
Ko, Sung-Jea
[J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2007, 17 (04) : 407 - 416

← 1 2 3 4 5 6 →