Capturing Small, Fast-Moving Objects: Frame Interpolation via Recurrent Motion Enhancement

Cited by: 19

Authors:

Hu, Mengshun [1]

Xiao, Jing [1]

Liao, Liang [2]

Wang, Zheng [1]

Lin, Chia-Wen [3,4]

Wang, Mi [5]

Satoh, Shin'ichi [2]

Affiliations:

[1] Wuhan Univ, Sch Comp Sci, Wuhan 430072, Peoples R China

[2] Natl Inst Informat, Digital Content & Media Sci Res Div, Tokyo 1018430, Japan

[3] Natl Tsing Hua Univ, Dept Elect Engn, Hsinchu 30013, Taiwan

[4] Natl Tsing Hua Univ, Inst Commun Engn, Hsinchu 30013, Taiwan

[5] Wuhan Univ, State Key Lab Informat Engn Surveying Mapping & R, Wuhan 430072, Peoples R China

Source:

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY | 2022 / Vol. 32 / No. 06

Funding:

National Natural Science Foundation of China;

Keywords:

Interpolation; Optical feedback; Adaptive optics; Optical imaging; Kernel; Motion estimation; Estimation; Video frame interpolation; recurrent feedback; motion enhancement; large motions;

DOI:

10.1109/TCSVT.2021.3110796

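Chinese Library Classification (CLC) number:

TM [Electrical Engineering]; TN [Electronic Technology, Communication Technology];

Discipline classification codes: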

0808 ; 0809 ;

Abstract:

Interpolating video frames involving large motions remains an elusive challenge. When frames contain small, fast-moving objects, conventional feed-forward neural network-based approaches, which estimate optical flow and then synthesize in-between frames sequentially, often lose motion features and therefore produce blurred boundaries. To address this problem, we propose a novel Recurrent Motion-Enhanced Interpolation Network (ReMEI-Net) that assigns attention to the motion features of small objects from both intra-scale and inter-scale perspectives. First, we add recurrent feedback blocks to the existing multi-scale autoencoder pipeline to iteratively enhance the motion information of small objects across different scales. Second, to further refine the motion features of fast-moving objects, we propose a Multi-Directional ConvLSTM (MD-ConvLSTM) block that captures the global spatial contextual information of motion from multiple directions. In this way, the coarse-scale features can be used to correct and enhance the fine-scale features through the feedback mechanism. Extensive experiments on various datasets demonstrate the superiority of our proposed method over state-of-the-art approaches in terms of clearer object locations and more complete shapes.
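The idea of gathering spatial context "from multiple directions" can be illustrated with a small sketch. This is not the authors' MD-ConvLSTM: it has no gates, no convolutions, and no learned weights. It assumes a plain recurrence `h = x + decay * h_prev` run along rows and columns in both orientations, and the names `scan_rows`, `transpose`, `multi_directional_context`, and the `decay` parameter are hypothetical, introduced only for illustration.

```python
def scan_rows(grid, decay, reverse=False):
    """Run a simple recurrence along each row: h = x + decay * h_prev.

    Assumption: this ungated recurrence stands in for a ConvLSTM cell.
    """
    out = []
    for row in grid:
        cells = row[::-1] if reverse else row
        h, acc = 0.0, []
        for x in cells:
            h = x + decay * h
            acc.append(h)
        # Restore the original left-to-right ordering of the row.
        out.append(acc[::-1] if reverse else acc)
    return out


def transpose(grid):
    """Swap rows and columns so that column scans can reuse scan_rows."""
    return [list(col) for col in zip(*grid)]


def multi_directional_context(grid, decay=0.5):
    """Average four directional scans of a 2D grid (hypothetical helper)."""
    left_to_right = scan_rows(grid, decay)
    right_to_left = scan_rows(grid, decay, reverse=True)
    top_to_bottom = transpose(scan_rows(transpose(grid), decay))
    bottom_to_top = transpose(scan_rows(transpose(grid), decay, reverse=True))
    scans = [left_to_right, right_to_left, top_to_bottom, bottom_to_top]
    rows, cols = len(grid), len(grid[0])
    return [
        [sum(s[i][j] for s in scans) / 4.0 for j in range(cols)]
        for i in range(rows)
    ]


# A single "small object" in the middle of an otherwise empty 3x3 feature map.
feature_map = [
    [0.0, 0.0, 0.0],
    [0.0, 1.0, 0.0],
    [0.0, 0.0, 0.0],
]
context = multi_directional_context(feature_map, decay=0.5)
```

In this toy setting the response of the single active cell spreads to its horizontal and vertical neighbours, which is the kind of direction-aware context the abstract attributes to the MD-ConvLSTM block. A real implementation would replace the fixed recurrence with learned, gated convolutional updates.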

Pages: 3390-3406

Number of pages: 17

