Video Frame Interpolation for Large Motion with Generative Prior

被引：0

作者：

Huang, Yuheng ^{[1
]}

Jia, Xu ^{[1
]}

Su, Xin ^{[1
]}

Zhang, Lu ^{[1
]}

Li, Xiaomin ^{[1
]}

Wang, Qinghe ^{[1
]}

Lu, Huchuan ^{[1
]}

机构：

[1] Dalian Univ Technol, Dalian, Peoples R China

来源：

PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT X | 2025年 / 15040卷

基金：

中国国家自然科学基金;

关键词：

Video frame interpolation; Pre-trained diffusion model; Large motions;

D O I：

10.1007/978-981-97-8792-0_28

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Video Frame Interpolation (VFI) is a challenging task, especially when scenarios involve large motions. Most existing methods are based on optical flow, which is difficult to predict when large motions exist. Additionally, due to their lack of prior image knowledge, they tend to generate intermediate frames with artifacts if the predicted optical flow is wrong. In this paper, we propose a novel method based on a pre-trained latent diffusion model (LDM). Firstly, we freeze most of the parameters to preserve the rich image prior knowledge and powerful generation capabilities of the LDM. Secondly, we inflate our model to handle videos and adopt a multi-scale spatial-temporal attention module to enhance the ability to process large motions. Finally, information from the input frames is utilized to assist in reconstructing details in the output frames, further enhancing the quality of the output frames. The experimental results demonstrate that our method achieves excellent performance in both natural and animated videos with large motions. Specifically, our method achieves state-of-the-art performance on the animated dataset, showcasing remarkable outputs with nearly no artifacts.

引用

页码：402 / 415

页数：14

共 50 条

[1] Motion-Aware Video Frame Interpolation
Han, Pengfei
Zhang, Fuhua
Zhao, Bin
Li, Xuelong
NEURAL NETWORKS, 2024, 178
[2] MOTION FEEDBACK DESIGN FOR VIDEO FRAME INTERPOLATION
Hu, Mengshun
Liao, Liang
Xiao, Jing
Gu, Lin
Satoh, Shin'ichi
2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 4347 - 4351
[3] Robust Video Frame Interpolation With Exceptional Motion Map
Park, Minho
Kim, Hak Gu
Lee, Sangmin
Ro, Yong Man
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (02) : 754 - 764
[4] A Motion Refinement Network With Local Compensation for Video Frame Interpolation
Wang, Kaiqiao
Liu, Peng
IEEE ACCESS, 2023, 11 : 103092 - 103101
[5] Fine-Grained Motion Estimation for Video Frame Interpolation
Yan, Bo
Tan, Weimin
Lin, Chuming
Shen, Liquan
IEEE TRANSACTIONS ON BROADCASTING, 2021, 67 (01) : 174 - 184
[6] Multi-Scale Attention Generative Adversarial Networks for Video Frame Interpolation
Xiao, Jian
Bi, Xiaojun
IEEE ACCESS, 2020, 8 : 94842 - 94851
[7] VIDEO FRAME INTERPOLATION VIA EXCEPTIONAL MOTION-AWARE SYNTHESIS
Park, Minho
Lee, Sangmin
Ro, Yong Man
2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 1958 - 1962
[8] Progressive Motion Context Refine Network for Efficient Video Frame Interpolation
Kong, Lingtong
Liu, Jinfeng
Yang, Jie
IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 2338 - 2342
[9] Video frame interpolation via down-up scale generative adversarial networks
Tran, Quang Nhat
Yang, Shih-Hsuan
COMPUTER VISION AND IMAGE UNDERSTANDING, 2022, 220
[10] TimeLens-XL: Real-Time Event-Based Video Frame Interpolation with Large Motion
Ma, Yongrui
Guo, Shi
Chen, Yutian
Xue, Tianfan
Gu, Jinwei
COMPUTER VISION - ECCV 2024, PT LXXXIV, 2025, 15142 : 178 - 194

← 1 2 3 4 5 →