Video Frame Interpolation for Large Motion with Generative Prior

被引:0
|
作者
Huang, Yuheng [1 ]
Jia, Xu [1 ]
Su, Xin [1 ]
Zhang, Lu [1 ]
Li, Xiaomin [1 ]
Wang, Qinghe [1 ]
Lu, Huchuan [1 ]
机构
[1] Dalian Univ Technol, Dalian, Peoples R China
来源
PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT X | 2025年 / 15040卷
基金
中国国家自然科学基金;
关键词
Video frame interpolation; Pre-trained diffusion model; Large motions;
D O I
10.1007/978-981-97-8792-0_28
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Video Frame Interpolation (VFI) is a challenging task, especially when scenarios involve large motions. Most existing methods are based on optical flow, which is difficult to predict when large motions exist. Additionally, due to their lack of prior image knowledge, they tend to generate intermediate frames with artifacts if the predicted optical flow is wrong. In this paper, we propose a novel method based on a pre-trained latent diffusion model (LDM). Firstly, we freeze most of the parameters to preserve the rich image prior knowledge and powerful generation capabilities of the LDM. Secondly, we inflate our model to handle videos and adopt a multi-scale spatial-temporal attention module to enhance the ability to process large motions. Finally, information from the input frames is utilized to assist in reconstructing details in the output frames, further enhancing the quality of the output frames. The experimental results demonstrate that our method achieves excellent performance in both natural and animated videos with large motions. Specifically, our method achieves state-of-the-art performance on the animated dataset, showcasing remarkable outputs with nearly no artifacts.
引用
收藏
页码:402 / 415
页数:14
相关论文
共 50 条
  • [1] Sparse Global Matching for Video Frame Interpolation with Large Motion
    Liu, Chunxu
    Zhang, Guozhen
    Zhao, Rui
    Wang, Limin
    2024 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2024, : 19125 - 19134
  • [2] FILM: Frame Interpolation for Large Motion
    Reda, Fitsum
    Kontkanen, Janne
    Tabellion, Eric
    Sun, Deqing
    Pantofaru, Caroline
    Curless, Brian
    COMPUTER VISION, ECCV 2022, PT VII, 2022, 13667 : 250 - 266
  • [3] Progressive Motion Boosting for Video Frame Interpolation
    Xiao, Jing
    Xu, Kangmin
    Hu, Mengshun
    Liao, Liang
    Wang, Zheng
    Lin, Chia-Wen
    Wang, Mi
    Satoh, Shin'ichi
    IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 25 : 8076 - 8090
  • [4] Motion-Aware Video Frame Interpolation
    Han, Pengfei
    Zhang, Fuhua
    Zhao, Bin
    Li, Xuelong
    NEURAL NETWORKS, 2024, 178
  • [5] A Motion Distillation Framework for Video Frame Interpolation
    Zhou, Shili
    Tan, Weimin
    Yan, Bo
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 3728 - 3740
  • [6] MOTION FEEDBACK DESIGN FOR VIDEO FRAME INTERPOLATION
    Hu, Mengshun
    Liao, Liang
    Xiao, Jing
    Gu, Lin
    Satoh, Shin'ichi
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 4347 - 4351
  • [7] Efficient Video Frame Interpolation Using Generative Adversarial Networks
    Tran, Quang Nhat
    Yang, Shih-Hsuan
    APPLIED SCIENCES-BASEL, 2020, 10 (18):
  • [8] Asymmetric Bilateral Motion Estimation for Video Frame Interpolation
    Park, Junheum
    Lee, Chul
    Kim, Chang-Su
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 14519 - 14528
  • [9] Robust Video Frame Interpolation With Exceptional Motion Map
    Park, Minho
    Kim, Hak Gu
    Lee, Sangmin
    Ro, Yong Man
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (02) : 754 - 764
  • [10] Multi-Level Adaptive Separable Convolution for Large-Motion Video Frame Interpolation
    Wijma, Ruth
    You, Shaodi
    Li, Yu
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2021), 2021, : 1127 - 1135