Video Frame Interpolation for Large Motion with Generative Prior

被引:0
|
作者
Huang, Yuheng [1 ]
Jia, Xu [1 ]
Su, Xin [1 ]
Zhang, Lu [1 ]
Li, Xiaomin [1 ]
Wang, Qinghe [1 ]
Lu, Huchuan [1 ]
机构
[1] Dalian Univ Technol, Dalian, Peoples R China
来源
PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT X | 2025年 / 15040卷
基金
中国国家自然科学基金;
关键词
Video frame interpolation; Pre-trained diffusion model; Large motions;
D O I
10.1007/978-981-97-8792-0_28
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Video Frame Interpolation (VFI) is a challenging task, especially when scenarios involve large motions. Most existing methods are based on optical flow, which is difficult to predict when large motions exist. Additionally, due to their lack of prior image knowledge, they tend to generate intermediate frames with artifacts if the predicted optical flow is wrong. In this paper, we propose a novel method based on a pre-trained latent diffusion model (LDM). Firstly, we freeze most of the parameters to preserve the rich image prior knowledge and powerful generation capabilities of the LDM. Secondly, we inflate our model to handle videos and adopt a multi-scale spatial-temporal attention module to enhance the ability to process large motions. Finally, information from the input frames is utilized to assist in reconstructing details in the output frames, further enhancing the quality of the output frames. The experimental results demonstrate that our method achieves excellent performance in both natural and animated videos with large motions. Specifically, our method achieves state-of-the-art performance on the animated dataset, showcasing remarkable outputs with nearly no artifacts.
引用
收藏
页码:402 / 415
页数:14
相关论文
共 50 条
  • [21] Range-nullspace Video Frame Interpolation with Focalized Motion Estimation
    Yu, Zhiyang
    Zhang, Yu
    Zou, Dongqing
    Chen, Xijun
    Ren, Jimmy S.
    Ren, Shunqing
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 22159 - 22168
  • [22] Progressive Motion Context Refine Network for Efficient Video Frame Interpolation
    Kong, Lingtong
    Liu, Jinfeng
    Yang, Jie
    IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 2338 - 2342
  • [23] A motion-compensated video frame interpolation method with image inpainting
    Jia, Qian
    Yi, Benshun
    Xiao, Jinsheng
    Sichuan Daxue Xuebao (Gongcheng Kexue Ban)/Journal of Sichuan University (Engineering Science Edition), 2015, 47 (03): : 77 - 82
  • [24] Video frame interpolation via down-up scale generative adversarial networks
    Tran, Quang Nhat
    Yang, Shih-Hsuan
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2022, 220
  • [25] TimeLens-XL: Real-Time Event-Based Video Frame Interpolation with Large Motion
    Ma, Yongrui
    Guo, Shi
    Chen, Yutian
    Xue, Tianfan
    Gu, Jinwei
    COMPUTER VISION - ECCV 2024, PT LXXXIV, 2025, 15142 : 178 - 194
  • [26] HIGH FRAME RATE MOTION COMPENSATED FRAME INTERPOLATION IN HIGH-DEFINITION VIDEO PROCESSING
    Lee, Yen-Lin
    Truong Nguyen
    2010 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2010, : 858 - 861
  • [27] Extracting Motion and Appearance via Inter-Frame Attention for Efficient Video Frame Interpolation
    Zhang, Guozhen
    Zhu, Yuhan
    Wang, Haonan
    Chen, Youxin
    Wu, Gangshan
    Wang, Limin
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR, 2023, : 5682 - 5692
  • [28] PhaseNet for Video Frame Interpolation
    Meyer, Simone
    Djelouah, Abdelaziz
    McWilliams, Brian
    Sorkine-Hornung, Alexander
    Gross, Markus
    Schroers, Christopher
    2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, : 498 - 507
  • [29] Blurry Video Frame Interpolation
    Shen, Wang
    Bao, Wenbo
    Zhai, Guangtao
    Chen, Li
    Min, Xiongkuo
    Gao, Zhiyong
    2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 5113 - 5122
  • [30] Video Frame Interpolation Transformer
    Shi, Zhihao
    Xu, Xiangyu
    Liu, Xiaohong
    Chen, Jun
    Yang, Ming-Hsuan
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2022), 2022, : 17461 - 17470