Flow Guidance Deformable Compensation Network for Video Frame Interpolation

被引:2
作者
Lei, Pengcheng [1 ,2 ]
Fang, Faming [1 ,2 ]
Zeng, Tieyong [3 ]
Zhang, Guixu [1 ,2 ]
机构
[1] East China Normal Univ, Sch Comp Sci & Technol, Shanghai 200062, Peoples R China
[2] East China Normal Univ, KLATASDS MOE, Shanghai 200062, Peoples R China
[3] Chinese Univ Hong Kong, Dept Math, Shenzhen 518172, Peoples R China
关键词
Task analysis; Estimation; Deformation; Interpolation; Convolution; Kernel; Optical imaging; Video frame interpolation; motion estimation; motion compensation; deformable convolution; distillation learning;
D O I
10.1109/TMM.2023.3289702
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Flow-based and deformable convolution (DConv)-based methods are two mainstream approaches for solving the video frame interpolation (VFI) problem, which have made remarkable progress with the development of deep convolutional networks over the past years. However, flow-based VFI methods often suffer from the inaccuracy of flow map estimation, especially in dealing with complex and irregular real-world motions. DConv-based VFI methods have advantages in handling complex motions, while the increased degree of freedom makes the training of the DConv model difficult. To address these problems, in this article, we propose a flow guidance deformable compensation network (FGDCN) for the VFI task. FGDCN decomposes the frame sampling process into two steps: a flow step and a deformation step. Specifically, the flow step utilizes a coarse-to-fine flow estimation network to directly estimate the intermediate flows and synthesizes an anchor frame simultaneously. To ensure the accuracy of the estimated flow, a distillation loss and a task-oriented loss are jointly employed in this step. Under the guidance of the flow priors learned in step one, the deformation step designs a new pyramid deformable compensation network to compensate for the missing details of the flow step. In addition, a pyramid loss is proposed to supervise the model in both the image and frequency domains. Experimental results show that the proposed algorithm achieves excellent performance on various datasets with fewer parameters.
引用
收藏
页码:1801 / 1812
页数:12
相关论文
共 50 条
  • [21] DSF-Net: Dual-Stream Fused Network for Video Frame Interpolation
    Zhang, Fuhua
    Yang, Chuang
    IEEE SIGNAL PROCESSING LETTERS, 2023, 30 : 1122 - 1126
  • [22] A Motion Distillation Framework for Video Frame Interpolation
    Zhou, Shili
    Tan, Weimin
    Yan, Bo
    IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 3728 - 3740
  • [23] Multi-Frame Pyramid Refinement Network for Video Frame Interpolation
    Zhang, Haoxian
    Wang, Ronggang
    Zhao, Yang
    IEEE ACCESS, 2019, 7 : 130610 - 130621
  • [24] SVMFI: speaker video multi-frame interpolation with the guidance of audio
    Wang, Qianrui
    Li, Dengshi
    Gao, Yu
    Chen, Aolei
    MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (40) : 88411 - 88428
  • [25] Textural Detail Preservation Network for Video Frame Interpolation
    Yoon, Kihwan
    Huh, Jingang
    Kim, Yong Han
    Kim, Sungjei
    Jeong, Jinwoo
    IEEE ACCESS, 2023, 11 : 71994 - 72006
  • [26] Video Frame Interpolation via Multi-scale Expandable Deformable Convolution
    Zhang, Dengyong
    Huang, Pu
    Ding, Xiangling
    Li, Feng
    Yang, Gaobo
    PROCEEDINGS OF THE 2023 ACM WORKSHOP ON INFORMATION HIDING AND MULTIMEDIA SECURITY, IH&MMSEC 2023, 2023, : 19 - 28
  • [27] Direct Video Frame Interpolation With Multiple Latent Encoders
    Kwon, Yong-Hoon
    Yoon, Ju Hong
    Park, Min-Gyu
    IEEE ACCESS, 2021, 9 : 32457 - 32466
  • [28] Robust Video Frame Interpolation With Exceptional Motion Map
    Park, Minho
    Kim, Hak Gu
    Lee, Sangmin
    Ro, Yong Man
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (02) : 754 - 764
  • [29] Video Frame Interpolation: A Comprehensive Survey
    Dong, Jiong
    Ota, Kaoru
    Dong, Mianxiong
    ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (02)
  • [30] A CONCATENATED MODEL FOR VIDEO FRAME INTERPOLATION
    Chen, Ying
    Smith, Mark J. T.
    2009 IEEE 13TH DIGITAL SIGNAL PROCESSING WORKSHOP & 5TH IEEE PROCESSING EDUCATION WORKSHOP, VOLS 1 AND 2, PROCEEDINGS, 2009, : 565 - 569