ASVFI: AUDIO-DRIVEN SPEAKER VIDEO FRAME INTERPOLATION

Cited by: 0
Authors
Wang, Qianrui [1]
Li, Dengshi [1]
Liao, Liang [2]
Song, Hao [1]
Li, Wei [1]
Xiao, Jing [3]
Affiliations
[1] Jianghan Univ, Sch Artificial Intelligence, Wuhan, Peoples R China
[2] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore, Singapore
[3] Wuhan Univ, Natl Engn Res Ctr Multimedia Software, Wuhan, Peoples R China
Source
2023 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP | 2023
Keywords
Speaker video; video frame interpolation; audio
DOI
10.1109/ICIP49359.2023.10222345
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Limited transmission bandwidth keeps the video frame rate low during online conferences, which severely degrades the user experience. Video frame interpolation can address this by synthesizing intermediate frames to raise the frame rate. Most existing video frame interpolation methods rely on a linear motion assumption. Mouth motion, however, is nonlinear, so these methods cannot generate high-quality intermediate frames for speaker video. Exploiting the strong correlation between mouth shape and vocalization, the authors propose a new method named Audio-driven Speaker Video Frame Interpolation (ASVFI). First, the audio feature is extracted by Audio Net (ANet). Second, the Video Net (VNet) encoder extracts the video feature. Finally, the audio and video features are fused by AVFusion, and the VNet decoder decodes the intermediate frame. Experimental results show that, when interpolating one frame, the PSNR is nearly 0.13 dB higher than the baseline; when interpolating seven frames, the PSNR is 0.33 dB higher than the baseline.
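The abstract reports its gains in PSNR for one and seven interpolated frames. The sketch below is not taken from the paper; it only illustrates the standard PSNR definition and, as an assumption, evenly spaced time positions for the interpolated frames. The function names `psnr` and `intermediate_times`, the flat-list frame representation, and the 8-bit peak value of 255 are all hypothetical choices made for this illustration.

```python
import math


def psnr(reference, candidate, max_value=255.0):
    """Peak signal-to-noise ratio in dB between two frames.

    Frames are given as flat sequences of pixel intensities of equal
    length (an assumption made to keep the sketch dependency-free).
    """
    if len(reference) != len(candidate) or len(reference) == 0:
        raise ValueError("frames must be non-empty and the same size")
    # Mean squared error over all pixels.
    mse = sum((r - c) ** 2 for r, c in zip(reference, candidate)) / len(reference)
    if mse == 0:
        return math.inf  # identical frames
    return 10.0 * math.log10(max_value ** 2 / mse)


def intermediate_times(num_frames):
    """Normalized time positions in (0, 1) for num_frames interpolated frames.

    Assumes the new frames are evenly spaced between the two input frames,
    so one frame sits at 0.5 and seven frames sit at 1/8, 2/8, ..., 7/8.
    """
    if num_frames < 1:
        raise ValueError("num_frames must be at least 1")
    return [k / (num_frames + 1) for k in range(1, num_frames + 1)]


if __name__ == "__main__":
    ground_truth = [10, 20, 30, 40]
    interpolated = [12, 18, 33, 40]
    print(f"PSNR: {psnr(ground_truth, interpolated):.2f} dB")
    print("one frame at:", intermediate_times(1))
    print("seven frames at:", intermediate_times(7))
```

Because PSNR is logarithmic in the mean squared error, a difference of a few tenths of a dB, such as the 0.13 dB and 0.33 dB gains quoted above, corresponds to a reduction in mean squared error of a few percent.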
Pages: 3200-3204
Page count: 5