Textural Detail Preservation Network for Video Frame Interpolation

Authors
Yoon, Kihwan [1, 2]
Huh, Jingang [1]
Kim, Yong Han [2]
Kim, Sungjei [1]
Jeong, Jinwoo [1]
Affiliations
[1] Korea Elect Technol Inst KETI, Seongnam Si 13488, Gyeonggi Do, South Korea
[2] Univ Seoul, Sch Elect & Comp Engn, Seoul 02504, South Korea
Keywords
Video frame interpolation; textural detail preservation; perceptual loss; synthesis network; enhancement
DOI
10.1109/ACCESS.2023.3294964
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
The subjective image quality of a Video Frame Interpolation (VFI) result depends on whether image features such as edges, textures and blobs are preserved. With the development of deep learning, various algorithms have been proposed and the objective results of VFI have significantly improved. Moreover, perceptual loss has been used to preserve image features, which improves subjective quality. Despite these quality enhancements, no analysis has addressed how to preserve specific features in the interpolated frames. Therefore, we conducted an analysis aimed at preserving textural detail, such as film grain noise, which can represent the texture of an image, and weak textures, such as droplets or particles. Based on our analysis, we identify the importance of synthesis networks in textural detail preservation and propose an enhanced synthesis network, the Textural Detail Preservation Network (TDPNet). Furthermore, based on our analysis, we propose a Perceptual Training Method (PTM) to address the degradation of Peak Signal-to-Noise Ratio (PSNR) that occurs when perceptual loss is simply applied, and to preserve more textural detail. We also propose a Multi-scale Resolution Training Method (MRTM) to address the poor performance observed when testing on datasets whose resolution differs from that of the training dataset. In experiments, the proposed network outperformed state-of-the-art VFI algorithms in LPIPS and DISTS on the Vimeo90K, HD, SNU-FILM and UVG datasets, and its subjective results were also better. Furthermore, applying PTM improved PSNR by an average of 0.293 dB compared with simply applying perceptual loss.
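To put the reported 0.293 dB figure in context, the following is a minimal sketch of the standard PSNR definition, 10·log10(peak² / MSE), together with the mean-squared-error ratio that a given PSNR gain corresponds to. It is not taken from the paper: the function names `psnr` and `mse_ratio_for_gain`, the flat-list pixel representation and the 8-bit peak value of 255 are all assumptions made for illustration.

```python
import math


def psnr(reference, test, peak=255.0):
    """PSNR in dB between two equally sized sequences of pixel values.

    Assumes 8-bit pixels (peak=255) unless another peak is given.
    """
    if len(reference) != len(test):
        raise ValueError("inputs must have the same number of pixels")
    # Mean squared error over all pixels.
    mse = sum((r - t) ** 2 for r, t in zip(reference, test)) / len(reference)
    if mse == 0:
        return math.inf  # identical images
    return 10.0 * math.log10(peak ** 2 / mse)


def mse_ratio_for_gain(gain_db):
    """Factor by which the MSE shrinks for a PSNR gain of gain_db."""
    return 10.0 ** (-gain_db / 10.0)


if __name__ == "__main__":
    # Hypothetical 4-pixel example, not data from the paper.
    print(psnr([10, 20, 30, 40], [12, 18, 33, 40]))
    print(mse_ratio_for_gain(0.293))
```

Under this definition, a 0.293 dB gain on a single frame corresponds to an MSE roughly 6.5% lower. Because the abstract reports an average over datasets, this per-frame conversion is only a rough guide.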
Pages: 71994-72006
Number of pages: 13