GOP-based Deep Preprocessing for Video Coding

被引:1
作者
Arai, Daichi [1 ]
Iwamura, Shunsuke [1 ]
Iguchi, Kazuhisa [1 ]
Ichigaya, Atsuro [1 ]
机构
[1] NHK Japan Broadcasting Corp, Sci & Technol Res Labs, Tokyo, Japan
来源
2024 PICTURE CODING SYMPOSIUM, PCS 2024 | 2024年
关键词
Video coding; preprocessing; neural network; learned video compression; group of pictures;
D O I
10.1109/PCS60826.2024.10566387
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Neural network-based video preprocessing techniques have recently shown remarkable improvements in video codec performance. However, conventional preprocessing methods tend to prioritize perceptual quality over peak signal-to-noise ratio (PSNR), a key standard for video quality assessment. In this study, We propose a novel deep preprocessing method based on a group of pictures (GOP) structure, specifically aimed at enhancing the rate-distortion performance in terms of PSNR. This approach involves developing a video compression model that employs the GOP structure of the target video codec and training a preprocessing model through joint optimization with the video compression model. Experimental results demonstrate that our GOP-based deep preprocessing method not only improves PSNR but also elevates other quality metrics, including VMAF, across various codecs like MPEG-2, HEVC, and VVC. Additionally, ablation studies highlight the critical role of GOP structures in enhancing encoding efficiency based on PSNR.
引用
收藏
页数:5
相关论文
共 50 条
  • [41] Video Coding Algorithm Based on High Efficiency Video Coding (HEVC) and Hybrid Transforms
    Wang, Chengyou
    Shan, Rongyang
    Zhou, Xiao
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2018, 12 (09): : 4448 - 4466
  • [42] Improvements in DCT based video coding
    Puri, A
    Schmidt, RL
    Haskell, BG
    VISUAL COMMUNICATIONS AND IMAGE PROCESSING '97, PTS 1-2, 1997, 3024 : 676 - 688
  • [43] Lapped transform based video coding
    Tran, TD
    Tu, C
    APPLICATIONS OF DIGITAL IMAGE PROCESSING XXIV, 2001, 4472 : 319 - 333
  • [44] ADAPTIVE INTRA PERIOD SIZE FOR DEEP LEARNING-BASED SCREEN CONTENT VIDEO CODING
    Wu, Yuyang
    Xie, Liang
    Sun, Shangkun
    Gao, Wei
    Yan, Yiqiang
    2024 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO WORKSHOPS, ICMEW 2024, 2024,
  • [45] Light Field Image Compression Based on Preprocessing and High Efficiency Coding
    Perra, Cristian
    2016 24TH TELECOMMUNICATIONS FORUM (TELFOR), 2016, : 917 - 920
  • [46] DEEP VIRTUAL REFERENCE FRAME GENERATION FOR MULTIVIEW VIDEO CODING
    Lei, Jianjun
    Zhang, Zongqian
    Liu, Dong
    Chen, Ying
    Ling, Nam
    2020 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2020, : 1123 - 1127
  • [47] DEEP INCREMENTAL OPTICAL FLOW CODING FOR LEARNED VIDEO COMPRESSION
    Chang, Chih-Peng
    Chen, Peng-Yu
    Ho, Yung-Han
    Peng, Wen-Hsiao
    2022 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, ICIP, 2022, : 3988 - 3992
  • [48] Hierarchical Random Access Coding for Deep Neural Video Compression
    Thang, Nguyen Van
    Bang, Le Van
    IEEE ACCESS, 2023, 11 : 57494 - 57502
  • [49] Dual Learning-based Video Coding with Inception Dense Blocks
    Liu, Chao
    Sun, Heming
    Chen, Jun'an
    Cheng, Zhengxue
    Takeuchi, Masaru
    Katto, Jiro
    Zeng, Xiaoyang
    Fan, Yibo
    2019 PICTURE CODING SYMPOSIUM (PCS), 2019,
  • [50] A study on secure coding of intelligent inspection video in plant areas based on improved deep reinforcement learning
    Yongmin Yang
    Zhenhao Wang
    Discover Applied Sciences, 7 (1)