GOP-based Deep Preprocessing for Video Coding

被引:1
|
作者
Arai, Daichi [1 ]
Iwamura, Shunsuke [1 ]
Iguchi, Kazuhisa [1 ]
Ichigaya, Atsuro [1 ]
机构
[1] NHK Japan Broadcasting Corp, Sci & Technol Res Labs, Tokyo, Japan
来源
2024 PICTURE CODING SYMPOSIUM, PCS 2024 | 2024年
关键词
Video coding; preprocessing; neural network; learned video compression; group of pictures;
D O I
10.1109/PCS60826.2024.10566387
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Neural network-based video preprocessing techniques have recently shown remarkable improvements in video codec performance. However, conventional preprocessing methods tend to prioritize perceptual quality over peak signal-to-noise ratio (PSNR), a key standard for video quality assessment. In this study, We propose a novel deep preprocessing method based on a group of pictures (GOP) structure, specifically aimed at enhancing the rate-distortion performance in terms of PSNR. This approach involves developing a video compression model that employs the GOP structure of the target video codec and training a preprocessing model through joint optimization with the video compression model. Experimental results demonstrate that our GOP-based deep preprocessing method not only improves PSNR but also elevates other quality metrics, including VMAF, across various codecs like MPEG-2, HEVC, and VVC. Additionally, ablation studies highlight the critical role of GOP structures in enhancing encoding efficiency based on PSNR.
引用
收藏
页数:5
相关论文
共 50 条
  • [11] Study on deep CNN as preprocessing for video compression
    Bhosale, Kavita Arjun
    Kuk, Seungho
    Park, Sang-hyo
    APPLICATIONS OF DIGITAL IMAGE PROCESSING XLIV, 2021, 11842
  • [12] ADAPTIVE GOP SIZE DECISION FOR MULTI-PASS VIDEO CODING BASED ON HIDDEN MARKOV MODEL
    Li, Bohan
    Han, Jingning
    Xu, Yaowu
    2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 1575 - 1579
  • [13] Distributed video coding supporting hierarchical GOP structures with transmitted motion vectors
    Kyung-Yeon Min
    Woong Lim
    Junghak Nam
    Donggyu Sim
    Ivan V Bajić
    EURASIP Journal on Image and Video Processing, 2015
  • [14] Distributed video coding supporting hierarchical GOP structures with transmitted motion vectors
    Min, Kyung-Yeon
    Lim, Woong
    Nam, Junghak
    Sim, Donggyu
    Bajic, Ivan V.
    EURASIP JOURNAL ON IMAGE AND VIDEO PROCESSING, 2015,
  • [15] Depth Video Inter Coding Based on Deep Frame Generation
    Li, Ge
    Lei, Jianjun
    Pan, Zhaoqing
    Peng, Bo
    Ling, Nam
    IEEE TRANSACTIONS ON BROADCASTING, 2024, 70 (02) : 708 - 718
  • [16] New adaptive filters as perceptual preprocessing for rate-quality performance optimization of video coding
    Vidal, Eloise
    Sturmel, Nicolas
    Guillemot, Christine
    Corlay, Patrick
    Coudoux, Francois-Xavier
    SIGNAL PROCESSING-IMAGE COMMUNICATION, 2017, 52 : 124 - 137
  • [17] Deep learning-based video quality enhancement for the new versatile video coding
    Bouaafia, Soulef
    Khemiri, Randa
    Messaoud, Seifeddine
    Ben Ahmed, Olfa
    Sayadi, Fatma Ezahra
    NEURAL COMPUTING & APPLICATIONS, 2022, 34 (17) : 14135 - 14149
  • [18] Deep learning-based video quality enhancement for the new versatile video coding
    Soulef Bouaafia
    Randa Khemiri
    Seifeddine Messaoud
    Olfa Ben Ahmed
    Fatma Ezahra Sayadi
    Neural Computing and Applications, 2022, 34 : 14135 - 14149
  • [19] Introducing GOP-Level Quantization Parameter Offset In High Efficiency Video Coding
    Xu, L.
    Zhu, C.
    Zhou, Y.
    Wang, Y.
    Gao, Y.
    2016 IEEE INTERNATIONAL SYMPOSIUM ON BROADBAND MULTIMEDIA SYSTEMS AND BROADCASTING (BMSB), 2016,
  • [20] Motion-compensated residue preprocessing in video coding based on just-noticeable-distortion profile
    Yang, XK
    Lin, WS
    Lu, ZK
    Ong, EP
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2005, 15 (06) : 742 - 752