GOP-based Deep Preprocessing for Video Coding

Cited by: 1
Authors
Arai, Daichi [1]
Iwamura, Shunsuke [1]
Iguchi, Kazuhisa [1]
Ichigaya, Atsuro [1]
Affiliations
[1] NHK Japan Broadcasting Corp, Sci & Technol Res Labs, Tokyo, Japan
Source
2024 PICTURE CODING SYMPOSIUM, PCS 2024 | 2024
Keywords
Video coding; preprocessing; neural network; learned video compression; group of pictures;
DOI
10.1109/PCS60826.2024.10566387
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Neural network-based video preprocessing techniques have recently shown remarkable improvements in video codec performance. However, conventional preprocessing methods tend to prioritize perceptual quality over peak signal-to-noise ratio (PSNR), a key standard for video quality assessment. In this study, we propose a novel deep preprocessing method based on a group of pictures (GOP) structure, specifically aimed at enhancing the rate-distortion performance in terms of PSNR. This approach involves developing a video compression model that employs the GOP structure of the target video codec and training a preprocessing model through joint optimization with the video compression model. Experimental results demonstrate that our GOP-based deep preprocessing method not only improves PSNR but also elevates other quality metrics, including VMAF, across various codecs such as MPEG-2, HEVC, and VVC. Additionally, ablation studies highlight the critical role of GOP structures in enhancing encoding efficiency based on PSNR.
Pages: 5