GOP-based Deep Preprocessing for Video Coding

被引:1
|
作者
Arai, Daichi [1 ]
Iwamura, Shunsuke [1 ]
Iguchi, Kazuhisa [1 ]
Ichigaya, Atsuro [1 ]
机构
[1] NHK Japan Broadcasting Corp, Sci & Technol Res Labs, Tokyo, Japan
来源
2024 PICTURE CODING SYMPOSIUM, PCS 2024 | 2024年
关键词
Video coding; preprocessing; neural network; learned video compression; group of pictures;
D O I
10.1109/PCS60826.2024.10566387
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Neural network-based video preprocessing techniques have recently shown remarkable improvements in video codec performance. However, conventional preprocessing methods tend to prioritize perceptual quality over peak signal-to-noise ratio (PSNR), a key standard for video quality assessment. In this study, We propose a novel deep preprocessing method based on a group of pictures (GOP) structure, specifically aimed at enhancing the rate-distortion performance in terms of PSNR. This approach involves developing a video compression model that employs the GOP structure of the target video codec and training a preprocessing model through joint optimization with the video compression model. Experimental results demonstrate that our GOP-based deep preprocessing method not only improves PSNR but also elevates other quality metrics, including VMAF, across various codecs like MPEG-2, HEVC, and VVC. Additionally, ablation studies highlight the critical role of GOP structures in enhancing encoding efficiency based on PSNR.
引用
收藏
页数:5
相关论文
共 50 条
  • [1] GOP-based unequal error protection for scalable video over packet erasure channel
    Wang, Yu
    Chau, Lap-Pui
    Yap, Kim-Hui
    2008 IEEE INTERNATIONAL SYMPOSIUM ON BROADBAND MULTIMEDIA SYSTEMS AND BROADCASTING, 2008, : 23 - 26
  • [2] GOP-based channel rate allocation using genetic algorithm for scalable video streaming over error-prone networks
    Fang, Tao
    Chau, Lap-Pui
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2006, 15 (06) : 1323 - 1330
  • [3] Rate-GOP Based Rate Control for High Efficiency Video Coding
    Wang, Shanshe
    Ma, Siwei
    Wang, Shiqi
    Zhao, Debin
    Gao, Wen
    IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2013, 7 (06) : 1101 - 1111
  • [4] Pipelines for HDR Video Coding Based on Luminance Independent Chromaticity Preprocessing
    Mahmalat, Samir
    Aydin, Tunc Ozan
    Smolic, Aljosa
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2018, 28 (12) : 3467 - 3477
  • [5] Just-Noticeable-Quantization-Distortion Based Preprocessing for Perceptual Video Coding
    Ki, Sehwan
    Kim, Munchurl
    Ko, Hyunsuk
    2017 IEEE VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2017,
  • [6] Shared Memory Tile-Based vs Hybrid Memory GOP-Based Parallel Algorithms for HEVC Encoder
    Migallon, Hector
    Lopez-Granado, Otoniel
    Galiano, Vicente
    Pinol, Pablo
    Malumbres, Manuel P.
    ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2016, 2016, 10048 : 521 - 528
  • [7] Deep Learning-Based Video Coding: A Review and a Case Study
    Liu, Dong
    Li, Yue
    Lin, Jianping
    Li, Houqiang
    Wu, Feng
    ACM COMPUTING SURVEYS, 2020, 53 (01)
  • [8] New method for reducing GOP-boundary artifacts in wavelet-based video coding
    Wang, Demin
    Zhang, Liang
    Vincent, Andre
    IEEE TRANSACTIONS ON BROADCASTING, 2006, 52 (03) : 350 - 355
  • [9] GOP Structure-Independent Quantization Parameter Cascading in Video Coding
    Xu, Yiwen
    Yi, Shiqi
    Lin, Liqun
    Chen, Weiling
    Zhao, Tiesong
    IEEE ACCESS, 2019, 7 : 76274 - 76282
  • [10] LUMINANCE INDEPENDENT CHROMATICITY PREPROCESSING FOR HDR VIDEO CODING
    Mahmalat, Samir
    Stefanoski, Nikolce
    Luginbuehl, Daniel
    Aydin, Tunc Ozan
    Smolic, Aljosa
    2016 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2016, : 1389 - 1393