GOP-based Deep Preprocessing for Video Coding

被引:1
作者
Arai, Daichi [1 ]
Iwamura, Shunsuke [1 ]
Iguchi, Kazuhisa [1 ]
Ichigaya, Atsuro [1 ]
机构
[1] NHK Japan Broadcasting Corp, Sci & Technol Res Labs, Tokyo, Japan
来源
2024 PICTURE CODING SYMPOSIUM, PCS 2024 | 2024年
关键词
Video coding; preprocessing; neural network; learned video compression; group of pictures;
D O I
10.1109/PCS60826.2024.10566387
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Neural network-based video preprocessing techniques have recently shown remarkable improvements in video codec performance. However, conventional preprocessing methods tend to prioritize perceptual quality over peak signal-to-noise ratio (PSNR), a key standard for video quality assessment. In this study, We propose a novel deep preprocessing method based on a group of pictures (GOP) structure, specifically aimed at enhancing the rate-distortion performance in terms of PSNR. This approach involves developing a video compression model that employs the GOP structure of the target video codec and training a preprocessing model through joint optimization with the video compression model. Experimental results demonstrate that our GOP-based deep preprocessing method not only improves PSNR but also elevates other quality metrics, including VMAF, across various codecs like MPEG-2, HEVC, and VVC. Additionally, ablation studies highlight the critical role of GOP structures in enhancing encoding efficiency based on PSNR.
引用
收藏
页数:5
相关论文
共 50 条
  • [31] PERCEPTUAL QUALITY BASED PACKET DROPPING FOR GENERALIZED VIDEO GOP STRUCTURES
    Lin, Ting-Lan
    Zhi, Yuan
    Kanumuri, Sandeep
    Cosman, Pamela C.
    Reibman, Amy R.
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 781 - 784
  • [32] A preprocessing approach to ROI video coding using variable Gaussian filters and variance in intensity
    Karlsson, LS
    Sjöström, M
    PROCEEDINGS ELMAR-2005, 2005, : 65 - 68
  • [33] An Optimized Video Capture and Preprocessing Strategy Based on FPGA
    Jiang, Hong Xu
    Li, Hui Yong
    Zhang, Ping
    Zhai, Dong Lin
    COMPUTER AND INFORMATION TECHNOLOGY, 2014, 519-520 : 708 - 713
  • [34] Multiple Resolution Prediction With Deep Up-Sampling for Depth Video Coding
    Li, Ge
    Lei, Jianjun
    Pan, Zhaoqing
    Peng, Bo
    Ling, Nam
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (09) : 6337 - 6346
  • [35] RETRACTED ARTICLE: An Improvised video coding algorithm for deep learning-based video transmission using HEVC
    A. Srinivasan
    G. Rohini
    Soft Computing, 2019, 23 : 8503 - 8514
  • [36] Deep Reference Frame Interpolation based Inter Prediction Enhancement for Versatile Video Coding
    Jia, Jianghao
    Liu, Zizheng
    Xu, Xiaozhong
    Liu, Shan
    Chen, Zhenzhong
    2022 IEEE INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2022,
  • [37] Motion-Aware Deep Video Coding Network
    Khan, Rida
    Liu, Ying
    BIG DATA II: LEARNING, ANALYTICS, AND APPLICATIONS, 2020, 11395
  • [38] A LIGHTWEIGHT MODEL FOR DEEP FRAME PREDICTION IN VIDEO CODING
    Choi, Hyomin
    Bajic, Ivan, V
    2020 54TH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS, AND COMPUTERS, 2020, : 1122 - 1126
  • [39] Error resilient GOP structures on video streaming
    Yang, Kai-Chao
    Huang, Chun-Ming
    Wang, Jia-Shung
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2007, 18 (02) : 151 - 161
  • [40] Multiview-Video-Plus-Depth Coding Based on the Advanced Video Coding Standard
    Hannuksela, Miska M.
    Rusanovskyy, Dmytro
    Su, Wenyi
    Chen, Lulu
    Li, Ri
    Aflaki, Payman
    Lan, Deyan
    Joachimiak, Michal
    Li, Houqiang
    Gabbouj, Moncef
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2013, 22 (09) : 3449 - 3458