Autoencoder Model Exploration for Multi-Layer Video Compression

被引:1
作者
Cancellier, Luiz Henrique [1 ]
Grellert, Mateus [1 ]
Guntzel, Jose Luis [1 ]
da Silva Cruz, Luis A. [2 ,3 ]
机构
[1] Fed Univ Santa Catarina UFSC, INE, PPGCC, Florianopolis, SC, Brazil
[2] Univ Coimbra, Dept Elect & Comp Engn, Coimbra, Portugal
[3] Inst Telecomunicacoes, Coimbra, Portugal
来源
2022 10TH EUROPEAN WORKSHOP ON VISUAL INFORMATION PROCESSING (EUVIP) | 2022年
关键词
Video Compression; Layered Video Coding; Neural Networks; Autoencoder; Intra;
D O I
10.1109/EUVIP53989.2022.9922780
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The use of autoencoder models for image and video compression have been explored by a number of works published in recent years. While those works perform the original data compression in a single layer, in this work we propose the use of autoencoder models in two-layered video coding. The adoption of multi-layer encoder provides scalability and allows us for decoupling the traditional video coding implementation from the NN solutions. By restricting the use of the Neural Network (NN) solution in the enhancement layer, it becomes possible to decode the base layer bitstream without the necessity of running the decoding process with the NN. We implemented and evaluated two autoencoder models: one using a symmetric encoder/decoder architecture, and an asymmetric alternative that employs more layers on the decoder side. The models were trained to compress residues for a scenario using All Intra encoding with spatial scalability. The Asymmetric model outperformed the Symmetric one by providing better compression rates and quality results, which is confirmed by the respective BD-Rate and BD-PSNR average results of -17.06% and 0.7dB, respectively.
引用
收藏
页数:6
相关论文
共 16 条
  • [1] Ascenso J., 2019, 1SC29WG1 ISOIEC JTC
  • [2] Balle Johannes, 2016, arXiv
  • [3] Bjontegaard G., 2001, P 13 VCEG M AUST TX
  • [4] Bossen F., 2013, JCTVCL1100 ITUT ISO
  • [5] Overview of SHVC: Scalable Extensions of the High Efficiency Video Coding Standard
    Boyce, Jill M.
    Ye, Yan
    Chen, Jianle
    Ramasubramonian, Adarsh K.
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2016, 26 (01) : 20 - 34
  • [6] Interlayer Restoration Deep Neural Network for Scalable High Efficiency Video Coding
    He, Gang
    Xu, Li
    Lei, Jie
    Xie, Weiying
    Li, Yunsong
    Fan, Yibo
    Zhou, Jinjia
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (05) : 3217 - 3234
  • [7] Deep Inter Prediction with Error-Corrected Auto-Regressive Network for Video Coding
    Hu, Yuzhang
    Yang, Wenhan
    Liu, Jiaying
    Guo, Zongming
    [J]. ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (01)
  • [8] ISO Central Secretary, 2020, 230903 ISOIEC DIS
  • [9] CUBIC CONVOLUTION INTERPOLATION FOR DIGITAL IMAGE-PROCESSING
    KEYS, RG
    [J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1981, 29 (06): : 1153 - 1160
  • [10] DVC: An End-to-end Deep Video Compression Framework
    Lu, Guo
    Ouyang, Wanli
    Xu, Dong
    Zhang, Xiaoyun
    Cai, Chunlei
    Gao, Zhiyong
    [J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 10998 - 11007