Autoencoder Model Exploration for Multi-Layer Video Compression

被引：1

作者：

Cancellier, Luiz Henrique ^{[1
]}

Grellert, Mateus ^{[1
]}

Guntzel, Jose Luis ^{[1
]}

da Silva Cruz, Luis A. ^{[2
,3
]}

机构：

[1] Fed Univ Santa Catarina UFSC, INE, PPGCC, Florianopolis, SC, Brazil

[2] Univ Coimbra, Dept Elect & Comp Engn, Coimbra, Portugal

[3] Inst Telecomunicacoes, Coimbra, Portugal

来源：

2022 10TH EUROPEAN WORKSHOP ON VISUAL INFORMATION PROCESSING (EUVIP) | 2022年

关键词：

Video Compression; Layered Video Coding; Neural Networks; Autoencoder; Intra;

D O I：

10.1109/EUVIP53989.2022.9922780

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The use of autoencoder models for image and video compression have been explored by a number of works published in recent years. While those works perform the original data compression in a single layer, in this work we propose the use of autoencoder models in two-layered video coding. The adoption of multi-layer encoder provides scalability and allows us for decoupling the traditional video coding implementation from the NN solutions. By restricting the use of the Neural Network (NN) solution in the enhancement layer, it becomes possible to decode the base layer bitstream without the necessity of running the decoding process with the NN. We implemented and evaluated two autoencoder models: one using a symmetric encoder/decoder architecture, and an asymmetric alternative that employs more layers on the decoder side. The models were trained to compress residues for a scenario using All Intra encoding with spatial scalability. The Asymmetric model outperformed the Symmetric one by providing better compression rates and quality results, which is confirmed by the respective BD-Rate and BD-PSNR average results of -17.06% and 0.7dB, respectively.

引用

页数：6

共 16 条

[1] Ascenso J., 2019, 1SC29WG1 ISOIEC JTC
[2] Balle Johannes, 2016, arXiv
[3] Bjontegaard G., 2001, P 13 VCEG M AUST TX
[4] Bossen F., 2013, JCTVCL1100 ITUT ISO
[5] Overview of SHVC: Scalable Extensions of the High Efficiency Video Coding Standard
Boyce, Jill M.
Ye, Yan
Chen, Jianle
Ramasubramonian, Adarsh K.
[J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2016, 26 (01) : 20 - 34
[6] Interlayer Restoration Deep Neural Network for Scalable High Efficiency Video Coding
He, Gang
Xu, Li
Lei, Jie
Xie, Weiying
Li, Yunsong
Fan, Yibo
Zhou, Jinjia
[J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (05) : 3217 - 3234
[7] Deep Inter Prediction with Error-Corrected Auto-Regressive Network for Video Coding
Hu, Yuzhang
Yang, Wenhan
Liu, Jiaying
Guo, Zongming
[J]. ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 19 (01)
[8] ISO Central Secretary, 2020, 230903 ISOIEC DIS
[9] CUBIC CONVOLUTION INTERPOLATION FOR DIGITAL IMAGE-PROCESSING
KEYS, RG
[J]. IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1981, 29 (06): : 1153 - 1160
[10] DVC: An End-to-end Deep Video Compression Framework
Lu, Guo
Ouyang, Wanli
Xu, Dong
Zhang, Xiaoyun
Cai, Chunlei
Gao, Zhiyong
[J]. 2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2019), 2019, : 10998 - 11007

← 1 2 →