Asymmetric Multi-Layer Compression: Decoupling Inter-Layer Coding Dependencies With a Learned Model

被引:0
作者
Cancellier, Luiz Henrique [1 ]
Brascher, Andre Beims [1 ]
Seidel, Ismael [1 ]
Grellert, Mateus [2 ]
Cruz, Luis A. Da Silva [3 ,4 ]
Guntzel, Jose Luis [1 ]
机构
[1] Fed Univ Santa Catarina UFSC, Dept Informat & Stat INE PPGCC, BR-88040900 Florianopolis, Brazil
[2] Fed Univ Rio Grande do Sul UFRGS, Inst Informat, BR-90010150 Porto Alegre, Brazil
[3] Univ Coimbra, Dept Elect & Comp Engn, P-3004531 Coimbra, Portugal
[4] Inst Telecomunicacoes, P-3030290 Coimbra, Portugal
关键词
image compression; learned compression; multi-layer compression; scalability; End-to-end optimization; ENHANCEMENT;
D O I
10.1109/ACCESS.2025.3542979
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Multi-layer compression is employed to achieve scalability by providing a base bitstream and additional enhancement layers to be used when extra bandwidth is available. In this work, we combine handcrafted and learned solutions, isolating the former in the base layer and using the latter in the enhancement layer. Moreover, the proposed Asymmetric Multi-layer Compression (AMLC) model decouples the base and enhancement layers at the encoding side, leveraging the end-to-end enhancement layer model to simplify the coding process. AMLC encodes an image 1.28x faster than a fully learned multi-layer codec while maintaining similar quality and coding efficiency despite the lower quality of the handcrafted base layer used in AMLC. While the proposed asymmetric encoder together with the handcrafted base layer reduce the overall complexity, the learned enhancement layer brings benefits in terms of coding efficiency. AMLC outperforms the coding efficiency of the Scalable HEVC (SHVC) reference software, a handcrafted multi-layer codec, achieving BD-Rate gains of-27.86%,-34.19%, and-26.57% for PSNR-YUV, PSNR-RGB, and MS-SSIM quality metrics, respectively. Finally, an equivalent implementation of the proposed model adding the inter-layer dependency on the encoding side produces results that are similar to the ones from AMLC's, confirming its potential to simplify multi-layer compression without significant losses in coding efficiency.
引用
收藏
页码:31671 / 31682
页数:12
相关论文
共 35 条
[1]   Scale-space flow for end-to-end optimized video compression [J].
Agustsson, Eirikur ;
Minnen, David ;
Johnston, Nick ;
Balle, Johannes ;
Hwang, Sung Jin ;
Toderici, George .
2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR 2020), 2020, :8500-8509
[2]  
[Anonymous], 2020, Multimedia Tools Appl., V79, P11699, DOI [10.1007/s11042-019-08572-3, DOI 10.1007/S11042-019-08572-3]
[3]  
Balle J., 2018, INT C LEARNING REPRE
[4]  
Ball‚ J, 2017, Arxiv, DOI arXiv:1611.01704
[5]   End-to-end optimization of nonlinear transform codes for perceptual quality [J].
Balle, Johannes ;
Laparra, Valero ;
Simoncelli, Eero P. .
2016 PICTURE CODING SYMPOSIUM (PCS), 2016,
[6]   Overview of the Low Complexity Enhancement Video Coding (LCEVC) Standard [J].
Battista, Stefano ;
Meardi, Guido ;
Ferrara, Simone ;
Ciccarelli, Lorenzo ;
Maurer, Florian ;
Conti, Massimo ;
Orcioni, Simone .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 32 (11) :7983-7995
[7]  
Begaint Jean., 2020, arXiv
[8]  
Benjak M., 2023, P IEEE INT C VIS COM, P1
[9]  
Bjontegaard G., 2001, VCEG-M33
[10]   CAESR: Conditional Autoencoder and Super-Resolution for Learned Spatial Scalability [J].
Bonnineau, Charles ;
Hamidouche, Wassim ;
Travers, Jean-Francois ;
Sidaty, Naty ;
Aubie, Jean-Yves ;
Deforges, Olivier .
2021 INTERNATIONAL CONFERENCE ON VISUAL COMMUNICATIONS AND IMAGE PROCESSING (VCIP), 2021,