A region-based hierarchical image compression method with simulated visual perception

被引:0
作者
Wang, Jianxu [1 ]
Zeng, Li [1 ,2 ]
机构
[1] Chongqing Univ, Coll Math & Stat, Chongqing 401331, Peoples R China
[2] Chongqing Univ, Engn Res Ctr Ind Computed Tomog, Nondestruct Testing Educ Minist China, Chongqing 400044, Peoples R China
基金
中国国家自然科学基金;
关键词
Image compression; Deep learning; Visual perception; Modified class activation mapping (CAM); Perceptual metric;
D O I
10.1016/j.dsp.2023.104339
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Most learned image compression methods focus on reducing the average code length of the image, which is unreasonable for the human visual system. Thus, appropriate bit allocation gains significance. This paper presents a novel end-to-end method for perceptual image compression, emphasizing content prioritization and a region-based hierarchical strategy. Firstly, our modified class activation mapping (CAM) can serve as the visual perception network for simulating human visual perception. On this basis, we develop an efficient compression system that leverages a discretized hybrid entropy model to allocate bits automatically according to the perceptual prioritization of image content. In our compression system, an image is compressed jointly and hierarchically with different standards, guided by the importance map extracted by the visual perception network. We introduce content-weighted perceptual metrics to evaluate visual quality more reasonably and objectively. Experiments on publicly available datasets demonstrate that our method outperforms traditional codecs and recent content-oriented learned methods in overall performance. Compared with other methods, the proposed method reconstructs images that are more friendly to the human visual system.
引用
收藏
页数:11
相关论文
共 37 条
[1]   NTIRE 2017 Challenge on Single Image Super-Resolution: Dataset and Study [J].
Agustsson, Eirikur ;
Timofte, Radu .
2017 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2017, :1122-1131
[2]   Deep dilated and densely connected parallel convolutional groups for compression artifacts reduction [J].
Amaranageswarao, Gadipudi ;
Deivalakshmi, S. ;
Ko, Seok-Bum .
DIGITAL SIGNAL PROCESSING, 2020, 106
[3]  
Ball‚ J, 2017, Arxiv, DOI arXiv:1611.01704
[4]  
Ball‚ J, 2018, Arxiv, DOI arXiv:1802.01436
[5]  
Begaint Jean, 2020, arXiv
[6]  
Bellard F., 2023, BPG image format
[7]  
Bellard F., 2023, Kodak lossless true color image suite
[8]  
Cheng ZX, 2020, INT CONF ACOUST SPEE, P2158, DOI [10.1109/icassp40776.2020.9053413, 10.1109/ICASSP40776.2020.9053413]
[9]   A Topological Loss Function for Deep-Learning Based Image Segmentation Using Persistent Homology [J].
Clough, James R. ;
Byrne, Nicholas ;
Oksuz, Ilkay ;
Zimmer, Veronika A. ;
Schnabel, Julia A. ;
King, Andrew P. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 44 (12) :8766-8778
[10]  
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848