A region-based hierarchical image compression method with simulated visual perception

Cited by: 0

Authors:

Wang, Jianxu [1]

Zeng, Li [1,2]

Affiliations:

[1] Chongqing Univ, Coll Math & Stat, Chongqing 401331, Peoples R China

[2] Chongqing Univ, Engn Res Ctr Ind Computed Tomog, Nondestruct Testing Educ Minist China, Chongqing 400044, Peoples R China

Source:

DIGITAL SIGNAL PROCESSING | 2024 / Vol. 145

Funding:

National Natural Science Foundation of China;

Keywords:

Image compression; Deep learning; Visual perception; Modified class activation mapping (CAM); Perceptual metric;

DOI:

10.1016/j.dsp.2023.104339

Chinese Library Classification (CLC) number:

TM [Electrical Engineering]; TN [Electronics and Communication Technology];

Discipline classification code:

0808 ; 0809 ;

Abstract:

Most learned image compression methods focus on reducing the average code length of the image, which is unreasonable for the human visual system. Thus, appropriate bit allocation gains significance. This paper presents a novel end-to-end method for perceptual image compression, emphasizing content prioritization and a region-based hierarchical strategy. Firstly, our modified class activation mapping (CAM) can serve as the visual perception network for simulating human visual perception. On this basis, we develop an efficient compression system that leverages a discretized hybrid entropy model to allocate bits automatically according to the perceptual prioritization of image content. In our compression system, an image is compressed jointly and hierarchically with different standards, guided by the importance map extracted by the visual perception network. We introduce content-weighted perceptual metrics to evaluate visual quality more reasonably and objectively. Experiments on publicly available datasets demonstrate that our method outperforms traditional codecs and recent content-oriented learned methods in overall performance. Compared with other methods, the proposed method reconstructs images that are more friendly to the human visual system.
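The abstract mentions content-weighted perceptual metrics but does not define them. As a rough illustration only, the sketch below computes a PSNR in which each pixel's squared error is weighted by an importance value, so that errors in perceptually important regions count more. The function name `weighted_psnr`, the flat-list pixel representation, and the weighting scheme are all assumptions made for this example and are not taken from the paper.

```python
import math


def weighted_psnr(original, reconstructed, importance, peak=255.0):
    """Importance-weighted PSNR over flat pixel sequences.

    Hypothetical sketch: `importance` stands in for an importance map
    (non-negative per-pixel weights). It is not the paper's metric.
    """
    if not (len(original) == len(reconstructed) == len(importance)):
        raise ValueError("inputs must have equal length")
    total_weight = sum(importance)
    if total_weight <= 0:
        raise ValueError("importance weights must sum to a positive value")

    # Weighted mean squared error: errors at high-importance pixels count more.
    weighted_mse = sum(
        w * (a - b) ** 2
        for a, b, w in zip(original, reconstructed, importance)
    ) / total_weight

    if weighted_mse == 0:
        return math.inf  # identical images under this weighting
    return 10.0 * math.log10(peak ** 2 / weighted_mse)


# Usage: one pixel differs; weighting it heavily lowers the score.
original = [100, 100, 100, 100]
reconstructed = [100, 100, 110, 100]
uniform = weighted_psnr(original, reconstructed, [1, 1, 1, 1])
focused = weighted_psnr(original, reconstructed, [1, 1, 5, 1])
print(uniform > focused)
```

With uniform weights this reduces to ordinary PSNR, which is one reason a weighted mean is a convenient baseline for this kind of metric.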


Pages: 11
