Image Compression with Recurrent Neural Network and Generalized Divisive Normalization

被引：25

作者：

Islam, Khawar ^{[1
]}

Dang, L. Minh ^{[1
]}

Lee, Sujin ^{[2
]}

Moon, Hyeonjoon ^{[1
,2
]}

机构：

[1] Sejong Univ, Comp Vis & Pattern Recognit Lab, Seoul, South Korea

[2] Sejong Univ, Dept Artificial Intelligence, Seoul, South Korea

来源：

2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS, CVPRW 2021 | 2021年

基金：

新加坡国家研究基金会;

关键词：

D O I：

10.1109/CVPRW53098.2021.00209

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Image compression is a method to remove spatial redundancy between adjacent pixels and reconstruct a high-quality image. In the past few years, deep learning has gained huge attention from the research community and produced promising image reconstruction results. Therefore, recent methods focused on developing deeper and more complex networks, which significantly increased network complexity. In this paper, two effective novel blocks are developed: analysis and synthesis block that employs the convolution layer and Generalized Divisive Normalization (GDN) in the variablerate encoder and decoder side. Our network utilizes a pixel RNN approach for quantization. Furthermore, to improve the whole network, we encode a residual image using LSTM cells to reduce unnecessary information. Experimental results demonstrated that the proposed variable-rate framework with novel blocks outperforms existing methods and standard image codecs, such as George's [11] and JPEG in terms of image similarity. The project page along with code and models are available at https://github.com/khawar512/cvpr image compress

引用

页码：1875 / 1879

页数：5

共 12 条

[1]

[Anonymous], 2015, INT C LEARN REPR

[2]

Balle J., 2015, ARXIV151106281

[3] End-to-end optimization of nonlinear transform codes for perceptual quality [J].

Balle, Johannes ;

Laparra, Valero ;

Simoncelli, Eero P. .

2016 PICTURE CODING SYMPOSIUM (PCS), 2016,

[4]

Balle Johannes, 2018, arXiv preprint arXiv:1802.01436

[5]

Covell Michele, 2017, ARXIV170506687

[6] Improved Lossy Image Compression with Priming and Spatially Adaptive Bit Rates for Recurrent Networks [J].

Johnston, Nick ;

Vincent, Damien ;

Minnen, David ;

Covell, Michele ;

Singh, Saurabh ;

Chinen, Troy ;

Hwang, Sung Jin ;

Shor, Joel ;

Toderici, George .

2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2018, :4385-4393

[7]

Lee J., 2018, P INT C LEARN REPR I

[8]

Lin Chaoyi, 2020, P IEEECVF C COMPUTER, P13269

[9]

Minnen D, 2018, ADV NEUR IN, V31

[10]

Oord A. v. d., 2016, ARXIV160106759

← 1 2 →