Efficient Variable Rate Image Compression With Multi-Scale Decomposition Network

Cited by: 49
Authors
Cai, Chunlei [1]
Chen, Li [1]
Zhang, Xiaoyun [1]
Gao, Zhiyong [1]
Affiliations
[1] Shanghai Jiao Tong Univ, Dept Elect Engn, Inst Image Commun & Network Engn, Shanghai 200240, Peoples R China
Funding
National Natural Science Foundation of China; Natural Science Foundation of Shanghai
Keywords
Image coding; Resource management; Transforms; Transform coding; Codecs; Laplace equations; Standards; Lossy image compression; multi-scale decomposition transform; content adaptive rate allocation; variable rate image compression; convolutional neural network;
DOI
10.1109/TCSVT.2018.2880492
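Chinese Library Classification number
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Discipline classification code
0808; 0809
Abstract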
Deep learning image compression methods have shown impressive coding performance, but most of them produce a single compression rate from a network trained specifically for that rate. In practice, it is essential to support variable rate compression or to meet a target rate while retaining high coding performance. This paper proposes an image compression method that allows a single convolutional neural network (CNN) model to produce variable rates efficiently with optimized rate-distortion (RD) performance. The method consists of a CNN-based multi-scale decomposition transform and content adaptive rate allocation. Specifically, the transform network learns to decompose the input image into several scales of representations while optimizing the RD performance for all scales. Rate allocation algorithms for two typical scenarios determine the optimal scale of each image block for a given target rate or quality factor. For a target rate, the allocation adapts to content complexity. For a target quality factor, which indicates a tradeoff between rate and quality, the optimal scale is determined by minimizing the RD cost. Experimental results show that our method outperforms the JPEG2000 and BPG standards with high efficiency and achieves state-of-the-art RD performance as measured by the multi-scale structural similarity (MS-SSIM) index. Moreover, our method can strictly control the rate to produce a result at the target rate.
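As a rough illustration of the quality-factor scenario described in the abstract, the sketch below picks, for each image block, the scale with the lowest Lagrangian RD cost J = D + λR. This is a generic rate-distortion selection under stated assumptions, not the authors' implementation: the function name `select_scales`, the parameter `lam` (standing in for the quality factor), and the per-block rate and distortion numbers are all hypothetical.

```python
from typing import List, Sequence


def select_scales(
    rates: Sequence[Sequence[float]],
    distortions: Sequence[Sequence[float]],
    lam: float,
) -> List[int]:
    """Return, for each block, the index of the scale minimizing D + lam * R.

    rates[b][s] and distortions[b][s] are assumed to be the measured rate and
    distortion of block b when coded at scale s (hypothetical inputs).
    """
    choices = []
    for block_rates, block_dists in zip(rates, distortions):
        # Lagrangian RD cost of every candidate scale for this block.
        costs = [d + lam * r for r, d in zip(block_rates, block_dists)]
        # Keep the scale with the smallest cost.
        choices.append(min(range(len(costs)), key=costs.__getitem__))
    return choices


# Hypothetical measurements: two blocks, three scales each.
rates = [[0.1, 0.3, 0.6], [0.2, 0.5, 0.9]]
distortions = [[0.9, 0.4, 0.35], [0.8, 0.5, 0.1]]

# A large lam penalizes rate heavily; a small lam favors low distortion.
low_quality = select_scales(rates, distortions, lam=2.0)
high_quality = select_scales(rates, distortions, lam=0.1)
```

With these made-up numbers, the larger `lam` steers both blocks toward cheaper scales, while the smaller `lam` selects the highest-rate scale for both. The target-rate scenario would need a separate allocation step, which the abstract does not specify in enough detail to sketch here.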
Pages: 3687-3700
Number of pages: 14