UNIFIED SIGNAL COMPRESSION USING GENERATIVE ADVERSARIAL NETWORKS

Cited by: 0
Authors
Liu, Bowen [1]
Cao, Ang [1]
Kim, Hun-Seok [1]
Affiliations
[1] Univ Michigan, EECS, Ann Arbor, MI 48109 USA
Source
2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING | 2020
Keywords
Signal Compression; GAN; ADMM;
DOI
10.1109/icassp40776.2020.9053233
Chinese Library Classification
O42 [Acoustics];
Subject classification codes
070206 ; 082403 ;
Abstract
We propose a unified compression framework that uses generative adversarial networks (GANs) to compress image and speech signals. The compressed signal is represented by a latent vector fed into a generator network, which is trained to produce high-quality signals that minimize a target objective function. To quantize the compressed signal efficiently, non-uniformly quantized optimal latent vectors are identified by iterative back-propagation, with ADMM optimization performed at each iteration. Our experiments show that the proposed algorithm outperforms prior signal compression methods for both image and speech compression, as measured by several metrics including bit rate, PSNR, and neural-network-based signal classification accuracy.
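The latent-vector search described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: the generator here is a hypothetical fixed linear map standing in for a trained GAN generator, the codebook `levels` is an arbitrary non-uniform set chosen for illustration, and the ADMM splitting (a continuous latent `z`, a quantized copy `u`, and a scaled dual variable `lam`) is one standard way to handle a quantization constraint, assumed here rather than taken from the paper.

```python
import random

def generator(W, z):
    """Hypothetical stand-in generator: a fixed linear map G(z) = W z."""
    return [sum(w * zj for w, zj in zip(row, z)) for row in W]

def nearest_level(v, levels):
    """Project a scalar onto the closest entry of a non-uniform codebook."""
    return min(levels, key=lambda c: abs(c - v))

def admm_quantized_latent(W, x, levels, rho=1.0, outer=50, inner=20, lr=0.01):
    """Search for a quantized latent u such that G(u) approximates x.

    Assumed objective: ||G(z) - x||^2 subject to z = u, u in the codebook.
    """
    m, n = len(W), len(W[0])
    z = [0.0] * n                                # continuous latent
    u = [nearest_level(v, levels) for v in z]    # quantized copy
    lam = [0.0] * n                              # scaled dual variable
    for _ in range(outer):
        # z-update: gradient steps (the role back-propagation plays for a
        # real network) on ||G(z) - x||^2 + (rho / 2) * ||z - u + lam||^2
        for _ in range(inner):
            r = [g - t for g, t in zip(generator(W, z), x)]
            grad = [
                2.0 * sum(W[i][j] * r[i] for i in range(m))
                + rho * (z[j] - u[j] + lam[j])
                for j in range(n)
            ]
            z = [zj - lr * gj for zj, gj in zip(z, grad)]
        # u-update: project z + lam onto the codebook
        u = [nearest_level(zj + lj, levels) for zj, lj in zip(z, lam)]
        # dual update: accumulate the constraint violation z - u
        lam = [lj + zj - uj for lj, zj, uj in zip(lam, z, u)]
    return u

rng = random.Random(0)
W = [[rng.gauss(0.0, 1.0) for _ in range(4)] for _ in range(8)]
levels = [-1.0, -0.25, 0.0, 0.25, 1.0]           # illustrative non-uniform codebook
z_true = [1.0, -0.25, 0.0, 0.25]
x = generator(W, z_true)                         # signal to be compressed
z_q = admm_quantized_latent(W, x, levels)
print(z_q)
```

Only the codebook indices of `z_q` would need to be stored or transmitted; the decoder would run the shared generator on the dequantized vector. With a real generator, the inner gradient would come from automatic differentiation rather than the closed-form expression used for the linear stand-in.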
Pages: 3177-3181
Page count: 5