Joint Source-Channel Coding Over Additive Noise Analog Channels Using Mixture of Variational Autoencoders

被引:27
作者
Saidutta, Yashas Malur [1 ]
Abdi, Afshin [1 ,2 ]
Fekri, Faramarz [1 ]
机构
[1] Geogia Inst Technol, Dept Elect & Comp Engn, Atlanta, GA 30318 USA
[2] Qualcomm Technol Inc, San Diego, CA 92121 USA
基金
美国国家科学基金会;
关键词
Decoding; Training; AWGN channels; Neural networks; Channel coding; Scalability; Optimization; Joint source-channel coding; machine learning; deep learning; Variational Autoencoders; ALGORITHMS;
D O I
10.1109/JSAC.2021.3078489
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In this paper, we present a learning scheme for Joint Source-Channel Coding (JSCC) over analog independent additive noise channels. We formulate the learning problem by showing that the minimization loss function from rate-distortion theory, is upper bounded by the loss function of the Variational Autoencoder (VAE). We show that when the source dimension is greater than the channel dimension, the encoding of two source samples in the neighborhood of each other need not be near each other. Such discontinuous projection needs to be accounted for by using multiple encoders and selecting an encoder to encode samples on a particular side of the discontinuity. We explore two selection methodologies, one based on an intuitive rule and the other where it is posed as a learning task in a Mixture-of-Experts (MoE) setup. We analyze the gradients of these methods and reason why the latter is better at avoiding local optima. We show the efficacy of the proposed methodology by simulating the performance of the system for JSCC of Gaussian sources over AWGN channels and showing that the learned solutions are close to or better than the ones proposed earlier. The proposed methodology is also naturally capable of generalizing to other source distributions which we showcase by simulating for Laplace sources. The learned systems are also robust to changes in channel conditions. Further, a single system can be trained to generalize over a range of channel conditions provided the channel conditions are known at both the transmitter and the receiver. Finally, we evaluate our proposed methodology on three different image datasets and showcase consistent improvement over existing methods due to the VAE formulation.
引用
收藏
页码:2000 / 2013
页数:14
相关论文
共 58 条
  • [1] Abadi M, 2016, PROCEEDINGS OF OSDI'16: 12TH USENIX SYMPOSIUM ON OPERATING SYSTEMS DESIGN AND IMPLEMENTATION, P265
  • [2] On Zero-Delay Source-Channel Coding
    Akyol, Emrah
    Viswanatha, Kumar B.
    Rose, Kenneth
    Ramstad, Tor A.
    [J]. IEEE TRANSACTIONS ON INFORMATION THEORY, 2014, 60 (12) : 7473 - 7489
  • [3] [Anonymous], 2018, ARXIV180205983
  • [4] [Anonymous], 2012, ELEMENTS INFORM THEO
  • [5] [Anonymous], 2018, ARXIV180901733
  • [6] PERFORMANCE BOUNDS AND OPTIMAL LINEAR CODING FOR DISCRETE-TIME MULTICHANNEL COMMUNICATION-SYSTEMS
    BASAR, TU
    SANKUR, B
    ABUT, H
    [J]. IEEE TRANSACTIONS ON INFORMATION THEORY, 1980, 26 (02) : 212 - 217
  • [7] Belghazi MI, 2018, PR MACH LEARN RES, V80
  • [8] End-to-End Optimized ROI Image Compression
    Cai, Chunlei
    Chen, Li
    Zhang, Xiaoyun
    Gao, Zhiyong
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 29 : 3442 - 3457
  • [9] Improved learning algorithms for mixture of experts in multiclass classification
    Chen, K
    Xu, L
    Chi, H
    [J]. NEURAL NETWORKS, 1999, 12 (09) : 1229 - 1252
  • [10] Choi K., 2018, ABS181107557 CORR, P1