Contrastive Learning based Semantic Communication for Wireless Image Transmission

被引：1

作者：

Tang, Shunpu ^{[1
,2
]}

Yang, Qianqian ^{[1
]}

Fan, Lisheng ^{[2
]}

Lei, Xianfu ^{[3
]}

Deng, Yansha ^{[4
]}

Nallanathan, Arumugam ^{[5
]}

机构：

[1] Zhejiang Univ, Key Lab Collaborat Sensing & Autonomous Unmanned, Hangzhou, Peoples R China

[2] Guangzhou Univ, Sch Comp Sci & Cyber Engn, Guangzhou, Peoples R China

[3] Southwest Jiaotong Univ, Inst Mobile Commun, Chengdu, Peoples R China

[4] Kings Coll London, Dept Engn, London, England

[5] Queen Mary Univ London, Sch Elect Engn & Comp Sci, London, England

来源：

2023 IEEE 98TH VEHICULAR TECHNOLOGY CONFERENCE, VTC2023-FALL | 2023年

关键词：

Semantic communication; image transmission; contrastive learning; joint source-channel coding;

D O I：

10.1109/VTC2023-Fall60731.2023.10333392

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Recently, semantic communication has been widely applied in wireless image transmission systems as it can prioritize the preservation of meaningful semantic information in images over the accuracy of transmitted symbols, leading to improved communication efficiency. However, existing semantic communication approaches still face limitations in achieving considerable inference performance in downstream AI tasks like image recognition, or balancing the inference performance with the quality of the reconstructed image at the receiver. Therefore, this paper proposes a contrastive learning (CL)-based semantic communication approach to overcome these limitations. Specifically, we regard the image corruption during transmission as a form of data augmentation in CL and leverage CL to reduce the semantic distance between the original and the corrupted reconstruction while maintaining the semantic distance among irrelevant images for better discrimination in downstream tasks. Moreover, we design a two-stage training procedure and the corresponding loss functions for jointly optimizing the semantic encoder and decoder to achieve a good trade-off between the performance of image recognition in the downstream task and reconstructed quality. Simulations are finally conducted to demonstrate the superiority of the proposed method over the competitive approaches. In particular, the proposed method can achieve up to 56% accuracy gain on the CIFAR10 dataset when the bandwidth compression ratio is 1/48.

引用

页数：6

共 7 条

[1] Deep Joint Source-Channel Coding for Wireless Image Transmission [J].

Bourtsoulatze, Eirina ;

Kurka, David Burth ;

Gunduz, Deniz .

IEEE TRANSACTIONS ON COGNITIVE COMMUNICATIONS AND NETWORKING, 2019, 5 (03) :567-579

[2]

Chen T, 2020, PR MACH LEARN RES, V119

[3]

Gündüz D, 2023, IEEE J SEL AREA COMM, V41, P5, DOI 10.1109/JSAC.2022.3223408

[4] Deep Residual Learning for Image Recognition [J].

He, Kaiming ;

Zhang, Xiangyu ;

Ren, Shaoqing ;

Sun, Jian .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778

[5] DeepJS']JSCC-f : Deep Joint Source-Channel Coding of Images With Feedback [J].

Kurka, David Burth ;

Gunduz, Deniz .

IEEE JOURNAL ON SELECTED AREAS IN INFORMATION THEORY, 2020, 1 (01) :178-193

[6] Real-Time Single Image and Video Super-Resolution Using an Efficient Sub-Pixel Convolutional Neural Network [J].

Shi, Wenzhe ;

Caballero, Jose ;

Huszar, Ferenc ;

Totz, Johannes ;

Aitken, Andrew P. ;

Bishop, Rob ;

Rueckert, Daniel ;

Wang, Zehan .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :1874-1883

[7] Deep Learning-Enabled Semantic Communication Systems With Task-Unaware Transmitter and Dynamic Data [J].

Zhang, Hongwei ;

Shao, Shuo ;

Tao, Meixia ;

Bi, Xiaoyan ;

Letaief, Khaled B. .

IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2023, 41 (01) :170-185

← 1 →