Speech Semantic Communication Based on Swin Transformer

被引:0
作者
Zhou, Ziliang [1 ]
Zheng, Shilian [2 ]
Chen, Jie [1 ]
Zhao, Zhijin [1 ]
Yang, Xiaoniu [2 ]
机构
[1] Hangzhou Dianzi Univ, Coll Commun Engn, Hangzhou 310018, Peoples R China
[2] Natl Key Lab Electromagnet Space Secur, Innovat Studio Academician Yang, Jiaxing 314033, Peoples R China
基金
中国国家自然科学基金;
关键词
Semantic communication; speech communication; deep learning; end-to-end communication; SYSTEMS;
D O I
10.1109/TCCN.2023.3345858
中图分类号
TN [电子技术、通信技术];
学科分类号
0809 ;
摘要
Semantic communication is a novel communication paradigm which refers to the extraction and encoding of semantic information from the source, and the restoration of information from a semantic perspective at the receiver. In this paper, we propose an end-to-end speech semantic communication system based on Transformer, called DeepSC-TS. It focuses on reconstructing and integrating multi-level information from the transmitted semantic signal at the receiver, effectively eliminating semantic signal noise while preserving the original semantic information. Moreover, it does not increase the data load of the original signal during transmission. Simulation results demonstrate that our proposed DeepSC-TS exhibits outstanding performance in different channel environments and it performs better than an existing speech semantic communication system.
引用
收藏
页码:756 / 768
页数:13
相关论文
共 33 条
[1]   Deep Joint Source-Channel Coding for Wireless Image Transmission [J].
Bourtsoulatze, Eirina ;
Kurka, David Burth ;
Gunduz, Deniz .
IEEE TRANSACTIONS ON COGNITIVE COMMUNICATIONS AND NETWORKING, 2019, 5 (03) :567-579
[2]  
Cheng ZX, 2018, PICT COD SYMP, P253, DOI 10.1109/PCS.2018.8456308
[3]   SNR-ADAPTIVE DEEP JOINT SOURCE-CHANNEL CODING FOR WIRELESS IMAGE TRANSMISSION [J].
Ding, Mingze ;
Li, Jiahui ;
Ma, Mengyao ;
Fan, Xiaopeng .
2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, :1555-1559
[4]   A Survey of 5G Network: Architecture and Emerging Technologies [J].
Gupta, Akhil ;
Jha, Rakesh Kumar .
IEEE ACCESS, 2015, 3 :1206-1232
[5]   Semantic-Preserved Communication System for Highly Efficient Speech Transmission [J].
Han, Tianxiao ;
Yang, Qianqian ;
Shi, Zhiguo ;
He, Shibo ;
Zhang, Zhaoyang .
IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2023, 41 (01) :245-259
[6]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[7]   Deep Learning-Based Image Semantic Coding for Semantic Communications [J].
Huang, Danlan ;
Tao, Xiaoming ;
Gao, Feifei ;
Lu, Jianhua .
2021 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2021,
[8]  
Jankowski M, 2020, INT CONF ACOUST SPEE, P5070, DOI [10.1109/icassp40776.2020.9054078, 10.1109/ICASSP40776.2020.9054078]
[9]   Deep Source-Channel Coding for Sentence Semantic Transmission With HARQ [J].
Jiang, Peiwen ;
Wen, Chao-Kai ;
Jin, Shi ;
Li, Geoffrey Ye .
IEEE TRANSACTIONS ON COMMUNICATIONS, 2022, 70 (08) :5225-5240
[10]   Deep learning [J].
LeCun, Yann ;
Bengio, Yoshua ;
Hinton, Geoffrey .
NATURE, 2015, 521 (7553) :436-444