Semantic MIMO Systems for Speech-to-Text Transmission

被引:0
作者
Weng, Zhenzi [1 ]
Qin, Zhijin [2 ,3 ,4 ]
Xie, Huiqiang [5 ]
Tao, Xiaoming [2 ,3 ,4 ]
Letaief, Khaled B. [6 ,7 ]
机构
[1] Imperial Coll London, Dept Elect & Elect Engn, London SW7 2AZ, England
[2] Tsinghua Univ, Dept Elect Engn, Beijing 100084, Peoples R China
[3] State Key Lab Space Network Commun, Beijing 100084, Peoples R China
[4] Beijing Natl Res Ctr Informat Sci & Technol, Dept Elect Engn, Beijing 100084, Peoples R China
[5] Jinan Univ, Dept Elect Engn, Guangzhou 510632, Peoples R China
[6] Hong Kong Univ Sci & Technol, Dept Elect & Comp Engn, Hong Kong, Peoples R China
[7] Peng Cheng Lab, Shenzhen 518066, Peoples R China
基金
中国国家自然科学基金;
关键词
Semantics; Transformers; Speech to text; Precoding; Channel estimation; Image reconstruction; Feature extraction; Wireless communication; Receivers; Accuracy; Deep learning; MIMO; semantic communication; speech-to-text; COMMUNICATION-SYSTEMS;
D O I
10.1109/TWC.2024.3472612
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Semantic communications have been utilized to execute numerous intelligent tasks by transmitting task-related semantic information instead of bits. In this article, we propose a semantic-aware speech-to-text transmission system for the single-user multiple-input multiple-output (MIMO) and multi-user MIMO communication scenarios, named SAC-ST. Particularly, a semantic communication system to serve the speech-to-text task at the receiver is first designed, which compresses the semantic information and generates the low-dimensional semantic features by leveraging the transformer module. In addition, a novel semantic-aware network is proposed to facilitate transmission with high semantic fidelity by identifying the critical semantic information and guaranteeing its accurate recovery. Furthermore, we extend the SAC-ST with a neural network-enabled channel estimation network to mitigate the dependence on accurate channel state information and validate the feasibility of SAC-ST in practical communication environments. Simulation results will show that the proposed SAC-ST outperforms the communication framework without the semantic-aware network for speech-to-text transmission over the MIMO channels in terms of the speech-to-text metrics, especially in the low signal-to-noise regime. Moreover, the SAC-ST with the developed channel estimation network is comparable to the SAC-ST with perfect channel state information.
引用
收藏
页码:18697 / 18710
页数:14
相关论文
共 47 条
  • [1] Bian Chenghong, 2023, 2023 IEEE 24th International Workshop on Signal Processing Advances in Wireless Communications (SPAWC), P616, DOI 10.1109/SPAWC53906.2023.10304536
  • [2] Cer D., 2017, arXiv
  • [3] A transmit preprocessing technique for multiuser MIMO systems using a decomposition approach
    Choi, LU
    Murch, RD
    [J]. IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2004, 3 (01) : 20 - 24
  • [4] Cicchetti G, 2024, Arxiv, DOI arXiv:2405.09976
  • [5] Deep CNN-Based Channel Estimation for mmWave Massive MIMO Systems
    Dong, Peihao
    Zhang, Hua
    Li, Geoffrey Ye
    Gaspar, Ivan Simoes
    NaderiAlizadeh, Navid
    [J]. IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING, 2019, 13 (05) : 989 - 1000
  • [6] Graves A., 2006, P 23 INT C MACH LEAR, DOI DOI 10.1145/1143844.1143891
  • [7] Gulati A, 2020, Arxiv, DOI arXiv:2005.08100
  • [8] Semantic-Preserved Communication System for Highly Efficient Speech Transmission
    Han, Tianxiao
    Yang, Qianqian
    Shi, Zhiguo
    He, Shibo
    Zhang, Zhaoyang
    [J]. IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2023, 41 (01) : 245 - 259
  • [9] Robust Semantic Communications With Masked VQ-VAE Enabled Codebook
    Hu, Qiyu
    Zhang, Guangyi
    Qin, Zhijin
    Cai, Yunlong
    Yu, Guanding
    Li, Geoffrey Ye
    [J]. IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2023, 22 (12) : 8707 - 8722
  • [10] Toward Semantic Communications: Deep Learning-Based Image Semantic Coding
    Huang, Danlan
    Gao, Feifei
    Tao, Xiaoming
    Du, Qiyuan
    Lu, Jianhua
    [J]. IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2023, 41 (01) : 55 - 71