Task-Oriented Multi-User Semantic Communications

被引:146
|
作者
Xie, Huiqiang [1 ]
Qin, Zhijin [1 ]
Tao, Xiaoming [2 ]
Letaief, Khaled B. [3 ,4 ]
机构
[1] Queen Mary Univ London, Sch Elect Engn & Comp Sci, London E1 4NS, England
[2] Tsinghua Univ, Beijing Natl Res Ctr Informat Sci & Technol, Dept Elect Engn, Beijing 100084, Peoples R China
[3] Hong Kong Univ Sci & Technol, Dept Elect & Comp Engn, Hong Kong, Peoples R China
[4] Peng Cheng Lab, Shenzhen 518066, Peoples R China
基金
中国国家自然科学基金;
关键词
Semantics; Task analysis; Transmitters; Transformers; Receivers; Image retrieval; Machine translation; Deep learning; semantic communications; multimodal fusion; multi-user communications; transformer; WIRELESS COMMUNICATIONS; INTERNET;
D O I
10.1109/JSAC.2022.3191326
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
While semantic communications have shown the potential in the case of single-modal single-users, its applications to the multi-user scenario remain limited. In this paper, we investigate deep learning (DL) based multi-user semantic communication systems for transmitting single-modal data and multimodal data, respectively. We adopt three intelligent tasks, including, image retrieval, machine translation, and visual question answering (VQA) as the transmission goal of semantic communication systems. We propose a Transformer based framework to unify the structure of transmitters for different tasks. For the single-modal multi-user system, we propose two Transformer based models, named, DeepSC-IR and DeepSC-MT, to perform image retrieval and machine translation, respectively. In this case, DeepSC-IR is trained to optimize the distance in embedding space between images and DeepSC-MT is trained to minimize the semantic errors by recovering the semantic meaning of sentences. For the multimodal multi-user system, we develop a Transformer enabled model, named, DeepSC-VQA, for the VQA task by extracting text-image information at the transmitters and fusing it at the receiver. In particular, a novel layer-wise Transformer is designed to help fuse multimodal data by adding connection between each of the encoder and decoder layers. Numerical results show that the proposed models are superior to traditional communications in terms of the robustness to channels, computational complexity, transmission delay, and the task-execution performance at various task-specific metrics.
引用
收藏
页码:2584 / 2597
页数:14
相关论文
共 50 条
  • [21] Task-Oriented Semantic Communication Based on Semantic Triplets
    Liu, Chuanhong
    Guo, Caili
    Wang, Siyi
    Li, Yuze
    Hu, Dingxin
    2023 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE, WCNC, 2023,
  • [22] Privacy-Preserving Task-Oriented Semantic Communications Against Model Inversion Attacks
    Wang, Yanhu
    Guo, Shuaishuai
    Deng, Yiqin
    Zhang, Haixia
    Fang, Yuguang
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2024, 23 (08) : 10150 - 10165
  • [23] Improving Channel Resilience for Task-Oriented Semantic Communications: A Unified Information Bottleneck Approach
    Lyu, Shuai
    Sun, Yao
    Guo, Linke
    Yuan, Xiaoyong
    Fang, Fang
    Zhang, Lan
    Wang, Xianbin
    IEEE COMMUNICATIONS LETTERS, 2024, 28 (11) : 2623 - 2627
  • [24] Exploiting Multi-user Semantic Communications: A Non-orthogonal Approach
    Zhong, Ruikang
    Mu, Xidong
    Chen, Yue
    Liu, Yuanwei
    2024 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE, WCNC 2024, 2024,
  • [25] Task-Oriented Scene Graph-Based Semantic Communications With Adaptive Channel Coding
    Sun, Shiqi
    Qin, Zhijin
    Xie, Huiqiang
    Tao, Xiaoming
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2024, 23 (11) : 17070 - 17083
  • [26] TASK-ORIENTED COMMUNICATIONS FOR FUTURE WIRELESS NETWORKS
    Xu, Wei
    Yang, Zhaohui
    Ng, Derrick Wing Kwan
    Dobre, Octavia A.
    Wang, Li-Chun
    Schober, Robert
    IEEE WIRELESS COMMUNICATIONS, 2023, 30 (03) : 16 - 17
  • [27] Semantic Understanding and Task-Oriented for Image Assessment
    Tsai, Cheng-Min
    Guan, Shin-Shen
    Tsai, Wang-Chin
    Zhang, Zhi-Hua
    HUMAN ASPECTS OF IT FOR THE AGED POPULATION: ACCEPTANCE, COMMUNICATION AND PARTICIPATION, PT I, 2018, 10926 : 392 - 400
  • [28] Task-Oriented Semantic Communication with Foundation Models
    Chen Mingkai
    Liu Minghao
    Zhang Zhe
    Xu Zhiping
    Wang Lei
    China Communications, 2024, 21 (07) : 65 - 77
  • [29] Semantic Equivalence of Task-Oriented Programs in TopHat
    Klijnsma, Tosca
    Steenvoorden, Tim
    TRENDS IN FUNCTIONAL PROGRAMMING, TFP 2022, 2022, 13401 : 100 - 125
  • [30] Task-Oriented Semantic Communication with Foundation Models
    Chen, Mingkai
    Liu, Minghao
    Zhe, Zhang
    Xu, Zhiping
    Lei, Wang
    CHINA COMMUNICATIONS, 2024, 21 (07) : 65 - 77