Task-Oriented Multi-User Semantic Communications

被引:146
|
作者
Xie, Huiqiang [1 ]
Qin, Zhijin [1 ]
Tao, Xiaoming [2 ]
Letaief, Khaled B. [3 ,4 ]
机构
[1] Queen Mary Univ London, Sch Elect Engn & Comp Sci, London E1 4NS, England
[2] Tsinghua Univ, Beijing Natl Res Ctr Informat Sci & Technol, Dept Elect Engn, Beijing 100084, Peoples R China
[3] Hong Kong Univ Sci & Technol, Dept Elect & Comp Engn, Hong Kong, Peoples R China
[4] Peng Cheng Lab, Shenzhen 518066, Peoples R China
基金
中国国家自然科学基金;
关键词
Semantics; Task analysis; Transmitters; Transformers; Receivers; Image retrieval; Machine translation; Deep learning; semantic communications; multimodal fusion; multi-user communications; transformer; WIRELESS COMMUNICATIONS; INTERNET;
D O I
10.1109/JSAC.2022.3191326
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
While semantic communications have shown the potential in the case of single-modal single-users, its applications to the multi-user scenario remain limited. In this paper, we investigate deep learning (DL) based multi-user semantic communication systems for transmitting single-modal data and multimodal data, respectively. We adopt three intelligent tasks, including, image retrieval, machine translation, and visual question answering (VQA) as the transmission goal of semantic communication systems. We propose a Transformer based framework to unify the structure of transmitters for different tasks. For the single-modal multi-user system, we propose two Transformer based models, named, DeepSC-IR and DeepSC-MT, to perform image retrieval and machine translation, respectively. In this case, DeepSC-IR is trained to optimize the distance in embedding space between images and DeepSC-MT is trained to minimize the semantic errors by recovering the semantic meaning of sentences. For the multimodal multi-user system, we develop a Transformer enabled model, named, DeepSC-VQA, for the VQA task by extracting text-image information at the transmitters and fusing it at the receiver. In particular, a novel layer-wise Transformer is designed to help fuse multimodal data by adding connection between each of the encoder and decoder layers. Numerical results show that the proposed models are superior to traditional communications in terms of the robustness to channels, computational complexity, transmission delay, and the task-execution performance at various task-specific metrics.
引用
收藏
页码:2584 / 2597
页数:14
相关论文
共 50 条
  • [41] Utility-Oriented Communications for 6G Mobile Networks and the Metaverse: Semantic, Task-Oriented, Goal-Oriented, and More
    Wang, Zefan
    Zhao, Jun
    2023 IEEE 43RD INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS, ICDCS, 2023, : 987 - 988
  • [42] Selfish Multi-User Task Scheduling\
    Carroll, Thomas E.
    Grosu, Daniel
    ISPDC 2006: FIFTH INTERNATIONAL SYMPOSIUM ON PARALLEL AND DISTRIBUTED COMPUTING, PROCEEDINGS, 2006, : 99 - +
  • [43] Channel-Transferable Semantic Communications for Multi-User OFDM-NOMA Systems
    Lin, Lan
    Xu, Wenjun
    Wang, Fengyu
    Zhang, Yimeng
    Zhang, Wei
    Zhang, Ping
    IEEE WIRELESS COMMUNICATIONS LETTERS, 2024, 13 (03) : 721 - 725
  • [44] User oriented IP accounting in multi-user systems
    Ge, Z
    Reuther, B
    Mueller, P
    INTEGRATED NETWORK MANAGEMENT VIII: MANAGING IT ALL, 2003, 118 : 59 - 72
  • [45] User Pairing Algorithms for Multi-cell and Multi-user Communications
    Huang, Fong-Ru
    Chiu, Mao-Ching
    Sheen, Wern-Ho
    2012 INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY AND ITS APPLICATIONS (ISITA 2012), 2012, : 465 - 469
  • [46] Guest Editorial: Task-Oriented Communications and Networking for the Internet of Things
    Deng Y.
    Liu Y.
    Pappas N.
    Zhang J.
    Wang Y.
    Sivanesan K.
    IEEE Internet of Things Magazine, 2023, 6 (04): : 8 - 9
  • [47] Dynamic Resource Allocation for Multi-User Goal-oriented Communications at the Wireless Edge
    Binucci, Francesco
    Banelli, Paolo
    Di Lorenzo, Paolo
    Barbarossa, Sergio
    2022 30TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2022), 2022, : 697 - 701
  • [48] Task-oriented and Semantics-aware Communications for Augmented Reality
    Wang, Zhe
    Deng, Yansha
    IEEE CONFERENCE ON GLOBAL COMMUNICATIONS, GLOBECOM, 2023, : 2215 - 2220
  • [49] MTOP: A Comprehensive Multilingual Task-Oriented Semantic Parsing Benchmark
    Li, Haoran
    Arora, Abhinav
    Chen, Shuohui
    Gupta, Anchit
    Gupta, Sonal
    Mehdad, Yashar
    16TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2021), 2021, : 2950 - 2962
  • [50] Learning Multi-Rate Task-Oriented Communications Over Symmetric Discrete Memoryless Channels
    Zhang, Anbang
    Guo, Shuaishuai
    IEEE COMMUNICATIONS LETTERS, 2024, 28 (10) : 2303 - 2307