Task-Oriented Multi-User Semantic Communications

Cited by: 146
Authors
Xie, Huiqiang [1]
Qin, Zhijin [1]
Tao, Xiaoming [2]
Letaief, Khaled B. [3, 4]
Affiliations
[1] Queen Mary Univ London, Sch Elect Engn & Comp Sci, London E1 4NS, England
[2] Tsinghua Univ, Beijing Natl Res Ctr Informat Sci & Technol, Dept Elect Engn, Beijing 100084, Peoples R China
[3] Hong Kong Univ Sci & Technol, Dept Elect & Comp Engn, Hong Kong, Peoples R China
[4] Peng Cheng Lab, Shenzhen 518066, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Semantics; Task analysis; Transmitters; Transformers; Receivers; Image retrieval; Machine translation; Deep learning; semantic communications; multimodal fusion; multi-user communications; transformer; WIRELESS COMMUNICATIONS; INTERNET;
DOI
10.1109/JSAC.2022.3191326
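Chinese Library Classification number
TM [Electrical engineering]; TN [Electronic technology, communication technology];
Discipline classification code
0808; 0809;
Abstract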
While semantic communications have shown potential in the single-modal, single-user case, their application to multi-user scenarios remains limited. In this paper, we investigate deep learning (DL) based multi-user semantic communication systems for transmitting single-modal data and multimodal data, respectively. We adopt three intelligent tasks, namely image retrieval, machine translation, and visual question answering (VQA), as the transmission goals of the semantic communication systems. We propose a Transformer-based framework to unify the structure of transmitters for different tasks. For the single-modal multi-user system, we propose two Transformer-based models, named DeepSC-IR and DeepSC-MT, to perform image retrieval and machine translation, respectively. In this case, DeepSC-IR is trained to optimize the distance in embedding space between images, and DeepSC-MT is trained to minimize semantic errors by recovering the semantic meaning of sentences. For the multimodal multi-user system, we develop a Transformer-enabled model, named DeepSC-VQA, for the VQA task by extracting text-image information at the transmitters and fusing it at the receiver. In particular, a novel layer-wise Transformer is designed to help fuse multimodal data by adding connections between each of the encoder and decoder layers. Numerical results show that the proposed models are superior to traditional communications in terms of robustness to channels, computational complexity, transmission delay, and task-execution performance under various task-specific metrics.
Pages: 2584 - 2597
Page count: 14
Related papers
50 in total
  • [31] Diagnosing Transformers in Task-Oriented Semantic Parsing
    Desai, Shrey
    Aly, Ahmed
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-IJCNLP 2021, 2021, : 57 - 62
  • [32] Task-oriented Web user modeling for recommendation
    Jin, X
    Zhou, YZ
    Mobasher, B
    USER MODELING 2005, PROCEEDINGS, 2005, 3538 : 109 - 118
  • [33] Flag Vector Assisted Multi-User Semantic Communications for Downlink Text Transmission
    Huang, Wei
    Wang, Jun
    Chen, Xiaonan
    Peng, Qihang
    Zhu, Yichao
    IEEE COMMUNICATIONS LETTERS, 2024, 28 (06) : 1283 - 1287
  • [34] Contextual Semantic Parsing for Multilingual Task-Oriented Dialogues
    Moradshahi, Mehrad
    Tsai, Victoria
    Campagna, Giovanni
    Lam, Monica S.
    17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023, 2023, : 902 - 915
  • [35] A Multidimensional Design for Multi-user Communications
    Wu, Yuteng
    Attang, Edidiong
    Norouzi, Mandana
    Atkin, G. E.
    2016 IEEE INTERNATIONAL CONFERENCE ON ELECTRO INFORMATION TECHNOLOGY (EIT), 2016, : 402 - 406
  • [36] RETRONLU: Retrieval Augmented Task-Oriented Semantic Parsing
    Gupta, Vivek
    Shrivastava, Akshat
    Sagar, Adithya
    Aghajanyan, Armen
    Savenkov, Denis
    PROCEEDINGS OF THE 4TH WORKSHOP ON NLP FOR CONVERSATIONAL AI, 2022, : 184 - 196
  • [37] Task-oriented Grasping with Semantic and Geometric Scene Understanding
    Detry, Renaud
    Papon, Jeremie
    Matthies, Larry
    2017 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2017, : 3266 - 3273
  • [38] EmoUS: Simulating User Emotions in Task-Oriented Dialogues
    Lin, Hsien-Chin
    Feng, Shutong
    Geishauser, Christian
    Lubis, Nurul
    van Niekerk, Carel
    Heck, Michael
    Ruppik, Benjamin
    Vukovic, Renato
    Gasic, Milica
    PROCEEDINGS OF THE 46TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2023, 2023, : 2526 - 2531
  • [39] LUIS - a logic for task-oriented user interface specification
    Vienna Univ of Technology, Vienna, Austria
    Int J Intell Syst, 2 (201-231):
  • [40] Understanding User Satisfaction with Task-oriented Dialogue Systems
    Siro, Clemencia
    Aliannejadi, Mohammad
    de Rijke, Maarten
    PROCEEDINGS OF THE 45TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '22), 2022, : 2018 - 2023