Representation and Fusion Based on Knowledge Graph in Multi-Modal Semantic Communication

被引:2
|
作者
Xing, Chenlin [1 ]
Lv, Jie [1 ]
Luo, Tao [1 ]
Zhang, Zhilong [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Sch Informat & Commun Engn, Beijing 100876, Peoples R China
基金
中国国家自然科学基金;
关键词
Semantics; Correlation; Feature extraction; Knowledge graphs; Cognition; Head; Data mining; Semantic communication; multi-modal fusion; knowledge graph;
D O I
10.1109/LWC.2024.3369864
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The existing research on multi-modal semantic communication ignores the exploration of reasoning correlation among multi-modal data. Motivated by this, a multi-modal semantic representation and fusion model based on knowledge graph (KG-MSF) is proposed in this letter. In KG-MSF, the direct and reasoning correlation semantic information is extracted and mapped into a two-layer semantic architecture to represent the semantics of each modal fully. After that, the knowledge graph with structural advantage is utilized to fuse multi-modal semantic information, which is transmitted under different channel conditions. To assess the efficacy of semantic representation and fusion of the proposed KG-MSF in the multi-modal semantic communication system, we conduct comprehensive experiments on the task of visual question answer (VQA) with a metric of answer accuracy. The results demonstrate the superiority compared with existing models for multi-modal semantic representation, fusion, transmission efficiency and channel robustness.
引用
收藏
页码:1344 / 1348
页数:5
相关论文
共 50 条
  • [1] An Enhanced Multi-Modal Recommendation Based on Alternate Training With Knowledge Graph Representation
    Wang, Yuequn
    Dong, Liyan
    Zhang, Hao
    Ma, Xintao
    Li, Yongli
    Sun, Minghui
    IEEE ACCESS, 2020, 8 : 213012 - 213026
  • [2] Contrastive Multi-Modal Knowledge Graph Representation Learning
    Fang, Quan
    Zhang, Xiaowei
    Hu, Jun
    Wu, Xian
    Xu, Changsheng
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (09) : 8983 - 8996
  • [3] MGKsite: Multi-Modal Knowledge-Driven Site Selection via Intra and Inter-Modal Graph Fusion
    Liang, Ke
    Meng, Lingyuan
    Li, Hao
    Liu, Meng
    Wang, Siwei
    Zhou, Sihang
    Liu, Xinwang
    He, Kunlun
    IEEE TRANSACTIONS ON MULTIMEDIA, 2025, 27 : 1722 - 1735
  • [4] Semantic2Graph: graph-based multi-modal feature fusion for action segmentation in videos
    Junbin Zhang
    Pei-Hsuan Tsai
    Meng-Hsun Tsai
    Applied Intelligence, 2024, 54 : 2084 - 2099
  • [5] Semantic2Graph: graph-based multi-modal feature fusion for action segmentation in videos
    Zhang, Junbin
    Tsai, Pei-Hsuan
    Tsai, Meng-Hsun
    APPLIED INTELLIGENCE, 2024, 54 (02) : 2084 - 2099
  • [6] Semantic Communication Enhanced by Knowledge Graph Representation Learning
    Hello, Nour
    Di Lorenzo, Paolo
    Strinati, Emilio Calvanese
    2024 IEEE 25TH INTERNATIONAL WORKSHOP ON SIGNAL PROCESSING ADVANCES IN WIRELESS COMMUNICATIONS, SPAWC 2024, 2024, : 876 - 880
  • [7] Richpedia: A Comprehensive Multi-modal Knowledge Graph
    Wang, Meng
    Qi, Guilin
    Wang, Haofen
    Zheng, Qiushuo
    SEMANTIC TECHNOLOGY, JIST 2019: PROCEEDINGS, 2020, 12032 : 130 - 145
  • [8] MultiJAF: Multi-modal joint entity alignment framework for multi-modal knowledge graph
    Cheng, Bo
    Zhu, Jia
    Guo, Meimei
    NEUROCOMPUTING, 2022, 500 : 581 - 591
  • [9] Towards Using Semantic-Web Technologies for Multi-Modal Knowledge Graph Construction
    Baumgartner, Matthias
    Rossetto, Luca
    Bernstein, Abraham
    MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 4645 - 4649
  • [10] Multi-Modal Sensor Fusion-Based Semantic Segmentation for Snow Driving Scenarios
    Vachmanus, Sirawich
    Ravankar, Ankit A.
    Emaru, Takanori
    Kobayashi, Yukinori
    IEEE SENSORS JOURNAL, 2021, 21 (15) : 16839 - 16851