Representation and Fusion Based on Knowledge Graph in Multi-Modal Semantic Communication

被引：2

作者：

Xing, Chenlin ^{[1
]}

Lv, Jie ^{[1
]}

Luo, Tao ^{[1
]}

Zhang, Zhilong ^{[1
]}

机构：

[1] Beijing Univ Posts & Telecommun, Sch Informat & Commun Engn, Beijing 100876, Peoples R China

来源：

IEEE WIRELESS COMMUNICATIONS LETTERS | 2024年 / 13卷 / 05期

基金：

中国国家自然科学基金;

关键词：

Semantics; Correlation; Feature extraction; Knowledge graphs; Cognition; Head; Data mining; Semantic communication; multi-modal fusion; knowledge graph;

D O I：

10.1109/LWC.2024.3369864

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The existing research on multi-modal semantic communication ignores the exploration of reasoning correlation among multi-modal data. Motivated by this, a multi-modal semantic representation and fusion model based on knowledge graph (KG-MSF) is proposed in this letter. In KG-MSF, the direct and reasoning correlation semantic information is extracted and mapped into a two-layer semantic architecture to represent the semantics of each modal fully. After that, the knowledge graph with structural advantage is utilized to fuse multi-modal semantic information, which is transmitted under different channel conditions. To assess the efficacy of semantic representation and fusion of the proposed KG-MSF in the multi-modal semantic communication system, we conduct comprehensive experiments on the task of visual question answer (VQA) with a metric of answer accuracy. The results demonstrate the superiority compared with existing models for multi-modal semantic representation, fusion, transmission efficiency and channel robustness.

引用

页码：1344 / 1348

页数：5

共 50 条

[1] An Enhanced Multi-Modal Recommendation Based on Alternate Training With Knowledge Graph Representation
Wang, Yuequn
Dong, Liyan
Zhang, Hao
Ma, Xintao
Li, Yongli
Sun, Minghui
IEEE ACCESS, 2020, 8 : 213012 - 213026
[2] Contrastive Multi-Modal Knowledge Graph Representation Learning
Fang, Quan
Zhang, Xiaowei
Hu, Jun
Wu, Xian
Xu, Changsheng
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 35 (09) : 8983 - 8996
[3] MGKsite: Multi-Modal Knowledge-Driven Site Selection via Intra and Inter-Modal Graph Fusion
Liang, Ke
Meng, Lingyuan
Li, Hao
Liu, Meng
Wang, Siwei
Zhou, Sihang
Liu, Xinwang
He, Kunlun
IEEE TRANSACTIONS ON MULTIMEDIA, 2025, 27 : 1722 - 1735
[4] Semantic2Graph: graph-based multi-modal feature fusion for action segmentation in videos
Junbin Zhang
Pei-Hsuan Tsai
Meng-Hsun Tsai
Applied Intelligence, 2024, 54 : 2084 - 2099
[5] Semantic2Graph: graph-based multi-modal feature fusion for action segmentation in videos
Zhang, Junbin
Tsai, Pei-Hsuan
Tsai, Meng-Hsun
APPLIED INTELLIGENCE, 2024, 54 (02) : 2084 - 2099
[6] Semantic Communication Enhanced by Knowledge Graph Representation Learning
Hello, Nour
Di Lorenzo, Paolo
Strinati, Emilio Calvanese
2024 IEEE 25TH INTERNATIONAL WORKSHOP ON SIGNAL PROCESSING ADVANCES IN WIRELESS COMMUNICATIONS, SPAWC 2024, 2024, : 876 - 880
[7] Richpedia: A Comprehensive Multi-modal Knowledge Graph
Wang, Meng
Qi, Guilin
Wang, Haofen
Zheng, Qiushuo
SEMANTIC TECHNOLOGY, JIST 2019: PROCEEDINGS, 2020, 12032 : 130 - 145
[8] MultiJAF: Multi-modal joint entity alignment framework for multi-modal knowledge graph
Cheng, Bo
Zhu, Jia
Guo, Meimei
NEUROCOMPUTING, 2022, 500 : 581 - 591
[9] Towards Using Semantic-Web Technologies for Multi-Modal Knowledge Graph Construction
Baumgartner, Matthias
Rossetto, Luca
Bernstein, Abraham
MM '20: PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, 2020, : 4645 - 4649
[10] Multi-Modal Sensor Fusion-Based Semantic Segmentation for Snow Driving Scenarios
Vachmanus, Sirawich
Ravankar, Ankit A.
Emaru, Takanori
Kobayashi, Yukinori
IEEE SENSORS JOURNAL, 2021, 21 (15) : 16839 - 16851

← 1 2 3 4 5 →