MGKGR: Multimodal Semantic Fusion for Geographic Knowledge Graph Representation

Cited by: 0

Authors:

Zhang, Jianqiang [1]

Chen, Renyao [1]

Li, Shengwen [1,2,3]

Li, Tailong [4]

Yao, Hong [1,2,3,4]

Affiliations:

[1] China Univ Geosci, Sch Comp Sci, Wuhan 430074, Peoples R China

[2] China Univ Geosci, State Key Lab Biogeol & Environm Geol, Wuhan 430074, Peoples R China

[3] China Univ Geosci, Hubei Key Lab Intelligent Geoinformat Proc, Wuhan 430078, Peoples R China

[4] China Univ Geosci, Sch Future Technol, Wuhan 430074, Peoples R China

Source:

ALGORITHMS | 2024 / Volume 17 / Issue 12

Keywords:

multimodal; geographic knowledge graph; knowledge graph representation

DOI:

10.3390/a17120593

Chinese Library Classification (CLC) number:

TP18 [Artificial Intelligence Theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

Geographic knowledge graph representation learning embeds the entities and relationships of a geographic knowledge graph into a low-dimensional continuous vector space, serving as a basic method that bridges geographic knowledge graphs and geographic applications. Previous geographic knowledge graph representation methods learn the vectors of entities and relationships primarily from spatial attributes and spatial relationships; they ignore the other semantics of entities, which results in poor embeddings. This study proposes a two-stage multimodal geographic knowledge graph representation (MGKGR) model that integrates multiple kinds of semantics to improve embedding learning. In the first stage, a spatial feature fusion method for modality enhancement combines the structural features of geographic knowledge graphs with semantic features from two modalities. In the second stage, a multi-level modality feature fusion method integrates heterogeneous features from the different modalities. Fusing the semantics of text and images improves geographic knowledge graph representation and provides accurate representations for downstream geographic intelligence tasks. Extensive experiments on two datasets show that the proposed MGKGR model outperforms the baselines. The results also demonstrate that integrating textual and image data into geographic knowledge graphs can effectively enhance the model's performance.
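
The abstract gives no formulas, so the following is only a minimal sketch of what a two-stage fusion pipeline of this general shape might look like, not the MGKGR implementation. Everything in it is an assumption made for illustration: the function names (`enhance`, `fuse`, `embed_entity`, `transe_score`), the additive projection used for stage one, the attention-weighted sum used for stage two, the TransE-style scoring function, and all dimensions and random weights.

```python
import math
import random


def linear(vec, weights):
    """Project vec with a weight matrix given as a list of rows."""
    return [sum(w * x for w, x in zip(row, vec)) for row in weights]


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def enhance(structural, modal, weights):
    """Stage 1 (assumed form): project a modal feature into the
    structural space and add it to the structural embedding."""
    projected = linear(modal, weights)
    return [s + p for s, p in zip(structural, projected)]


def fuse(features, query):
    """Stage 2 (assumed form): attention-weighted sum of feature vectors,
    with dot-product scores against a query vector."""
    scores = [sum(f * q for f, q in zip(feat, query)) for feat in features]
    alphas = softmax(scores)
    dim = len(features[0])
    return [sum(a * feat[i] for a, feat in zip(alphas, features)) for i in range(dim)]


def transe_score(head, relation, tail):
    """TransE-style plausibility: negative L2 norm of h + r - t.
    Swapped in for illustration; the abstract names no scoring function."""
    return -math.sqrt(sum((h + r - t) ** 2 for h, r, t in zip(head, relation, tail)))


# Hypothetical sizes and randomly initialised projection matrices.
rng = random.Random(0)
DIM, TEXT_DIM, IMG_DIM = 4, 6, 5


def random_matrix(rows, cols):
    return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]


W_TEXT = random_matrix(DIM, TEXT_DIM)
W_IMG = random_matrix(DIM, IMG_DIM)


def embed_entity(structural, text_feat, image_feat):
    """Run both assumed stages for one entity."""
    text_enhanced = enhance(structural, text_feat, W_TEXT)
    image_enhanced = enhance(structural, image_feat, W_IMG)
    return fuse([structural, text_enhanced, image_enhanced], query=structural)
```

In a trained model the projection matrices and embeddings would be learned by ranking true triples above corrupted ones; here they are random, so the sketch only shows how the data flows through the two stages.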

Pages: 16
