MGKGR: Multimodal Semantic Fusion for Geographic Knowledge Graph Representation

被引:0
|
作者
Zhang, Jianqiang [1 ]
Chen, Renyao [1 ]
Li, Shengwen [1 ,2 ,3 ]
Li, Tailong [4 ]
Yao, Hong [1 ,2 ,3 ,4 ]
机构
[1] China Univ Geosci, Sch Comp Sci, Wuhan 430074, Peoples R China
[2] China Univ Geosci, State Key Lab Biogeol & Environm Geol, Wuhan 430074, Peoples R China
[3] China Univ Geosci, Hubei Key Lab Intelligent Geoinformat Proc, Wuhan 430078, Peoples R China
[4] China Univ Geosci, Sch Future Technol, Wuhan 430074, Peoples R China
关键词
multimodal; geographic knowledge graph; knowledge graph representation;
D O I
10.3390/a17120593
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Geographic knowledge graph representation learning embeds entities and relationships in geographic knowledge graphs into a low-dimensional continuous vector space, which serves as a basic method that bridges geographic knowledge graphs and geographic applications. Previous geographic knowledge graph representation methods primarily learn the vectors of entities and their relationships from their spatial attributes and relationships, which ignores various semantics of entities, resulting in poor embeddings on geographic knowledge graphs. This study proposes a two-stage multimodal geographic knowledge graph representation (MGKGR) model that integrates multiple kinds of semantics to improve the embedding learning of geographic knowledge graph representation. Specifically, in the first stage, a spatial feature fusion method for modality enhancement is proposed to combine the structural features of geographic knowledge graphs with two modal semantic features. In the second stage, a multi-level modality feature fusion method is proposed to integrate heterogeneous features from different modalities. By fusing the semantics of text and images, the performance of geographic knowledge graph representation is improved, providing accurate representations for downstream geographic intelligence tasks. Extensive experiments on two datasets show that the proposed MGKGR model outperforms the baselines. Moreover, the results demonstrate that integrating textual and image data into geographic knowledge graphs can effectively enhance the model's performance.
引用
收藏
页数:16
相关论文
共 50 条
  • [41] Hybrid Graph Representation Learning for Carotid Artery Stenosis Detection Based on Multimodal Retinal OCTA Images
    Lan, Wenting
    Hao, Jinkui
    Zhou, Shengjun
    Zhang, Jingfeng
    Ma, Shaodong
    Zhao, Yitian
    IEEE ACCESS, 2025, 13 : 9538 - 9548
  • [42] Multichannel Multimodal Emotion Analysis of Cross-Modal Feedback Interactions Based on Knowledge Graph
    Dong, Shaohua
    Fan, Xiaochao
    Ma, Xinchun
    NEURAL PROCESSING LETTERS, 2024, 56 (03)
  • [43] A knowledge-augmented heterogeneous graph convolutional network for aspect-level multimodal sentiment analysis
    Yujie, Wan
    Yuzhong, Chen
    Jiali, Lin
    Jiayuan, Zhong
    Chen, Dong
    COMPUTER SPEECH AND LANGUAGE, 2024, 85
  • [44] A Knowledge-Guided Spatio-Temporal Correlation Measure Considering Rules and Dependency Syntax for Knowledge Graph Adaptive Representation
    Qiu, Qinjun
    Li, Haiyan
    Hu, Xinxin
    Tian, Miao
    Ma, Kai
    Zhu, Yunqiang
    Sun, Kai
    Li, Weirong
    Wang, Shu
    Xie, Zhong
    TRANSACTIONS IN GIS, 2025, 29 (01)
  • [45] Incorporating Domain Knowledge Graph into Multimodal Movie Genre Classification with Self-Supervised Attention and Contrastive Learning
    Li, Jiaqi
    Qi, Guilin
    Zhang, Chuanyi
    Chen, Yongrui
    Tan, Yiming
    Xia, Chenlong
    Tian, Ye
    PROCEEDINGS OF THE 31ST ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2023, 2023, : 3337 - 3345
  • [46] MTKGCformer: A Multi-train Transformer-based Representation Learning for Knowledge Graph Completion task
    Deng, Bowen
    Sun, Ming
    PROCEEDINGS OF THE 8TH INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING, ICDSP 2024, 2024, : 219 - 224
  • [47] Multimodal heterogeneous graph entity-level fusion for named entity recognition with multi-granularity visual guidance
    Gong, Yunchao
    Lv, Xueqiang
    Yuan, Zhu
    Wang, ZhaoJun
    Hu, Feng
    You, Xindong
    JOURNAL OF SUPERCOMPUTING, 2024, 80 (16) : 23767 - 23793
  • [48] Geographic Knowledge Graph Attribute Normalization: Improving the Accuracy by Fusing Optimal Granularity Clustering and Co-Occurrence Analysis
    Yin, Chuan
    Zhang, Binyu
    Liu, Wanzeng
    Du, Mingyi
    Luo, Nana
    Zhai, Xi
    Ba, Tu
    ISPRS INTERNATIONAL JOURNAL OF GEO-INFORMATION, 2022, 11 (07)
  • [49] MAWKDN: A Multimodal Fusion Wavelet Knowledge Distillation Approach Based on Cross-View Attention for Action Recognition
    Quan, Zhenzhen
    Chen, Qingshan
    Zhang, Moyan
    Hu, Weifeng
    Zhao, Qiang
    Hou, Jiangang
    Li, Yujun
    Liu, Zhi
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (10) : 5734 - 5749
  • [50] Hyperspectral Point Cloud Projection for the Semantic Segmentation of Multimodal Hyperspectral and Lidar Data with Point Convolution-Based Deep Fusion Neural Networks
    Decker, Kevin T.
    Borghetti, Brett J.
    APPLIED SCIENCES-BASEL, 2023, 13 (14):