MERGE: A Modal Equilibrium Relational Graph Framework for Multi-Modal Knowledge Graph Completion

被引:0
作者
Shang, Yuying [1 ,2 ,3 ,4 ]
Fu, Kun [1 ,2 ,3 ]
Zhang, Zequn [1 ,2 ]
Jin, Li [1 ,2 ]
Liu, Zinan [1 ,3 ,4 ]
Wang, Shensi [1 ,2 ,3 ,4 ]
Li, Shuchao [1 ,2 ]
机构
[1] Chinese Acad Sci, Aerosp Informat Res Inst, Beijing 100094, Peoples R China
[2] Chinese Acad Sci, Aerosp Informat Res Inst, Key Lab Network Informat Syst Technol NIST, Beijing 100190, Peoples R China
[3] Univ Chinese Acad Sci, Beijing 100190, Peoples R China
[4] Univ Chinese Acad Sci, Sch Elect Elect & Commun Engn, Beijing 100094, Peoples R China
基金
中国国家自然科学基金;
关键词
multi-modal knowledge graph; knowledge graph representation; graph attention network; information integration;
D O I
10.3390/s24237605
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
The multi-modal knowledge graph completion (MMKGC) task aims to automatically mine the missing factual knowledge from the existing multi-modal knowledge graphs (MMKGs), which is crucial in advancing cross-modal learning and reasoning. However, few methods consider the adverse effects caused by different missing modal information in the model learning process. To address the above challenges, we innovatively propose a Modal Equilibrium Relational Graph framEwork, called MERGE. By constructing three modal-specific directed relational graph attention networks, MERGE can implicitly represent missing modal information for entities by aggregating the modal embeddings from neighboring nodes. Subsequently, a fusion approach based on low-rank tensor decomposition is adopted to align multiple modal features in both the explicit structural level and the implicit semantic level, utilizing the structural information inherent in the original knowledge graphs, which enhances the interpretability of the fused features. Furthermore, we introduce a novel interpolation re-ranking strategy to adjust the importance of modalities during inference while preserving the semantic integrity of each modality. The proposed framework has been validated on four publicly available datasets, and the experimental results have demonstrated the effectiveness and robustness of our method in the MMKGC task.
引用
收藏
页数:30
相关论文
共 50 条
  • [41] M3HOGAT: A Multi-View Multi-Modal Multi-Scale High-Order Graph Attention Network for Microbe-Disease Association Prediction
    Wang, Shuang
    Liu, Jin-Xing
    Li, Feng
    Wang, Juan
    Gao, Ying-Lian
    IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2024, 28 (10) : 6259 - 6267
  • [42] A Multi-Role Graph Attention Network for Knowledge Graph Alignment
    Ding, Linyi
    Yuan, Weijie
    Meng, Kui
    Liu, Gongshen
    2021 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2021,
  • [43] Multi-Feature Graph Attention Network for Cross-Modal Video-Text Retrieval
    Hao, Xiaoshuai
    Zhou, Yucan
    Wu, Dayan
    Zhang, Wanqian
    Li, Bo
    Wang, Weiping
    PROCEEDINGS OF THE 2021 INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL (ICMR '21), 2021, : 135 - 143
  • [44] Multi-stream graph attention network for recommendation with knowledge graph
    Hu, Zhifei
    Xia, Feng
    JOURNAL OF WEB SEMANTICS, 2024, 82
  • [45] Enhanced Entity Interaction Modeling for Multi-Modal Entity Alignment
    Li, Jinxu
    Zhou, Qian
    Chen, Wei
    Zhao, Lei
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, PT II, KSEM 2023, 2023, 14118 : 214 - 227
  • [46] A structure distinguishable graph attention network for knowledge base completion
    Xue Zhou
    Bei Hui
    Lizong Zhang
    Kexi Ji
    Neural Computing and Applications, 2021, 33 : 16005 - 16017
  • [47] A structure distinguishable graph attention network for knowledge base completion
    Zhou, Xue
    Hui, Bei
    Zhang, Lizong
    Ji, Kexi
    NEURAL COMPUTING & APPLICATIONS, 2021, 33 (23) : 16005 - 16017
  • [48] Dual graph-structured semantics multi-subspace learning for cross-modal retrieval
    Li, Yirong
    Tang, Xianghong
    Lu, Jianguang
    Huang, Yong
    MULTIMEDIA SYSTEMS, 2024, 30 (05)
  • [49] MERGE: A Multi-graph Attentive Representation learning framework integrating Group information from similar patients
    An, Ying
    Li, Runze
    Chen, Xianlai
    COMPUTERS IN BIOLOGY AND MEDICINE, 2022, 151
  • [50] Temporal knowledge graph representation learning based on relational aggregation
    Su F.-L.
    Jing N.
    Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science), 2023, 57 (02): : 235 - 242