Contrastive Multi-Modal Knowledge Graph Representation Learning

被引:12
作者
Fang, Quan [1 ]
Zhang, Xiaowei [2 ]
Hu, Jun [1 ]
Wu, Xian [3 ]
Xu, Changsheng [1 ,4 ]
机构
[1] Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100190, Peoples R China
[2] Zhengzhou Univ, Zhengzhou 450001, Peoples R China
[3] Tencent Med AI Lab, Beijing 100080, Peoples R China
[4] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100049, Peoples R China
基金
中国国家自然科学基金;
关键词
Knowledge graph; multimedia; graph neural network; contrastive learning; NETWORK;
D O I
10.1109/TKDE.2022.3220625
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Representation learning of knowledge graphs (KGs) aims to embed both entities and relations as vectors in a continuous low-dimensional space, which has facilitated various applications such as link prediction and entity retrieval. Most existing KG embedding methods focus on modeling the structured fact triples independently and ignore the multi-type relations among triples as well as the variety of data types (e.g., texts and images) associated with entities in KGs, and thus fail to capture the complex and multi-modal information that is inherently inside the entity-relation triples. In this paper, we propose a novel approach for knowledge graph embedding named Contrastive Multi-modal Graph Neural Network (CMGNN), which can encapsulate comprehensive features from multi-modal content descriptions of entities and high-order connectivity structures. Specifically, CMGNN first learns entity embeddings from multi-modal content and then contrasts encodings from multi-relational local neighbors and high-order connectivities to obtain latent representations of entities and relations simultaneously. Experimental results demonstrate that CMGNN can effectively model the multi-modalities and multi-type structures in KGs, and significantly outperforms existing state-of-the-art methods on benchmark datasets for the tasks of link prediction and entity classification.
引用
收藏
页码:8983 / 8996
页数:14
相关论文
共 75 条
[1]  
[Anonymous], 2014, 2 INT C LEARNING REP
[2]  
Arora S, 2020, Arxiv, DOI arXiv:2007.12374
[3]  
Balazevic I, 2019, 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019), P5185
[4]   Hypernetwork Knowledge Graph Embeddings [J].
Balazevic, Ivana ;
Allen, Carl ;
Hospedales, Timothy M. .
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2019: WORKSHOP AND SPECIAL SESSIONS, 2019, 11731 :553-565
[5]  
Bollacker KD., 2008, P ACM SIGMOD INT C M, P1247, DOI DOI 10.1145/1376616.1376746
[6]  
Bordes Antoine, 2014, Machine Learning and Knowledge Discovery in Databases. European Conference, ECML PKDD 2014. Proceedings: LNCS 8724, P165, DOI 10.1007/978-3-662-44848-9_11
[7]  
Bordes A, 2011, P 25 AAAI C ART INT, P301, DOI DOI 10.1016/J.PROCS.2017.05.045
[8]  
Bordes A., 2013, P 26 INT C NEUR INF, V2, P2787
[9]   A semantic matching energy function for learning with multi-relational data Application to word-sense disambiguation [J].
Bordes, Antoine ;
Glorot, Xavier ;
Weston, Jason ;
Bengio, Yoshua .
MACHINE LEARNING, 2014, 94 (02) :233-259
[10]   TransGCN:Coupling Transformation Assumptions with Graph Convolutional Networks for Link Prediction [J].
Cai, Ling ;
Yan, Bo ;
Mai, Gengchen ;
Janowicz, Krzysztof ;
Zhu, Rui .
PROCEEDINGS OF THE 10TH INTERNATIONAL CONFERENCE ON KNOWLEDGE CAPTURE (K-CAP '19), 2019, :131-138