Multi-modal Siamese Network for Entity Alignment

被引:46
作者
Chen, Liyi [1 ]
Li, Zhi [2 ]
Xu, Tong [1 ]
Wu, Han [1 ]
Wang, Zhefeng [3 ]
Yuan, Nicholas Jing [3 ]
Chen, Enhong [1 ]
机构
[1] Univ Sci & Technol China, State Key Lab Cognit Intelligence, Hefei, Peoples R China
[2] Tsinghua Univ, Shenzhen Int Grad Sch, Beijing, Peoples R China
[3] Huawei Cloud, Shenzhen, Peoples R China
来源
PROCEEDINGS OF THE 28TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, KDD 2022 | 2022年
基金
中国国家自然科学基金;
关键词
Entity alignment; multi-modal learning; knowledge graph;
D O I
10.1145/3534678.3539244
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The booming of multi-modal knowledge graphs (MMKGs) has raised the imperative demand for multi-modal entity alignment techniques, which facilitate the integration of multiple MMKGs from separate data sources. Unfortunately, prior arts harness multi-modal knowledge only via the heuristic merging of uni-modal feature embeddings. Therefore, inter-modal cues concealed in multi-modal knowledge could be largely ignored. To deal with that problem, in this paper, we propose a novel Multi-modal Siamese Network for Entity Alignment (MSNEA) to align entities in different MMKGs, in which multi-modal knowledge could be comprehensively leveraged by the exploitation of inter-modal effect. Specifically, we first devise a multi-modal knowledge embedding module to extract visual, relational, and attribute features of entities to generate holistic entity representations for distinct MMKGs. During this procedure, we employ inter-modal enhancement mechanisms to integrate visual features to guide relational feature learning and adaptively assign attention weights to capture valuable attributes for alignment. Afterwards, we design a multi-modal contrastive learning module to achieve inter-modal enhancement fusion with avoiding the overwhelming impact of weak modalities. Experimental results on two public datasets demonstrate that our proposed MSNEA provides state-of-the-art performance with a large margin compared with competitive baselines.
引用
收藏
页码:118 / 126
页数:9
相关论文
共 45 条
[1]  
Bordes A., 2013, NIPS'13, P1
[2]  
Bordes Antoine, 2014, Machine Learning, V2014
[3]  
Cao YX, 2019, 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), P1452
[4]   MMEA: Entity Alignment for Multi-modal Knowledge Graph [J].
Chen, Liyi ;
Li, Zhi ;
Wang, Yijun ;
Xu, Tong ;
Wang, Zhefeng ;
Chen, Enhong .
KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT (KSEM 2020), PT I, 2020, 12274 :134-147
[5]  
Chen Liyi, 2022, IEEE Transactions on Neural Networks and Learning Systems
[6]  
Chen MH, 2017, PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P1511
[7]  
Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848
[8]  
Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
[9]  
Glorot X., 2010, P 13 INT C ART INT S, P249
[10]  
Hadsell R., 2006, 2006 IEEE COMPUT SOC, V2, P1735, DOI [10.1109/CVPR.2006.100, DOI 10.1109/CVPR.2006.100]