Cross-Lingual Entity Alignment via Joint Attribute-Preserving Embedding

被引:301
作者
Sun, Zequn [1 ]
Hu, Wei [1 ]
Li, Chengkai [2 ]
机构
[1] Nanjing Univ, State Key Lab Novel Software Technol, Nanjing, Peoples R China
[2] Univ Texas Arlington, Dept Comp Sci & Engn, Arlington, TX 76019 USA
来源
SEMANTIC WEB - ISWC 2017, PT I | 2017年 / 10587卷
基金
美国国家科学基金会; 中国国家自然科学基金;
关键词
Cross-lingual entity alignment; Knowledge base embedding; Joint attribute-preserving embedding;
D O I
10.1007/978-3-319-68288-4_37
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Entity alignment is the task of finding entities in two knowledge bases (KBs) that represent the same real-world object. When facing KBs in different natural languages, conventional cross-lingual entity alignment methods rely on machine translation to eliminate the language barriers. These approaches often suffer from the uneven quality of translations between languages. While recent embedding-based techniques encode entities and relationships in KBs and do not need machine translation for cross-lingual entity alignment, a significant number of attributes remain largely unexplored. In this paper, we propose a joint attribute-preserving embedding model for cross-lingual entity alignment. It jointly embeds the structures of two KBs into a unified vector space and further refines it by leveraging attribute correlations in the KBs. Our experimental results on real-world datasets show that this approach significantly outperforms the state-of-the-art embedding approaches for cross-lingual entity alignment and could be complemented with methods based on machine translation.
引用
收藏
页码:628 / 644
页数:17
相关论文
共 25 条
[1]  
[Anonymous], 2013, ARXIV13013618
[2]  
[Anonymous], 2013, Bilingual word embeddings for phrasebased machine translation
[3]  
[Anonymous], 2017, P 20 INT C EXTENDING
[4]  
Bordes A., 2011, AAAI C ARTIF INTELL, P301
[5]  
Bordes A, 2013, P 26 INT C NEURAL IN, P2787
[6]  
Cheatham M, 2013, LECT NOTES COMPUT SC, V8219, P294, DOI 10.1007/978-3-642-41338-4_19
[7]  
Chen MH, 2017, PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P1511
[8]  
Duchi J, 2011, J MACH LEARN RES, V12, P2121
[9]  
Feng X., 2016, PROC 26 INT C COMPUT, P2935
[10]  
Fu B., 2010, Proceedings of the 1st Workshop on the Multilingual Semantic Web, P13, DOI DOI 10.1080/17517575.2016.1161238