A Critical Assessment of State-of-the-Art in Entity Alignment

被引:2
作者
Berrendorf, Max [1 ]
Wacker, Ludwig [1 ]
Faerman, Evgeniy [1 ]
机构
[1] Ludwig Maximilians Univ Munchen, Munich, Germany
来源
ADVANCES IN INFORMATION RETRIEVAL, ECIR 2021, PT II | 2021年 / 12657卷
关键词
Knowledge Graph; Entity Alignment; Word embeddings;
D O I
10.1007/978-3-030-72240-1_2
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this work, we perform an extensive investigation of two state-of-the-art (SotA) methods for the task of Entity Alignment in Knowledge Graphs. Therefore, we first carefully examine the benchmarking process and identify several shortcomings, making the results reported in the original works not always comparable. Furthermore, we suspect that it is a common practice in the community to make the hyperparameter optimization directly on a test set, reducing the informative value of reported performance. Thus, we select a representative sample of benchmarking datasets and describe their properties. We also examine different initializations for entity representations since they are a decisive factor for model performance. Furthermore, we use a shared train/validation/test split for an appropriate evaluation setting to evaluate all methods on all datasets. In our evaluation, we make several interesting findings. While we observe that most of the time SotA approaches perform better than baselines, they have difficulties when the dataset contains noise, which is the case in most real-life applications. Moreover, in our ablation study, we find out that often different features of SotA method are crucial for good performance than previously assumed. The code is available at https://github.com/mberr/ea-sota-comparison.
引用
收藏
页码:18 / 32
页数:15
相关论文
共 25 条
[1]  
Bergstra J, 2012, J MACH LEARN RES, V13, P281
[2]  
Berrendorf M., 2020, CoRR
[3]   Knowledge Graph Entity Alignment with Graph Convolutional Networks: Lessons Learned [J].
Berrendorf, Max ;
Faerman, Evgeniy ;
Melnychuk, Valentyn ;
Tresp, Volker ;
Seidl, Thomas .
ADVANCES IN INFORMATION RETRIEVAL, ECIR 2020, PT II, 2020, 12036 :3-11
[4]  
Cao YX, 2019, 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), P1452
[5]  
Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
[6]   Special issue on knowledge graphs and semantics in text analysis and retrieval [J].
Dietz, Laura ;
Xiong, Chenyan ;
Dalton, Jeff ;
Meij, Edgar .
INFORMATION RETRIEVAL JOURNAL, 2019, 22 (3-4) :229-231
[7]  
Fey Matthias, 2020, INT C LEARN REPR
[8]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[9]  
Li CJ, 2019, 2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019), P2723
[10]   MRAEA: An Efficient and Robust Entity Alignment Approach for Cross-lingual Knowledge Graph [J].
Mao, Xin ;
Wang, Wenting ;
Xu, Huimin ;
Lan, Man ;
Wu, Yuanbin .
PROCEEDINGS OF THE 13TH INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING (WSDM '20), 2020, :420-428