Knowledge Graph Embedding for Link Prediction: A Comparative Analysis

被引:360
作者
Rossi, Andrea [1 ]
Barbosa, Denilson [2 ]
Firmani, Donatella [1 ]
Matinata, Antonio [1 ]
Merialdo, Paolo [1 ]
机构
[1] Roma Tre Univ, Dept Engn, Via Vasca Navale 79, I-00146 Rome, Italy
[2] Univ Alberta, Dept Comp Sci, 2-21 Athabasca Hall, Edmonton, AB T6G 2E8, Canada
关键词
Knowledge graphs; link prediction; knowledge graph embeddings; comparative analysis;
D O I
10.1145/3424672
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Knowledge Graphs (KGs) have found many applications in industrial and in academic settings, which in turn, have motivated considerable research efforts towards large-scale information extraction from a variety of sources. Despite such efforts, it is well known that even the largest KGs suffer from incompleteness; Link Prediction (LP) techniques address this issue by identifying missing facts among entities already in the KG. Among the recent LP techniques, those based on KG embeddings have achieved very promising performance in some benchmarks. Despite the fast-growing literature on the subject, insufficient attention has been paid to the effect of the design choices in those methods. Moreover, the standard practice in this area is to report accuracy by aggregating over a large number of test facts in which some entities are vastly more represented than others; this allows LP methods to exhibit good results by just attending to structural properties that include such entities, while ignoring the remaining majority of the KG. This analysis provides a comprehensive comparison of embedding-based LP methods, extending the dimensions of analysis beyond what is commonly available in the literature. We experimentally compare the effectiveness and efficiency of 18 state-of-the-art methods, consider a rule-based baseline, and report detailed analysis over the most popular benchmarks in the literature.
引用
收藏
页数:49
相关论文
共 77 条
[1]  
Agrawal Nilesh, 2020, P 2020 AAAI C ART IN
[2]  
An Bo, 2018, P 2018 C N AM CHAPT
[3]  
[Anonymous], 1999, P TEXT RETRIEVAL C
[4]  
[Anonymous], 2015, P 2015 C EMPIRICAL M
[5]   DBpedia: A nucleus for a web of open data [J].
Auer, Soeren ;
Bizer, Christian ;
Kobilarov, Georgi ;
Lehmann, Jens ;
Cyganiak, Richard ;
Ives, Zachary .
SEMANTIC WEB, PROCEEDINGS, 2007, 4825 :722-+
[6]  
Bahare Fatemi, FASTER SIMPLE IMPLEM
[7]  
Bollacker K, 2008, SIGMOD, P1247, DOI DOI 10.1145/1376616.1376746
[8]  
Bordes A., 2013, Advances in Neural Information Processing Systems, V26, P2787, DOI DOI 10.5555/2999792.2999923
[9]   A Comprehensive Survey of Graph Embedding: Problems, Techniques, and Applications [J].
Cai, HongYun ;
Zheng, Vincent W. ;
Chang, Kevin Chen-Chuan .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2018, 30 (09) :1616-1637
[10]  
Cohen William W., 2011, P 2011 C EMP METH NA