MateTee: A Semantic Similarity Metric Based on Translation Embeddings for Knowledge Graphs

被引:6
作者
Morales, Camilo [1 ,2 ]
Collarana, Diego [1 ,2 ]
Vidal, Maria-Esther [2 ,3 ]
Auer, Soeren [1 ,2 ]
机构
[1] Univ Bonn, Enterprise Informat Syst EIS, Bonn, Germany
[2] Fraunhofer Inst Intelligent Anal & Informat Syst, St Augustin, Germany
[3] Univ Simon Bolivar, Caracas, Venezuela
来源
WEB ENGINEERING (ICWE 2017) | 2017年 / 10360卷
关键词
GENE ONTOLOGY;
D O I
10.1007/978-3-319-60131-1_14
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Large Knowledge Graphs (KGs), e.g., DBpedia or Wikidata, are created with the goal of providing structure to unstructured or semi-structured data. Having these special datasets constantly evolving, the challenge is to utilize them in a meaningful, accurate, and efficient way. Further, exploiting semantics encoded in KGs, e.g., class and property hierarchies, provides the basis for addressing this challenge and producing a more accurate analysis of KG data. Thus, we focus on the problem of determining relatedness among entities in KGs, which corresponds to a fundamental building block for any semantic data integration task. We devise MateTee, a semantic similarity measure that combines the gradient descent optimization method with semantics encoded in ontologies, to precisely compute values of similarity between entities in KGs. We empirically study the accuracy of MateTee with respect to state-of-the-art methods. The observed results show that MateTee is competitive in terms of accuracy with respect to existing methods, with the advantage that background domain knowledge is not required.
引用
收藏
页码:246 / 263
页数:18
相关论文
共 22 条
[1]  
Benik Joseph, 2012, Data Integration in the Life Sciences. Proceedings 8th International Conference, DILS 2012, P21, DOI 10.1007/978-3-642-31040-9_3
[2]   A New Look at the Semantic Web [J].
Bernstein, Abraham ;
Hendler, James ;
Noy, Natalya .
COMMUNICATIONS OF THE ACM, 2016, 59 (09) :35-37
[3]  
Bordes A., 2011, Learning structured embeddings of knowledge bases
[4]  
Bordes A, 2013, P 26 INT C NEURAL IN, P2787
[5]   FuhSen: A Federated Hybrid Search Engine for Building a Knowledge Graph On-Demand [J].
Collarana, Diego ;
Galkin, Mikhail ;
Lange, Christoph ;
Grangel-Gonzalez, Irlan ;
Vidal, Maria-Esther ;
Auer, Soeren .
ON THE MOVE TO MEANINGFUL INTERNET SYSTEMS: OTM 2016 CONFERENCES, 2016, 10033 :752-761
[6]   Measuring semantic similarity between Gene Ontology terms [J].
Couto, Francisco M. ;
Silva, Mario J. ;
Coutinho, Pedro M. .
DATA & KNOWLEDGE ENGINEERING, 2007, 61 (01) :137-152
[7]  
Devos D, 2000, PROTEINS, V41, P98, DOI 10.1002/1097-0134(20001001)41:1<98::AID-PROT120>3.0.CO
[8]  
2-S
[9]  
Glorot X., 2010, P 13 INT C ART INT S, P249, DOI DOI 10.1109/LGRS.2016.2565705
[10]   node2vec: Scalable Feature Learning for Networks [J].
Grover, Aditya ;
Leskovec, Jure .
KDD'16: PROCEEDINGS OF THE 22ND ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2016, :855-864