Data linking over RDF knowledge graphs: A survey

Cited by: 7
Authors
Assi, Ali [1]
Mcheick, Hamid [2]
Dhifli, Wajdi [3]
Affiliations
[1] Univ Quebec Montreal, Montreal, PQ, Canada
[2] Univ Quebec Chicoutimi, Chicoutimi, PQ, Canada
[3] Univ Lille, CHU Lille, ULR 2694 Metr Evaluat Technol Sante & Prat Med, F-59000 Lille, France
Keywords
data linking; instance matching; knowledge graph; record linkage; semantic web; web of data; ENTITY RESOLUTION; RECORD LINKAGE; LINKED DATA; DISCOVERY; ALGORITHM; ALIGNMENT; FACTS;
DOI
10.1002/cpe.5746
Chinese Library Classification (CLC) number
TP31 [Computer Software]
Subject classification codes
081202; 0835
Abstract
Instance matching (IM) is the process of matching instances across Knowledge Bases (KBs) that refer to the same real-world object (e.g., the same person in two different KBs). Several approaches in the literature were developed to perform this process using different algorithmic techniques and search strategies. In this article, we aim to provide the rationale for IM and to survey the existing algorithms for performing this task. We begin by identifying the importance of such a process and defining it formally. We also provide a new classification of these approaches depending on the "source of evidence," which can be considered as the context information integrated explicitly or implicitly in the IM process. We survey and discuss the state-of-the-art IM methods with regard to this context information. Furthermore, we describe and compare different state-of-the-art IM approaches in relation to several criteria. Such a comprehensive comparative study constitutes an asset and a guide for future research in IM.
Pages: 40