Knowledge Graph Completeness: A Systematic Literature Review

被引:39
作者
Issa, Subhi [1 ]
Adekunle, Onaopepo [2 ]
Hamdi, Faycal [1 ]
Cherfi, Samira Si-Said [1 ]
Dumontier, Michel [2 ]
Zaveri, Amrapali [2 ]
机构
[1] Conservatoire Natl Arts & Metiers, Ctr Studies & Res Comp Sci & Commun CEDRIC, F-75003 Paris, France
[2] Maastricht Univ, Inst Data Sci, NL-6229 GT Maastricht, Netherlands
来源
IEEE ACCESS | 2021年 / 9卷
关键词
Data integrity; Linked data; Systematics; Measurement; Bibliographies; Tools; Search problems; Assessment; completeness; data quality; KG; knowledge graph; linked data; LOD; metrics; survey; systematic literature review; DATA QUALITY; LINKED DATA;
D O I
10.1109/ACCESS.2021.3056622
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The quality of a Knowledge Graph (also known as Linked Data) is an important aspect to indicate its fitness for use in an application. Several quality dimensions are identified, such as accuracy, completeness, timeliness, provenance, and accessibility, which are used to assess the quality. While many prior studies offer a landscape view of data quality dimensions, here we focus on presenting a systematic literature review for assessing the completeness of Knowledge Graph. We gather existing approaches from the literature and analyze them qualitatively and quantitatively. In particular, we unify and formalize commonly used terminologies across 56 articles related to the completeness dimension of data quality and provide a comprehensive list of methodologies and metrics used to evaluate the different types of completeness. We identify seven types of completeness, including three types that were not previously identified in previous surveys. We also analyze nine different tools capable of assessing Knowledge Graph completeness. The aim of this Systematic Literature Review is to provide researchers and data curators a comprehensive and deeper understanding of existing works on completeness and its properties, thereby encouraging further experimentation and development of new approaches focused on completeness as a data quality dimension of Knowledge Graph.
引用
收藏
页码:31322 / 31339
页数:18
相关论文
共 79 条
[1]   Enhancing answer completeness of SPARQL queries via crowdsourcing [J].
Acosta, Maribel ;
Simperl, Elena ;
Floeck, Fabian ;
Vidal, Maria-Esther .
JOURNAL OF WEB SEMANTICS, 2017, 45 :41-62
[2]  
Albertoni R., 2015, P CEUR WORKSH, V1376, P1
[3]  
Albertoni R., 2013, P JOINT EDBTICDT 201, P52
[4]   A Model for Linked Open Data Acquisition and SPARQL Query Generation [J].
Alec, Celine ;
Reynaud-Delaitre, Chantal ;
Safar, Brigitte .
GRAPH-BASED REPRESENTATION AND REASONING (ICCS 2016), 2016, 9717 :237-251
[5]  
Ali M., 2018, J THEOR APPL INF TEC, V96, P3924
[6]  
[Anonymous], 2018, COMP WEB C 2018 WEB
[7]  
[Anonymous], 2012, P 2012 JOINT EDBT IC, DOI DOI 10.1145/2320765.2320803
[8]  
[Anonymous], 2014, P COLD
[9]   Towards An Objective Assessment Framework for Linked Data Quality: Enriching Dataset Profiles with Quality Indicators [J].
Assaf, Ahmad ;
Senart, Aline ;
Troncy, Raphael .
INTERNATIONAL JOURNAL ON SEMANTIC WEB AND INFORMATION SYSTEMS, 2016, 12 (03) :111-133
[10]   What's up LOD Cloud? Observing the State of Linked Open Data Cloud Metadata [J].
Assaf, Ahmad ;
Troncy, Raphael ;
Senart, Aline .
SEMANTIC WEB: ESWC 2015 SATELLITE EVENTS, 2015, 9341 :247-254