LODQuMa: A Free-ontology process for Linked (Open) Data quality management

被引:0
作者
Salem, Samah [1 ]
Benchikha, Fouzia [1 ]
机构
[1] Abdelhamid Mehri Constantine 2 Univ, LIRE Lab, Constantine, Algeria
关键词
Linked Open Data; Quality assessment; Quality improvement; Synonym predicates; Profiling statistics; DBpedia;
D O I
10.1016/j.jksuci.2021.06.001
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
For many years, data quality is among the most commonly discussed issue in Linked Open Data (LOD) due to the huge volume of integrated datasets that are usually heterogeneous. Several ontology-based approaches dealing with quality problems have been proposed. However, when datasets lack a well-defined schema, these approaches become ineffective because of the lack of metadata. Moreover, the detection of quality problems based on an analysis between RDF (Resource Description Framework) triples without requiring ontology statistical and semantical information is not addressed. Keeping in mind that ontologies are not always available and they may be incomplete or misused. In this paper, a novel free-ontology process called LODQuMa is proposed to assess and improve the quality of LOD. It is mainly based on profiling statistics, synonym relationships between predicates, QVCs (Quality Verification Cases), and SPARQL (SPARQL Protocol and RDF Query Language) query templates. Experiments on the DBpedia dataset demonstrate that the proposed process is effective for increasing the intrinsic quality dimensions, resulting in correct and compact datasets.(c) 2021 The Authors. Published by Elsevier B.V. on behalf of King Saud University.
引用
收藏
页码:5552 / 5563
页数:12
相关论文
共 27 条
[1]  
Abedjan Ziawasch, 2013, Semantic Web: Semantics and Big Data. Proceedings of 10th International Conference (ESWC 2013): LNCS 7882, P140
[2]  
Abedjan Z, 2014, PROC INT CONF DATA, P1198, DOI 10.1109/ICDE.2014.6816740
[3]  
Andrea C., 2002, CHRISTIAN SCI MONITO
[4]   Roomba: An Extensible Framework to Validate and Build Dataset Profiles [J].
Assaf, Ahmad ;
Troncy, Raphael ;
Senart, Aline .
SEMANTIC WEB: ESWC 2015 SATELLITE EVENTS, 2015, 9341 :325-339
[5]  
Atkinson K., 2006, Gnu aspell 0.60. 4
[6]  
Beek Wouter, 2018, Semantic Web - Interoperability, Usability, Applicability, V9, P131, DOI 10.3233/SW-170288
[7]   Evaluating the quality of linked open data in digital libraries [J].
Candela, Gustavo ;
Escobar, Pilar ;
Carrasco, Rafael C. ;
Marco-Such, Manuel .
JOURNAL OF INFORMATION SCIENCE, 2022, 48 (01) :21-43
[8]   Luzzu-A Methodology and Framework for Linked Data Quality Assessment [J].
Debattista, Jeremy ;
Auer, Soeren ;
Lange, Christoph .
ACM JOURNAL OF DATA AND INFORMATION QUALITY, 2016, 8 (01)
[9]  
Doroba I.C., 2020, EL SOFTW ED C, P375, DOI [10.12753/2066-026X-20-133, DOI 10.12753/2066-026X-20-133]
[10]  
Furber C., 2011, ECIS 2011 Proceedings