Data Quality Assessment in the Integration Process of Linked Open Data (LOD)

被引:10
作者
Ahmed, Hana Haj [1 ]
机构
[1] Univ Manouba, Lab ENSI, RIADI, Manouba, Tunisia
来源
2017 IEEE/ACS 14TH INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS (AICCSA) | 2017年
关键词
data quality; linked open data; assessment; data integration; improvement;
D O I
10.1109/AICCSA.2017.178
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Linked Open Data (LOD) entails a set of best practices for publishing and connecting structured data on the Web, which allows sharing and exchanging information in an inter-operable and reusable manner. The increasing adoption of these principles has lead to the creation of a globally distributed and huge informative space that covers various domains such as government, libraries, life sciences, and media. This offers a great opportunity to end-users to build semantic applications by exploring and consuming heterogeneous and dispersed possibly interlinked data. Thus, consuming linked data can be considered as a typical scenario of linked data integration in which a user requires to combine data residing in large and varying quality LOD datasets. In this paper, we examine the specifics of linked data integration and focus on three key challenges, namely data quality profiling and assessment, conflict resolution and quality improvement. We postulate that data quality assessment can act both as a deciding factor for conflict resolution and as an indicator of low quality data which need to be improved.
引用
收藏
页码:1 / 6
页数:6
相关论文
共 25 条
[1]  
[Anonymous], 2006, Linked data
[2]  
[Anonymous], 2002, P ACM SIGACT SIGMOD, DOI DOI 10.1145/543613.543644
[3]  
[Anonymous], FUNDAMENTALS DATA WA
[4]  
[Anonymous], 2012, WWW
[5]  
[Anonymous], 2014, Semantic Web Journal
[6]  
[Anonymous], 2009, CEUR WORKSHOP P
[7]  
Bizer C, 2007, QUALITY DRIVEN INFOR
[8]  
Dimou A., 2015, P INT SEMANTICWEB C, P133
[9]  
Frber C., 2010, BUSINESS INFORM SYST, P3546
[10]  
Gil Y., 2002, P INT SEM WEB C ISWC, P162