Open data: Quality over quantity

被引:118
作者
Sadiq, Shazia [1 ]
Indulska, Marta [2 ]
机构
[1] Univ Queensland, Sch Informat Technol & Elect Engn, St Lucia, Qld 4072, Australia
[2] Univ Queensland, UQ Business Sch, St Lucia, Qld 4072, Australia
关键词
Open data; Data quality; INFORMATION;
D O I
10.1016/j.ijinfomgt.2017.01.003
中图分类号
G25 [图书馆学、图书馆事业]; G35 [情报学、情报工作];
学科分类号
1205 ; 120501 ;
摘要
Open data aims to unlock the innovation potential of businesses, governments, and entrepreneurs, yet it also harbours significant challenges for its effective use. While numerous innovation successes exist that are based on the open data paradigm, there is uncertainty over the data quality of such datasets. This data quality uncertainty is a threat to the value that can be generated from such data. Data quality has been studied extensively over many decades and many approaches to data quality management have been proposed. However, these approaches are typically based on datasets internal to organizations, with known metadata, and domain knowledge of the data semantics. Open data, on the other hand, are often unfamiliar to the user and may lack metadata. The aim of this research note is to outline the challenges in dealing with data quality of open datasets, and to set an agenda for future research to address this risk to deriving value from open data investments. (C) 2017 Elsevier Ltd. All rights reserved.
引用
收藏
页码:150 / 154
页数:5
相关论文
共 40 条
[1]   Profiling relational data: a survey [J].
Abedjan, Ziawasch ;
Golab, Lukasz ;
Naumann, Felix .
VLDB JOURNAL, 2015, 24 (04) :557-581
[2]  
Abiteboul S., 2015, P 18 INT WORKSH WEB
[3]  
[Anonymous], 2009, Information quality applied: Best practices for improving business information, processes and systems
[4]  
[Anonymous], QUALITY CONTROL HDB
[5]  
[Anonymous], FINDING QUALITY QUAN
[6]  
[Anonymous], 2012, Synthesis Lectures on Data Management, DOI [DOI 10.2200/S00439ED1V01Y201207DTM030, 10.2200/S00439ED1V01Y201207DTM030]
[7]  
[Anonymous], 2011, 80001 ISOTS
[8]  
[Anonymous], 2014, DEC WITH DAT
[9]  
[Anonymous], 2003, EXPLORATORY DATA MIN
[10]  
[Anonymous], 2013, HDB DATA QUALITY