On Data Quality Assurance and Conflation Entanglement in Crowdsourcing for Environmental Studies

被引:18
作者
Leibovici, Didier G. [1 ]
Rosser, Julian F. [1 ]
Hodges, Crona [2 ]
Evans, Barry [3 ]
Jackson, Michael J. [1 ]
Higgins, Chris I. [4 ]
机构
[1] Univ Nottingham, Nottingham Geospatial Inst, Nottingham NG7 2RD, England
[2] Aberystwyth Univ Penglais, Earth Observat Grp, Aberystwyth SY23 3FL, Dyfed, Wales
[3] Univ Exeter, Coll Engn Math & Phys Sci, Exeter EX4 4QF, Devon, England
[4] Univ Edinburgh, EDINA, Edinburgh EH8 9YL, Midlothian, Scotland
关键词
data curation; data quality; ISO standard; data fusion; data conflation; citizen science; crowdsourcing; VOLUNTEERED GEOGRAPHIC INFORMATION;
D O I
10.3390/ijgi6030078
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Volunteer geographical information (VGI), either in the context of citizen science or the mining of social media, has proven to be useful in various domains including natural hazards, health status, disease epidemics, and biological monitoring. Nonetheless, the variable or unknown data quality due to crowdsourcing settings are still an obstacle for fully integrating these data sources in environmental studies and potentially in policy making. The data curation process, in which a quality assurance (QA) is needed, is often driven by the direct usability of the data collected within a data conflation process or data fusion (DCDF), combining the crowdsourced data into one view, using potentially other data sources as well. Looking at current practices in VGI data quality and using two examples, namely land cover validation and inundation extent estimation, this paper discusses the close links between QA and DCDF. It aims to help in deciding whether a disentanglement can be possible, whether beneficial or not, in understanding the data curation process with respect to its methodology for future usage of crowdsourced data. Analysing situations throughout the data curation process where and when entanglement between QA and DCDF occur, the paper explores the various facets of VGI data capture, as well as data quality assessment and purposes. Far from rejecting the usability ISO quality criterion, the paper advocates for a decoupling of the QA process and the DCDF step as much as possible while still integrating them within an approach analogous to a Bayesian paradigm.
引用
收藏
页数:17
相关论文
共 43 条
[1]  
Alabri A., 2010, P 2010 IEEE 6 INT C
[2]  
[Anonymous], 2011, 44 HAWAII INT C SYST
[3]  
[Anonymous], 1985, Statistical method for meta-analysis
[4]  
Antelio M., 2012, P 2012 IEEE INT C SY
[5]  
Bishr M., 2013, LECT NOTES COMPUTER, P53
[6]   On predicting and improving the quality of Volunteer Geographic Information projects [J].
Bordogna, Gloria ;
Carrara, Paola ;
Criscuolo, Laura ;
Pepe, Monica ;
Rampini, Anna .
INTERNATIONAL JOURNAL OF DIGITAL EARTH, 2016, 9 (02) :134-155
[7]  
Chapman C., 2016, CAN CITIZEN SCI SERI
[8]   Using control data to determine the reliability of volunteered geographic information about land cover [J].
Comber, Alexis ;
See, Linda ;
Fritz, Steffen ;
Van der Velde, Marijn ;
Perger, Christoph ;
Foody, Giles .
INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION, 2013, 23 :37-48
[9]   Data democracy - increased supply of geospatial information and expanded participatory processes in the production of data [J].
Craglia, Max ;
Shanley, Lea .
INTERNATIONAL JOURNAL OF DIGITAL EARTH, 2015, 8 (09) :679-693
[10]  
Evans B., 2016, P COWM2016INTERNATIO