Crowdsourcing Linked Data Quality Assessment

被引:0
作者
Acosta, Maribel [1 ]
Zaveri, Amrapali [2 ]
Simperl, Elena [3 ]
Kontokostas, Dimitris [2 ]
Auer, Soeren [4 ,5 ]
Lehmann, Jens [2 ]
机构
[1] Karlsruhe Inst Technol, Inst AIFB, D-76021 Karlsruhe, Germany
[2] Univ Leipzig, Inst Informat, AKSW, D-04109 Leipzig, Germany
[3] Univ Southampton, Web & Internet Sci Grp, Southampton SO9 5NH, Hants, England
[4] Univ Bonn, Enterprise Informat Syst, Bonn, Germany
[5] Univ Bonn, Fraunhofer IAIS, Bonn, Germany
来源
SEMANTIC WEB - ISWC 2013, PART II | 2013年 / 8219卷
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper we look into the use of crowdsourcing as a means to handle Linked Data quality problems that are challenging to be solved automatically. We analyzed the most common errors encountered in Linked Data sources and classified them according to the extent to which they are likely to be amenable to a specific form of crowdsourcing. Based on this analysis, we implemented a quality assessment methodology for Linked Data that leverages the wisdom of the crowds in different ways: (i) a contest targeting an expert crowd of researchers and Linked Data enthusiasts; complemented by (ii) paid microtasks published on Amazon Mechanical Turk. We empirically evaluated how this methodology could efficiently spot quality issues in DBpedia. We also investigated how the contributions of the two types of crowds could be optimally integrated into Linked Data curation processes. The results show that the two styles of crowdsourcing are complementary and that crowdsourcing-enabled quality assessment is a promising and affordable way to enhance the quality of Linked Data.
引用
收藏
页码:260 / 276
页数:17
相关论文
共 16 条
[1]  
[Anonymous], 2010, P 23ND ANN ACM S USE, DOI 10.1145/1866029.1866078
[2]  
[Anonymous], 2010, LDOW
[3]   DBpedia - A crystallization point for the Web of Data [J].
Bizer, Christian ;
Lehmann, Jens ;
Kobilarov, Georgi ;
Auer, Soeren ;
Becker, Christian ;
Cyganiak, Richard ;
Hellmann, Sebastian .
JOURNAL OF WEB SEMANTICS, 2009, 7 (03) :154-165
[4]   Quality-driven information filtering using the WIQA policy framework [J].
Bizer, Christian ;
Cyganiak, Richard .
JOURNAL OF WEB SEMANTICS, 2009, 7 (01) :1-10
[5]   WebTables: Exploring the Power of Tables on the Web [J].
Cafarella, Michael J. ;
Halevy, Alon ;
Wang, Daisy Zhe ;
Wu, Eugene ;
Zhang, Yang .
PROCEEDINGS OF THE VLDB ENDOWMENT, 2008, 1 (01) :538-549
[6]  
Demartini G., 2012, P 21 INT C WORLD WID, P469
[7]  
Flemming A., 2010, THESIS
[8]  
Gueret Christophe, 2012, The Semantic Web: Research and Applications. Proceedings 9th Extended Semantic Web Conference (ESWC 2012), P87, DOI 10.1007/978-3-642-30284-8_13
[9]   An empirical survey of Linked Data conformance [J].
Hogan, Aidan ;
Umbrich, Juergen ;
Harth, Andreas ;
Cyganiak, Richard ;
Polleres, Axel ;
Decker, Stefan .
JOURNAL OF WEB SEMANTICS, 2012, 14 :14-44
[10]  
Markotschi T., 2010, P WORKSH KNOWL INJ E