Can The Crowd Identify Misinformation Objectively? The Effects of Judgment Scale and Assessor's Background

被引:30
作者
Roitero, Kevin [1 ]
Soprano, Michael [1 ]
Fan, Shaoyang [2 ]
Spina, Damiano [3 ]
Mizzaro, Stefano [1 ]
Demartini, Gianluca [2 ]
机构
[1] Univ Udine, Udine, Italy
[2] Univ Queensland, Brisbane, Qld, Australia
[3] RMIT Univ, Melbourne, Vic, Australia
来源
PROCEEDINGS OF THE 43RD INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '20) | 2020年
基金
澳大利亚研究理事会;
关键词
Crowdsourcing; Information Credibility; Classification; VERIFICATION;
D O I
10.1145/3397271.3401112
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Truthfulness judgments are a fundamental step in the process of fighting misinformation, as they are crucial to train and evaluate classifiers that automatically distinguish true and false statements. Usually such judgments are made by experts, like journalists for political statements or medical doctors for medical statements. In this paper, we follow a different approach and rely on (non-expert) crowd workers. This of course leads to the following research question: Can crowdsourcing be reliably used to assess the truthfulness of information and to create large-scale labeled collections for information credibility systems? To address this issue, we present the results of an extensive study based on crowdsourcing: we collect thousands of truthfulness assessments over two datasets, and we compare expert judgments with crowd judgments, expressed on scales with various granularity levels. We also measure the political bias and the cognitive background of the workers, and quantify their effect on the reliability of the data provided by the crowd.
引用
收藏
页码:439 / 448
页数:10
相关论文
共 57 条
[1]  
Agresti Alan, 2012, Analysis of Ordinal Categorical Data, V2nd, DOI DOI 10.1002/9780470594001
[2]   Evaluating Topic Representations for Exploring Document Collections [J].
Aletras, Nikolaos ;
Baldwin, Timothy ;
Lau, Jey Han ;
Stevenson, Mark .
JOURNAL OF THE ASSOCIATION FOR INFORMATION SCIENCE AND TECHNOLOGY, 2017, 68 (01) :154-167
[3]  
Amigó E, 2013, SIGIR'13: THE PROCEEDINGS OF THE 36TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH & DEVELOPMENT IN INFORMATION RETRIEVAL, P643
[4]  
Amigó E, 2013, LECT NOTES COMPUT SC, V8138, P333, DOI 10.1007/978-3-642-40802-1_31
[5]  
[Anonymous], 2008, P 2008 INT C WEB SEA, DOI [10.1145/1341531, DOI 10.1145/1341531.1341545]
[6]   Automatic Fact-Checking Using Context and Discourse Information [J].
Atanasova, Pepa ;
Nakov, Preslav ;
Marquez, Lluis ;
Barron-Cedeno, Alberto ;
Karadzhov, Georgi ;
Mihaylova, Tsvetomila ;
Mohtarami, Mitra ;
Glass, James .
ACM JOURNAL OF DATA AND INFORMATION QUALITY, 2019, 11 (03)
[7]  
Checco Alessandro, 2017, 5 AAAI C HUM COMP CR
[8]  
Conroy Niall J, 2015, Proc. Assoc. Inf. Sci. Technol, P82, DOI DOI 10.1002/PRA2.2015.145052010082
[9]  
Cuzzocrea Alfredo, 2019, CEUR WORKSHOP PROC, V2482
[10]   MULTIPLE COMPARISONS USING RANK SUMS [J].
DUNN, OJ .
TECHNOMETRICS, 1964, 6 (03) :241-&