Automatically Identifying Fake News in Popular Twitter Threads

Cited by: 86
Authors
Buntain, Cody [1]
Golbeck, Jennifer [2]
Affiliations
[1] Univ Maryland, College Pk, MD 20742 USA
[2] Univ Maryland, Coll Informat Studies, College Pk, MD 20742 USA
Source
2017 IEEE INTERNATIONAL CONFERENCE ON SMART CLOUD (SMARTCLOUD) | 2017
Keywords
misinformation; credibility; accuracy; data quality; fake news; twitter
DOI
10.1109/SmartCloud.2017.40
Chinese Library Classification (CLC) number
TP301 [Theory, Methods]
Subject classification code
081202
Abstract
Information quality in social media is an increasingly important issue, but web-scale data hinders experts' ability to assess and correct much of the inaccurate content, or "fake news," present in these platforms. This paper develops a method for automating fake news detection on Twitter by learning to predict accuracy assessments in two credibility-focused Twitter datasets: CREDBANK, a crowdsourced dataset of accuracy assessments for events in Twitter, and PHEME, a dataset of potential rumors in Twitter and journalistic assessments of their accuracies. We apply this method to Twitter content sourced from BuzzFeed's fake news dataset and show models trained against crowdsourced workers outperform models based on journalists' assessment and models trained on a pooled dataset of both crowdsourced workers and journalists. All three datasets, aligned into a uniform format, are also publicly available. A feature analysis then identifies features that are most predictive for crowdsourced and journalistic accuracy assessments, results of which are consistent with prior work. We close with a discussion contrasting accuracy and credibility and why models of non-experts outperform models of journalists for fake news detection in Twitter.
Pages: 208-215
Page count: 8