Predicting information credibility in time-sensitive social media

Cited by: 214
Authors
Castillo, Carlos [1]
Mendoza, Marcelo [2]
Poblete, Barbara [3]
Affiliations
[1] Qatar Comp Res Inst, Doha, Qatar
[2] Univ Tecn Federico Santa Maria, Santiago, Chile
[3] Univ Chile, Dept Comp Sci, Santiago, Chile
Keywords
Information credibility; Online social networks; Model transfer; Time sensitiveness; Social media prediction; Perceptions
DOI
10.1108/IntR-05-2012-0095
Chinese Library Classification (CLC) number
F [Economics]
Discipline classification code
02
Abstract
Purpose - Twitter is a popular microblogging service that has proven, in recent years, its potential for propagating news and information about developing events. This paper analyzes information credibility on Twitter and aims to establish whether relevant and credible news events can be discovered automatically.
Design/methodology/approach - The paper follows a supervised learning approach to the automatic classification of credible news events. A first classifier decides whether an information cascade corresponds to a newsworthy event; a second classifier then decides whether that cascade can be considered credible. Both classifiers are trained on a significant amount of labeled data obtained using crowdsourcing tools. The paper validates them under two settings: first, on a sample of automatically detected Twitter "trends" in English; second, by testing how well the model transfers to Twitter topics in Spanish, automatically detected during a natural disaster.
Findings - There are measurable differences in the way microblog messages propagate. The paper shows that these differences are related to the newsworthiness and credibility of the information conveyed, and describes features that are effective for automatically classifying information as credible or not credible.
Originality/value - The paper first tests the approach under normal conditions and then extends the findings to a disaster management situation, where many news items and rumors arise. Additionally, by analyzing the transfer of the classifiers across languages, the paper is able to look more deeply into which topic features are most relevant for credibility assessment. To the best of our knowledge, this is the first paper that studies the predictive power of social media for information credibility while considering model transfer into time-sensitive and language-sensitive contexts.
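The two-stage design described in the abstract (a newsworthiness classifier followed by a credibility classifier) can be illustrated with a minimal Python sketch. Everything concrete here is an assumption made for illustration and is not taken from the paper: the nearest-centroid learner, the two feature names (share of messages with a URL, share with a question mark), the label strings, and the toy training rows. The abstract does not state which learning algorithm or features the authors used.

```python
from dataclasses import dataclass
from typing import Dict, List, Sequence

Features = Sequence[float]


@dataclass
class NearestCentroid:
    """Toy supervised classifier (an assumption, not the paper's model):
    predicts the label whose training centroid is closest to the input."""
    centroids: Dict[str, List[float]]

    @classmethod
    def fit(cls, X: Sequence[Features], y: Sequence[str]) -> "NearestCentroid":
        sums: Dict[str, List[float]] = {}
        counts: Dict[str, int] = {}
        for row, label in zip(X, y):
            acc = sums.setdefault(label, [0.0] * len(row))
            for i, value in enumerate(row):
                acc[i] += value
            counts[label] = counts.get(label, 0) + 1
        # Average the per-label sums to obtain one centroid per label.
        return cls({lab: [s / counts[lab] for s in acc] for lab, acc in sums.items()})

    def predict(self, row: Features) -> str:
        def dist2(centroid: List[float]) -> float:
            return sum((a - b) ** 2 for a, b in zip(row, centroid))
        return min(self.centroids, key=lambda lab: dist2(self.centroids[lab]))


def classify_cascade(row: Features, news_clf: NearestCentroid,
                     cred_clf: NearestCentroid) -> str:
    """Stage 1 filters out non-newsworthy cascades; stage 2 runs only on the rest."""
    if news_clf.predict(row) != "news":
        return "not_news"
    return cred_clf.predict(row)


# Hypothetical features per cascade: [share_with_url, share_with_question_mark].
# Toy rows standing in for crowdsourced labels; not data from the paper.
news_clf = NearestCentroid.fit(
    [[0.8, 0.1], [0.7, 0.2], [0.1, 0.6], [0.2, 0.7]],
    ["news", "news", "chat", "chat"],
)
# The credibility stage is trained only on cascades labeled as news.
cred_clf = NearestCentroid.fit(
    [[0.9, 0.05], [0.8, 0.1], [0.6, 0.4], [0.7, 0.35]],
    ["credible", "credible", "not_credible", "not_credible"],
)

for cascade in ([0.85, 0.1], [0.1, 0.7], [0.65, 0.4]):
    print(cascade, classify_cascade(cascade, news_clf, cred_clf))
```

Chaining the classifiers means the credibility model only ever sees cascades that passed the newsworthiness filter, which mirrors the order of decisions described in the abstract. Cross-language transfer would amount to applying classifiers fitted on one language's cascades to feature vectors computed from another's.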
Pages: 560-588
Page count: 29