Do Rumors Diffuse Differently from Non-rumors? A Systematically Empirical Analysis in Sina Weibo for Rumor Identification

被引:18
作者
Liu, Yahui [1 ]
Jin, Xiaolong [1 ]
Shen, Huawei [1 ]
Cheng, Xueqi [1 ]
机构
[1] Chinese Acad Sci, Inst Comp Technol, CAS Key Lab Network Data Sci & Technol, Beijing, Peoples R China
来源
ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2017, PT I | 2017年 / 10234卷
基金
中国国家自然科学基金;
关键词
Rumor identification; Diffusion tree; Sina Weibo;
D O I
10.1007/978-3-319-57454-7_32
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
With the prosperity of social media, online rumors become a severe social problem, which often lead to serious consequences, e.g., social panic and even chaos. Therefore, how to automatically identify rumors in social media has attracted much research attention. Most existing studies address this problem by extracting features from the contents of rumors and their reposts as well as the users involved. For these features, especially diffusion features, these works ignore systematic analysis and the exploration of difference between rumors and non-rumors, which exert targeted effect on rumor identification. In this paper, we systematically investigate this problem from a diffusion perspective using Sina Weibo data. We first extract a group of new features from the diffusion processes of messages and then make a few important observations on them. Based on these features, we develop classifiers to discriminate rumors and non-rumors. Experimental comparisons with the state-ofthe-arts methods demonstrate the effectiveness of these features.
引用
收藏
页码:407 / 420
页数:14
相关论文
共 24 条
  • [1] [Anonymous], 2014, ICONFERENCE 2014 P, DOI [DOI 10.9776/14308, 10.9776/14308]
  • [2] Bao P., 2013, PLOS ONE, V8, P1
  • [3] Bao P, 2013, PROCEEDINGS OF THE 22ND INTERNATIONAL CONFERENCE ON WORLD WIDE WEB (WWW'13 COMPANION), P177
  • [4] The power of a good idea:: Quantitative modeling of the spread of ideas from epidemiological models
    Bettencourt, LMA
    Cintrón-Arias, A
    Kaiser, DI
    Castillo-Chávez, C
    [J]. PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2006, 364 : 513 - 536
  • [5] Castillo C., 2011, P 20 INT C WORLD WID, P675, DOI [DOI 10.1145/1963405.1963500, 10.1145/1963405.1963500]
  • [6] SMOTE: Synthetic minority over-sampling technique
    Chawla, Nitesh V.
    Bowyer, Kevin W.
    Hall, Lawrence O.
    Kegelmeyer, W. Philip
    [J]. 2002, American Association for Artificial Intelligence (16)
  • [7] Can Cascades be Predicted?
    Cheng, Justin
    Adamic, Lada A.
    Dow, P. Alex
    Kleinberg, Jon
    Leskovec, Jure
    [J]. WWW'14: PROCEEDINGS OF THE 23RD INTERNATIONAL CONFERENCE ON WORLD WIDE WEB, 2014, : 925 - 935
  • [8] Friggeri A., 2014, P 8 INT AAAI C WEBL
  • [9] Japkowicz N., 2002, Intelligent Data Analysis, V6, P429
  • [10] Prominent Features of Rumor Propagation in Online Social Media
    Kwon, Sejeong
    Cha, Meeyoung
    Jung, Kyomin
    Chen, Wei
    Wang, Yajun
    [J]. 2013 IEEE 13TH INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2013, : 1103 - 1108