Retweet communities reveal the main sources of hate speech

被引:13
作者
Evkoski, Bojan [1 ,2 ]
Pelicon, Andraz [1 ,2 ]
Mozetic, Igor [1 ]
Ljubesic, Nikola [1 ,3 ]
Novak, Petra Kralj [1 ]
机构
[1] Jozef Stefan Inst, Dept Knowledge Technol, Ljubljana, Slovenia
[2] Jozef Stefan Int Postgrad Sch, Ljubljana, Slovenia
[3] Univ Ljubljana, Fac Informat & Commun Sci, Ljubljana, Slovenia
来源
PLOS ONE | 2022年 / 17卷 / 03期
关键词
SOCIAL MEDIA;
D O I
10.1371/journal.pone.0265602
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
We address a challenging problem of identifying main sources of hate speech on Twitter. On one hand, we carefully annotate a large set of tweets for hate speech, and deploy advanced deep learning to produce high quality hate speech classification models. On the other hand, we create retweet networks, detect communities and monitor their evolution through time. This combined approach is applied to three years of Slovenian Twitter data. We report a number of interesting results. Hate speech is dominated by offensive tweets, related to political and ideological issues. The share of unacceptable tweets is moderately increasing with time, from the initial 20% to 30% by the end of 2020. Unacceptable tweets are retweeted significantly more often than acceptable tweets. About 60% of unacceptable tweets are produced by a single right-wing community of only moderate size. Institutional Twitter accounts and media accounts post significantly less unacceptable tweets than individual accounts. In fact, the main sources of unacceptable tweets are anonymous accounts, and accounts that were suspended or closed during the years 2018-2020.
引用
收藏
页数:22
相关论文
共 55 条
  • [1] A comparison of extrinsic clustering evaluation metrics based on formal constraints
    Amigo, Enrique
    Gonzalo, Julio
    Artiles, Javier
    Verdejo, Felisa
    [J]. INFORMATION RETRIEVAL, 2009, 12 (04): : 461 - 486
  • [2] [Anonymous], 2017, P 2017 C EMPIRICAL M
  • [3] Antin S.J., 2012, P ACM ANN C HUMAN FA, DOI DOI 10.1145/2207676.2208610
  • [4] Basile Valerio, 2019, P 13 INT WORKSH SEM, P54, DOI 10.18653/v1/S19-2007
  • [5] Bayer J., 2020, HATE SPEECH HATE CRI
  • [6] Benavoli A, 2017, J MACH LEARN RES, V18
  • [7] Fast unfolding of communities in large networks
    Blondel, Vincent D.
    Guillaume, Jean-Loup
    Lambiotte, Renaud
    Lefebvre, Etienne
    [J]. JOURNAL OF STATISTICAL MECHANICS-THEORY AND EXPERIMENT, 2008,
  • [8] Twitter mood predicts the stock market
    Bollen, Johan
    Mao, Huina
    Zeng, Xiaojun
    [J]. JOURNAL OF COMPUTATIONAL SCIENCE, 2011, 2 (01) : 1 - 8
  • [9] Bretschneider U., 2016, Detecting cyberbullying in online communities
  • [10] Retweet networks of the European Parliament: evaluation of the community structure
    Cherepnalkoski D.
    Mozetič I.
    [J]. Applied Network Science, 2016, 1 (01)