Would Your Tweet Invoke Hate on the Fly? Forecasting Hate Intensity of Reply Threads on Twitter

被引:12
作者
Dahiya, Snehil [1 ]
Sharma, Shalini [1 ]
Sahnan, Dhruv [1 ]
Goel, Vasu [1 ]
Chouzenoux, Emilie [2 ]
Elvira, Victor [3 ]
Majumdar, Angshul [1 ]
Bandhakavi, Anil [4 ]
Chakraborty, Tanmoy [1 ]
机构
[1] IIIT Delhi, Delhi, India
[2] Univ Paris Saclay, Inria Saclay, Paris, France
[3] Univ Edinburgh, Edinburgh, Midlothian, Scotland
[4] Logically, Portland, ME USA
来源
KDD '21: PROCEEDINGS OF THE 27TH ACM SIGKDD CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING | 2021年
关键词
Hate speech; Signal processing; State-space model; Time series forecasting; ALGORITHM;
D O I
10.1145/3447548.3467150
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Curbing hate speech is undoubtedly a major challenge for online microblogging platforms like Twitter. While there have been studies around hate speech detection, it is not clear how hate speech finds its way into an online discussion. It is important for a content moderator to not only identify which tweet is hateful, but also to predict which tweet will be responsible for accumulating hate speech. This would help in prioritizing tweets that need constant monitoring. Our analysis reveals that for hate speech to manifest in an ongoing discussion, the source tweet may not necessarily be hateful; rather, there are plenty of such non-hateful tweets which gradually invoke hateful replies, resulting in the entire reply threads becoming provocative. In this paper, we define a novel problem - given a source tweet and a few of its initial replies, the task is to forecast the hate intensity of upcoming replies. To this end, we curate a novel dataset constituting similar to 4.5k contemporary tweets and their entire reply threads. Our preliminary analysis confirms that the evolution patterns along time of hate intensity among reply threads have highly diverse patterns, and there is no significant correlation between the hate intensity of the source tweets and that of their reply threads. We employ seven state-of-the-art dynamic models (either statistical signal processing or deep learning based) and show that they fail badly to forecast the hate intensity. We then propose DESSERT, a novel deep state-space model that leverages the function approximation capability of deep neural networks with the capacity to quantify the uncertainty of statistical signal processing models. Exhaustive experiments and ablation study show that DESSERT outperforms all the baselines substantially. Further, its deployment in an advanced AI platform designed to monitor real-world problematic hateful content has improved the aggregated insights extracted for countering the spread of online harms.
引用
收藏
页码:2732 / 2742
页数:11
相关论文
共 40 条
[1]   Deep Learning for Hate Speech Detection in Tweets [J].
Badjatiya, Pinkesh ;
Gupta, Shashank ;
Gupta, Manish ;
Varma, Vasudeva .
WWW'17 COMPANION: PROCEEDINGS OF THE 26TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB, 2017, :759-760
[2]   Examining Untempered Social Media: Analyzing Cascades of Polarized Conversations [J].
Bagavathi, Arunkumar ;
Bashiri, Pedram ;
Reid, Shannon ;
Phillips, Matthew ;
Krishnan, Siddharth .
PROCEEDINGS OF THE 2019 IEEE/ACM INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING (ASONAM 2019), 2019, :625-632
[3]  
Bengio Y., 2019, ARXIV190510437, P1
[4]   INTERVENTION ANALYSIS WITH APPLICATIONS TO ECONOMIC AND ENVIRONMENTAL PROBLEMS [J].
BOX, GEP ;
TIAO, GC .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1975, 70 (349) :70-79
[5]  
Chouzenoux É, 2020, INT CONF ACOUST SPEE, P5840, DOI [10.1109/ICASSP40776.2020.9053646, 10.1109/icassp40776.2020.9053646]
[6]   A Stochastic Majorize-Minimize Subspace Algorithm for Online Penalized Least Squares Estimation [J].
Chouzenoux, Emilie ;
Pesquet, Jean-Christophe .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2017, 65 (18) :4770-4783
[7]  
Chung YL, 2019, 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), P2819
[8]  
Davidson T., 2017, P INT AAAI C WEB SOC, VVolume 11, P512, DOI [10.1609/icwsm.v11i1.14955, DOI 10.1609/ICWSM.V11I1.14955]
[9]  
Do Hang Thi-Thuy, 2019, AEXIV191103648
[10]  
Elsworth S., 2020, ARXIV200305672