Sparse Shield: Social Network Immunization vs. Harmful Speech

被引:24
作者
Petrescu, Alexandru [1 ]
Truica, Ciprian-Octavian [1 ]
Apostol, Elena-Simona [1 ]
Karras, Panagiotis [2 ]
机构
[1] Univ Politeh Bucharest, Bucharest, Romania
[2] Aarhus Univ, Aarhus, Denmark
来源
PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, CIKM 2021 | 2021年
关键词
network immunization; harmful speech detection; preventive immunization; counteractive immunization;
D O I
10.1145/3459637.3482481
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
With the rise of social media users and the general shift of communication from traditional media to online platforms, the spread of harmful content (e.g., hate speech, misinformation, fake news) has been exacerbated. Harmful content in the form of hate speech causes a person distress or harm, having a negative impact on the individual mental health, with even more detrimental effects on the psychology of children and teenagers. In this paper, we propose an end-to-end solution with real-time capabilities to detect harmful content in real-time and mitigate its spread over the network. Our main contribution is Sparse Shield, a novel method that out-scales existing state-of-the-art methods for network immunization. We also propose a novel architecture for harmful speech mitigation that maximizes the impact of immunization. Our solution aims to identify a set of users for which to move harmful content at the bottom of the user feed, rather than censoring users. By immunizing certain network nodes in this manner, we minimize the negative impact on the network and minimize the interference with and limitation of individual freedoms: the information is not hidden but rather not as easy to reach without an explicit search. Our analysis is based on graphs built on real-world data collected from Twitter; these graphs reflect real user behavior. We perform extensive scalability experiments to prove the superiority of our method over existing state-of-the-art network immunization techniques. We also perform extensive experiments to showcase that Sparse Shield outperforms existing techniques on the task of harmful speech mitigation on a real-world dataset.
引用
收藏
页码:1426 / 1436
页数:11
相关论文
共 44 条
[1]   Combinatorial trace method for network immunization [J].
Ahmad, Muhammad ;
Ali, Sarwan ;
Tariq, Juvaria ;
Khan, Imdadullah ;
Shabbir, Mudassir ;
Zaman, Arif .
INFORMATION SCIENCES, 2020, 519 :215-228
[2]  
Ahmad Muhammad, 2017, Australasian Journal of Information Systems, V21, P1, DOI DOI 10.3127/AJIS.V21I0.1563
[3]  
Alonso P, 2020, PROCEEDINGS OF THE FOURTEENTH WORKSHOP ON SEMANTIC EVALUATION, P2197
[4]  
Ananthakrishnan Uttara M., 2021, SSRN ELECT J, P1, DOI [10.2139/ssrn.3793801, DOI 10.2139/SSRN.3793801]
[5]  
[Anonymous], 2015, ACS SYM SER
[6]  
[Anonymous], **DATA OBJECT**, DOI DOI 10.5441/002/EDBT.2019.97
[7]   Deep Learning for Hate Speech Detection in Tweets [J].
Badjatiya, Pinkesh ;
Gupta, Shashank ;
Gupta, Manish ;
Varma, Vasudeva .
WWW'17 COMPANION: PROCEEDINGS OF THE 26TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB, 2017, :759-760
[8]   Problematic Machine Behavior: A Systematic Literature Review of Algorithm Audits [J].
Bandy J. .
Proceedings of the ACM on Human-Computer Interaction, 2021, 5 (CSCW1)
[9]  
Banko M., 2020, P 4 WORKSH ONL AB HA, P125, DOI [DOI 10.18653/V1/2020.ALW-1.16, 10.18653/v1/2020.alw-1.16]
[10]   Hate Speech Epidemic. The Dynamic Effects of Derogatory Language on Intergroup Relations and Political Radicalization [J].
Bilewicz, Michal ;
Soral, Wiktor .
POLITICAL PSYCHOLOGY, 2020, 41 :3-33