Detecting Harmful Content on Online Platforms: What Platforms Need vs. Where Research Efforts Go

被引:24
作者
Arora, Arnav [1 ]
Nakov, Preslav [1 ]
Hardalov, Momchil [2 ]
Sarwar, Sheikh Muhammad [1 ]
Nayak, Vibha [1 ]
Dinkov, Yoan [2 ]
Zlatkova, Dimitrina [2 ]
Dent, Kyle [1 ]
Bhatawdekar, Ameya [1 ]
Bouchard, Guillaume [1 ]
Augenstein, Isabelle [1 ]
机构
[1] Checkstep Res, London, England
[2] Checkstep Res, Sofia, Bulgaria
关键词
Online harms; content moderation; hate speech; offensive language; bullying and harassment; misinformation; spam; violence; graphic content; sexual abuse; self-harm; HATE SPEECH; REPRESENTATIONS; COVID-19; TWITTER; TROLLS; DARK;
D O I
10.1145/3603399
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The proliferation of harmful content on online platforms is a major societal problem, which comes in many different forms, including hate speech, offensive language, bullying and harassment, misinformation, spam, violence, graphic content, sexual abuse, self-harm, and many others. Online platforms seek to moderate such content to limit societal harm, to comply with legislation, and to create a more inclusive environment for their users. Researchers have developed different methods for automatically detecting harmful content, often focusing on specific sub-problems or on narrow communities, as what is considered harmful often depends on the platform and on the context. We argue that there is currently a dichotomy between what types of harmful content online platforms seek to curb, and what research efforts there are to automatically detect such content. We thus survey existing methods as well as content moderation policies by online platforms in this light and suggest directions for future work.
引用
收藏
页数:17
相关论文
共 110 条
[1]   Malicious accounts: Dark of the social networks [J].
Adewole, Kayode Sakariyah ;
Anuar, Nor Badrul ;
Kamsin, Amirrudin ;
Varathan, Kasturi Dewi ;
Razak, Syed Abdul .
JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2017, 79 :41-67
[2]  
Alam F, 2021, FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2021, P611
[3]   If it looks like a spammer and behaves like a spammer, it must be a spammer: analysis and detection of microblogging spam accounts [J].
Almaatouq, Abdullah ;
Shmueli, Erez ;
Nouh, Mariam ;
Alabdulkareem, Ahmad ;
Singh, Vivek K. ;
Alsaleh, Mansour ;
Alarifi, Abdulrahman ;
Alfaris, Anas ;
Pentland, Alex 'Sandy' .
INTERNATIONAL JOURNAL OF INFORMATION SECURITY, 2016, 15 (05) :475-491
[4]  
[Anonymous], 2015, P 19 C COMP NAT LANG
[5]  
[Anonymous], 2021, BBC News
[6]  
[Anonymous], 2022, FINDINGS ASS COMPUTA, P1572
[7]  
[Anonymous], 2015, P INT C REC ADV NLP
[8]  
Atanasov A., 2019, P 23 C COMPUTATIONAL, P1023, DOI [10.18653/v1/K19-1096, DOI 10.18653/V1/K19-1096]
[9]  
Barker K., 2019, Online Harms White Paper Consultation Response
[10]  
Basile V., 2019, P 13 INT WORKSH SEM, P54