Learning to Detect Online Harassment on Twitter with the Transformer

被引:11
作者
Bugueno, Margarita [1 ]
Mendoza, Marcelo [1 ]
机构
[1] Univ Tecn Federico Santa Maria, Dept Informat, Inst Milenio Fundamentos Los Datos, Santiago, Chile
来源
MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2019, PT II | 2020年 / 1168卷
关键词
Harassment detection; Self-attention models; Social media;
D O I
10.1007/978-3-030-43887-6_23
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper describes our submission to the SIMAH challenge (SocIaL Media And Harassment). The proposed competition addresses the challenge of harassment detection on Twitter posts as well as the identification of a harassment category. Automatically detecting content containing harassment could be the basis for removing it. Accordingly, the task is considered to be an essential step to distinguishing different types of harassment provides the means to control such a mechanism in a fine-grained way. In this work, we classify a set of Twitter posts into non-harassment or harassment tweets where the last ones are classified as indirect harassment, sexual harassment, or physical harassment. We explore how to use self-attention models for harassment classification in order to combine different baselines' outputs. For a given post, we use the transformer architecture to encode each baseline output exploiting relationships between baselines and posts. Then, the transformer learns how to combine the outputs of these methods with a BERT representation of the post, reaching a macro-averaged F-score of 0.481 on the SIMAH test set.
引用
收藏
页码:298 / 306
页数:9
相关论文
共 20 条
[1]   Hate Speech Detection is Not as Easy as You May Think: A Closer Look at Model Validation [J].
Arango, Ayme ;
Perez, Jorge ;
Poblete, Barbara .
PROCEEDINGS OF THE 42ND INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '19), 2019, :45-53
[2]   Deep Learning for Hate Speech Detection in Tweets [J].
Badjatiya, Pinkesh ;
Gupta, Shashank ;
Gupta, Manish ;
Varma, Vasudeva .
WWW'17 COMPANION: PROCEEDINGS OF THE 26TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB, 2017, :759-760
[3]   Mean Birds: Detecting Aggression and Bullying on Twitter [J].
Chatzakou, Despoina ;
Kourtellis, Nicolas ;
Blackburn, Jeremy ;
De Cristofaro, Emiliano ;
Stringhini, Gianluca ;
Vakali, Athena .
PROCEEDINGS OF THE 2017 ACM WEB SCIENCE CONFERENCE (WEBSCI '17), 2017, :13-22
[4]  
Davidson T., 2017, ICWSM, P512
[5]  
Devlin J., 2018, Annual Conference of the North American Chapter of the ACL
[6]   A Survey on Automatic Detection of Hate Speech in Text [J].
Fortuna, Paula ;
Nunes, Sergio .
ACM COMPUTING SURVEYS, 2018, 51 (04)
[7]  
Gamback B., 2017, P 1 WORKSHOP ABUSIVE, P85, DOI DOI 10.18653/V1/W17-3013
[8]  
Hess A., 2014, Pacific Standard
[9]  
Jha A., 2017, P 2 WORKSH NLP COMP, P7
[10]  
Papegnies Etienne, 2017, Statistical Language and Speech Processing. 5th International Conference, SLSP. Proceedings: LNAI 10583, P70, DOI 10.1007/978-3-319-68456-7_6