Enhancing social network hate detection using back translation and GPT-3 augmentations during training and test-time

Cited by: 14
Authors
Cohen, Seffi [1]
Presil, Dan [1]
Katz, Or [1]
Arbili, Ofir [1]
Messica, Shvat [1]
Rokach, Lior [1]
Affiliations
[1] Ben-Gurion University of the Negev, Department of Software and Information Systems Engineering, IL-8410501 Beer Sheva, Israel
Keywords
Hate-detection; TTA; Back-translation; GPT
DOI
10.1016/j.inffus.2023.101887
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Social media platforms have become an essential means of communication, but they also serve as a breeding ground for hateful content. Detecting hate speech accurately is challenging due to factors such as slang and implicit hate speech. In response to these challenges, this paper presents a novel ensemble approach utilizing DeBERTa models, integrating back-translation and GPT-3 augmentation techniques during both training and test time. This method aims to address the complexities associated with detecting hate speech, resulting in more robust and accurate results. Our findings indicate that the proposed approach significantly enhances hate speech detection performance across various metrics and models in both the Parler and GAB datasets. For reproducibility and further exploration, our code is publicly available at https://github.com/OrKatz7/parler-hate-speech.
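The abstract mentions applying augmentations at test time as well as during training. As a minimal sketch of the general idea of test-time augmentation (TTA), and not the authors' implementation, the snippet below averages a classifier's score over the original text and its augmented variants. The names `tta_predict`, `toy_score`, `toy_back_translate`, and `toy_paraphrase` are hypothetical stand-ins introduced only for illustration; in practice the scorer would be a trained model and the augmenters would be a back-translation system and a generative paraphraser.

```python
from statistics import mean
from typing import Callable, Sequence


def tta_predict(
    text: str,
    score: Callable[[str], float],
    augmenters: Sequence[Callable[[str], str]],
) -> float:
    """Average the score over the original text and each augmented variant."""
    variants = [text] + [augment(text) for augment in augmenters]
    return mean(score(variant) for variant in variants)


# Hypothetical stand-ins (assumptions, not from the paper): a keyword
# "classifier" and two toy augmenters that mimic back-translation and
# paraphrasing without calling any external service.
def toy_score(text: str) -> float:
    return 0.9 if "hate" in text.lower() else 0.1


def toy_back_translate(text: str) -> str:
    return text.replace("hate", "despise")


def toy_paraphrase(text: str) -> str:
    return text.upper()


probability = tta_predict(
    "I hate them", toy_score, [toy_back_translate, toy_paraphrase]
)
```

Averaging is only one way to combine the per-variant scores; the paper may weight or ensemble them differently, so the linked repository is the place to check the actual procedure.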
Pages: 9