Multi-class hate speech detection in the Norwegian language using FAST-RNN and multilingual fine-tuned transformers

Cited by: 12

Authors:

Hashmi, Ehtesham [1]

Yayilgan, Sule Yildirim [1]

Affiliations:

[1] Norwegian Univ Sci & Technol NTNU, Dept Informat Secur & Commun Technol IIK, Teknol Vegen 22, N-2815 Gjovik, Innlandet, Norway

Source:

COMPLEX & INTELLIGENT SYSTEMS | 2024 / Vol. 10 / Issue 03

Keywords:

Hate speech; Norwegian language; Natural language processing; Deep Learning; Transformers; Interpretability modeling;

DOI:

10.1007/s40747-024-01392-5

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

The growth of social networks has provided a platform for individuals with prejudiced views, allowing them to spread hate speech and target others based on their gender, ethnicity, religion, or sexual orientation. While positive interactions within diverse communities can considerably enhance confidence, it is critical to recognize that negative comments can hurt people's reputations and well-being. This trend emphasizes the need for more diligent monitoring and robust policies on these platforms to protect individuals from such discriminatory and harmful behavior. Hate speech is often characterized as an intentional act of aggression directed at a specific group, typically meant to harm or marginalize them based on certain aspects of their identity. Most of the research related to hate speech has been conducted in resource-rich languages like English, Spanish, and French. However, low-resource languages, such as Irish, Norwegian, Portuguese, Polish, Slovak, and many South Asian languages, present challenges due to limited linguistic resources, making information extraction labor-intensive. In this study, we present deep neural networks with FastText word embeddings using regularization methods for multi-class hate speech detection in the Norwegian language, along with the implementation of multilingual transformer-based models with hyperparameter tuning and generative configuration. FastText outperformed other deep learning models when stacked with Bidirectional LSTM and GRU, resulting in the FAST-RNN model. In the concluding phase, we compare our results with the state-of-the-art and perform interpretability modeling using Local Interpretable Model-Agnostic Explanations to achieve a more comprehensive understanding of the model's decision-making mechanisms.

Pages: 4535-4556

Page count: 22
