Multi-class hate speech detection in the Norwegian language using FAST-RNN and multilingual fine-tuned transformers

被引:12
作者
Hashmi, Ehtesham [1 ]
Yayilgan, Sule Yildirim [1 ]
机构
[1] Norwegian Univ Sci & Technol NTNU, Dept Informat Secur & Commun Technol IIK, Teknol Vegen 22, N-2815 Gjovik, Innlandet, Norway
关键词
Hate speech; Norwegian language; Natural language processing; Deep Learning; Transformers; Interpretability modeling;
D O I
10.1007/s40747-024-01392-5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The growth of social networks has provided a platform for individuals with prejudiced views, allowing them to spread hate speech and target others based on their gender, ethnicity, religion, or sexual orientation. While positive interactions within diverse communities can considerably enhance confidence, it is critical to recognize that negative comments can hurt people's reputations and well-being. This emergence emphasizes the need for more diligent monitoring and robust policies on these platforms to protect individuals from such discriminatory and harmful behavior. Hate speech is often characterized as an intentional act of aggression directed at a specific group, typically meant to harm or marginalize them based on certain aspects of their identity. Most of the research related to hate speech has been conducted in resource-aware languages like English, Spanish, and French. However, low-resource European languages, such as Irish, Norwegian, Portuguese, Polish, Slovak, and many South Asian, present challenges due to limited linguistic resources, making information extraction labor-intensive. In this study, we present deep neural networks with FastText word embeddings using regularization methods for multi-class hate speech detection in the Norwegian language, along with the implementation of multilingual transformer-based models with hyperparameter tuning and generative configuration. FastText outperformed other deep learning models when stacked with Bidirectional LSTM and GRU, resulting in the FAST-RNN model. In the concluding phase, we compare our results with the state-of-the-art and perform interpretability modeling using Local Interpretable Model-Agnostic Explanations to achieve a more comprehensive understanding of the model's decision-making mechanisms.
引用
收藏
页码:4535 / 4556
页数:22
相关论文
共 75 条
  • [21] Elzayady H, 2023, Int J Electric Comput Eng, V13, P1979
  • [22] Stigmatization in social media: Documenting and analyzing hate speech for COVID-19 on Twitter
    Fan L.
    Yu H.
    Yin Z.
    [J]. Proceedings of the Association for Information Science and Technology, 2020, 57 (01)
  • [23] Fersini E., 2018, CEUR WORKSHOP P, P1
  • [24] Founta A., 2018, P INT AAAI C WEB SOC, V12
  • [25] A Unified Deep Learning Architecture for Abuse Detection
    Founta, Antigoni-Maria
    Chatzakou, Despoina
    Kourtellis, Nicolas
    Blackburn, Jeremy
    Vakali, Athena
    Leontiadis, Ilias
    [J]. PROCEEDINGS OF THE 11TH ACM CONFERENCE ON WEB SCIENCE (WEBSCI'19), 2019, : 105 - 114
  • [26] Gagliardone I ..., 2015, Countering Online Hate Speech
  • [27] Garibo i Orts Oscar, 2019, P 13 INT WORKSH SEM, P460
  • [28] HATE SPEECH DETECTION IN LOW-RESOURCE BODO AND ASSAMESE TEXTS WITH ML-DL AND BERT MODELS
    Ghosh, Koyel
    Senapati, Apurbalal
    Narzary, Mwnthai
    Brahma, Maharaj
    [J]. SCALABLE COMPUTING-PRACTICE AND EXPERIENCE, 2023, 24 (04): : 941 - 955
  • [29] Different systems, similar challenges: humor and free speech in the United States and Europe
    Godioli, Alberto
    Little, Laura E.
    [J]. HUMOR-INTERNATIONAL JOURNAL OF HUMOR RESEARCH, 2022, 35 (03): : 305 - 327
  • [30] Gomez Martin V., 2023, Crisis of the criminal law in the democratic constitutional state: manifestations and trends, P119