How to Detect Online Hate towards Migrants and Refugees? Developing and Evaluating a Classifier of Racist and Xenophobic Hate Speech Using Shallow and Deep Learning

Cited by: 11
Authors
Arcila-Calderon, Carlos [1]
Amores, Javier J. [1]
Sanchez-Holgado, Patricia [1]
Vrysis, Lazaros [2]
Vryzas, Nikolaos [2]
Alonso, Martin Oller [3]
Affiliations
[1] Univ Salamanca, Fac Ciencias Sociales, Campus Unamuno, Salamanca 37007, Spain
[2] Aristotle Univ Thessaloniki, Multidisciplinary Media & Mediated Commun Res Grp, Thessaloniki 54124, Greece
[3] Univ Milan, Dept Social & Polit Sci, I-20122 Milan, Italy
Keywords
hate speech; racism; xenophobia; migration; social media; deep learning
DOI
10.3390/su142013094
Chinese Library Classification number
X [Environmental Science, Safety Science]
Discipline classification code
08; 0830
Abstract
Hate speech spreading online is a matter of growing concern, since social media allows for its rapid, uncontrolled, and massive dissemination. For this reason, several researchers are already developing prototypes that detect cyberhate automatically and on a large scale. However, most of these prototypes detect hate only in English, and very few focus specifically on racism and xenophobia, the category of discrimination in which the most hate crimes are recorded each year. In addition, ad hoc datasets manually generated by several trained coders are rarely used in developing these prototypes, since almost all researchers rely on already available datasets. The objective of this research is to overcome the limitations of those previous works by developing and evaluating classification models capable of detecting racist and/or xenophobic hate speech spread online, first in Spanish, and later in Greek and Italian. In developing these prototypes, three distinct machine learning strategies are tested. First, various traditional shallow learning algorithms are used. Second, deep learning is used, specifically an ad hoc RNN model. Finally, a BERT-based model is developed in which transformers and neural networks are used. The results confirm that deep learning strategies perform better in detecting anti-immigration hate speech online. For this reason, the deep architectures were the ones subsequently improved and tested for hate speech detection in Greek and Italian and on multisource data. The results of this study represent an advance in the scientific literature in this field, since until now no online anti-immigration hate detectors had been tested in these languages using this type of deep architecture.
Pages: 16