Detection of Offensive Language and ITS Severity for Low Resource Language

被引:7
作者
Saeed, Ramsha [1 ]
Afzal, Hammad [1 ]
Rauf, Sadaf Abdul [2 ]
Iltaf, Naima [1 ]
机构
[1] NUST, H-12, Islamabad 46000, Pakistan
[2] FJWU, Dept Comp Sci, Kachari Chowk 46000, Rawalpindi, Pakistan
关键词
Hate speech; long short-term memory; Urdu NLP; convolutional neural network; BERT; HATE-SPEECH; IDENTIFICATION; TWITTER;
D O I
10.1145/3580476
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Continuous proliferation of hate speech in different languages on social media has drawn significant attention from researchers in the past decade. Detecting hate speech is indispensable irrespective of the scale of use of language, as it inflicts huge harm on society. This work presents a first resource for classifying the severity of hate speech in addition to classifying offensive and hate speech content. Current research mostly limits hate speech classification to its primary categories, such as racism, sexism, and hatred of religions. However, hate speech targeted at different protected characteristics also manifests in different forms and intensities. It is important to understand varying severity levels of hate speech so that the most harmful cases of hate speech may be identified and dealt with earlier than the less harmful ones. In this work, we focus on detecting offensive speech, hate speech, and multiple levels of hate speech in the Urdu language. We investigate three primary target categories of hate speech: religion, racism, and national origin. We further divide these categories into levels based on the severity of hate conveyed. The severity levels are referred to as symbolization, insult, and attribution. A corpus comprising more than 20,000 tweets against the corresponding hate speech categories and severity levels is collected and annotated. A comprehensive experimentation scheme is applied using traditional as well as deep learning-based models to examine their impact on hate speech detection. The highest macro-averaged F-score yielded for detecting offensive speech is 86% while the highest F-scores for detecting hate speech with respect to ethnicity, national origin, and religious affiliation are 80%, 81%, and 72%, respectively. This shows that results are very encouraging and would provide a lead towards further investigation in this domain.
引用
收藏
页数:27
相关论文
共 58 条
  • [1] Agarwal S, 2016, EUR INTELL SECUR INF, P124, DOI [10.1109/EISIC.2016.14, 10.1109/EISIC.2016.032]
  • [2] Automatic Detection of Offensive Language for Urdu and Roman Urdu
    Akhter, Muhammad Pervez
    Zheng Jiangbin
    Naqvi, Irfan Raza
    Abdelmajeed, Mohammed
    Sadiq, Muhammad Tariq
    [J]. IEEE ACCESS, 2020, 8 (08): : 91213 - 91226
  • [3] Akram Qurat-ul-Ain, 2009, P 7 WORKSH AS LANG R, P40, DOI DOI 10.3115/1690299.1690305
  • [4] Albadi N, 2018, 2018 IEEE/ACM INTERNATIONAL CONFERENCE ON ADVANCES IN SOCIAL NETWORKS ANALYSIS AND MINING (ASONAM), P69, DOI 10.1109/ASONAM.2018.8508247
  • [5] Alfina I, 2017, INT C ADV COMP SCI I, P233, DOI 10.1109/ICACSIS.2017.8355039
  • [6] [Anonymous], 2020, Twitter
  • [7] [Anonymous], 2020, YOUTUBE
  • [8] Automatic Identification and Classification of Misogynistic Language on Twitter
    Anzovino, Maria
    Fersini, Elisabetta
    Rosso, Paolo
    [J]. NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS (NLDB 2018), 2018, 10859 : 57 - 64
  • [9] Deep Learning for Hate Speech Detection in Tweets
    Badjatiya, Pinkesh
    Gupta, Shashank
    Gupta, Manish
    Varma, Vasudeva
    [J]. WWW'17 COMPANION: PROCEEDINGS OF THE 26TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB, 2017, : 759 - 760
  • [10] Bagdon Christopher, 2021, CLEF WORKING NOTES, P1822