A Survey of Offensive Language Detection for the Arabic Language

被引:34
|
作者
Husain, Fatemah [1 ]
Uzuner, Ozlem [2 ]
机构
[1] Kuwait Univ, Sabah AlSalem Univ City Alshadadiya, Coll Life Sci, Informat Sci Dept, POB 5969, Safat 13060, Kuwait
[2] George Mason Univ, 4400 Univ Dr,5359 Nguyen Engn Bldg,MSN 1G8, Fairfax, VA 22030 USA
关键词
Offensive language; literature review; natural language processing; Arabic language; machine learning; deep learning; ONLINE COMMUNICATION;
D O I
10.1145/3421504
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The use of offensive language in user-generated content is a serious problem that needs to be addressed with the latest technology. The field of Natural Language Processing (NLP) can support the automatic detection of offensive language. In this survey, we review previous NLP studies that cover Arabic offensive language detection. This survey investigates the state-of-the-art in offensive language detection for the Arabic language, providing a structured overview of previous approaches, including core techniques, tools, resources, methods, and main features used. This work also discusses the limitations and gaps of the previous studies. Findings from this survey emphasize the importance of investing further effort in detecting Arabic offensive language, including the development of benchmark resources and the invention of novel preprocessing and feature extraction techniques.
引用
收藏
页数:44
相关论文
共 50 条
  • [31] Offensive Language Detection for Low Resource Language Using Deep Sequence Model
    Khan, Anas Ali
    Iqbal, M. Hammad
    Nisar, Shibli
    Ahmad, Awais
    Iqbal, Waseem
    IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024, 11 (04) : 5210 - 5218
  • [32] Offensive language detection in low resource languages: A use case of Persian language
    Mozafari, Marzieh
    Mnassri, Khouloud
    Farahbakhsh, Reza
    Crespi, Noel
    PLOS ONE, 2024, 19 (06):
  • [33] Sentiment Classification Techniques For Arabic Language: A Survey
    Biltawi, Mariam
    Etaiwi, Wael
    Tedmori, Sara
    Hudaib, Amjad
    Awajan, Arafat
    2016 7TH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION SYSTEMS (ICICS), 2016, : 339 - 346
  • [34] Automatic Detection of Cyberbullying and Abusive Language in Arabic Content on Social Networks: A Survey
    Khairy, Marwa
    Mahmoud, Tarek M.
    Abd-El-Hafeez, Tarek
    AI IN COMPUTATIONAL LINGUISTICS, 2021, 189 : 156 - 166
  • [35] Automatic Detection of Offensive Language for Urdu and Roman Urdu
    Akhter, Muhammad Pervez
    Zheng Jiangbin
    Naqvi, Irfan Raza
    Abdelmajeed, Mohammed
    Sadiq, Muhammad Tariq
    IEEE ACCESS, 2020, 8 (08): : 91213 - 91226
  • [36] A New Corpus and Lexicon for Offensive Tamazight Language Detection
    Abainia, Kheireddine
    Kara, Kenza
    Hamouni, Tassadit
    PROCEEDINGS OF THE 7TH INTERNATIONAL WORKSHOP ON SOCIAL MEDIA WORLD SENSORS, SIDEWAYS 2022, 2022,
  • [37] SOD: A Corpus for Saudi Offensive Language Detection Classification
    Asiri, Afefa
    Saleh, Mostafa
    COMPUTERS, 2024, 13 (08)
  • [38] BERT-based Approach to Arabic Hate Speech and Offensive Language Detection in Twitter: Exploiting Emojis and Sentiment Analysis
    Althobaiti, Maha Jarallah
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (05) : 972 - 980
  • [39] The Arabic language
    Wright, O
    BULLETIN OF THE SCHOOL OF ORIENTAL AND AFRICAN STUDIES-UNIVERSITY OF LONDON, 2002, 65 : 491 - 492
  • [40] The Arabic language
    Kaye, AS
    JOURNAL OF THE AMERICAN ORIENTAL SOCIETY, 2000, 120 (01) : 120 - 122