A Survey of Offensive Language Detection for the Arabic Language

被引：34

作者：

Husain, Fatemah ^{[1
]}

Uzuner, Ozlem ^{[2
]}

机构：

[1] Kuwait Univ, Sabah AlSalem Univ City Alshadadiya, Coll Life Sci, Informat Sci Dept, POB 5969, Safat 13060, Kuwait

[2] George Mason Univ, 4400 Univ Dr,5359 Nguyen Engn Bldg,MSN 1G8, Fairfax, VA 22030 USA

来源：

ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING | 2021年 / 20卷 / 01期

关键词：

Offensive language; literature review; natural language processing; Arabic language; machine learning; deep learning; ONLINE COMMUNICATION;

D O I：

10.1145/3421504

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The use of offensive language in user-generated content is a serious problem that needs to be addressed with the latest technology. The field of Natural Language Processing (NLP) can support the automatic detection of offensive language. In this survey, we review previous NLP studies that cover Arabic offensive language detection. This survey investigates the state-of-the-art in offensive language detection for the Arabic language, providing a structured overview of previous approaches, including core techniques, tools, resources, methods, and main features used. This work also discusses the limitations and gaps of the previous studies. Findings from this survey emphasize the importance of investing further effort in detecting Arabic offensive language, including the development of benchmark resources and the invention of novel preprocessing and feature extraction techniques.

引用

页数：44

共 50 条

[31] Offensive Language Detection for Low Resource Language Using Deep Sequence Model
Khan, Anas Ali
Iqbal, M. Hammad
Nisar, Shibli
Ahmad, Awais
Iqbal, Waseem
IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024, 11 (04) : 5210 - 5218
[32] Offensive language detection in low resource languages: A use case of Persian language
Mozafari, Marzieh
Mnassri, Khouloud
Farahbakhsh, Reza
Crespi, Noel
PLOS ONE, 2024, 19 (06):
[33] Sentiment Classification Techniques For Arabic Language: A Survey
Biltawi, Mariam
Etaiwi, Wael
Tedmori, Sara
Hudaib, Amjad
Awajan, Arafat
2016 7TH INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION SYSTEMS (ICICS), 2016, : 339 - 346
[34] Automatic Detection of Cyberbullying and Abusive Language in Arabic Content on Social Networks: A Survey
Khairy, Marwa
Mahmoud, Tarek M.
Abd-El-Hafeez, Tarek
AI IN COMPUTATIONAL LINGUISTICS, 2021, 189 : 156 - 166
[35] Automatic Detection of Offensive Language for Urdu and Roman Urdu
Akhter, Muhammad Pervez
Zheng Jiangbin
Naqvi, Irfan Raza
Abdelmajeed, Mohammed
Sadiq, Muhammad Tariq
IEEE ACCESS, 2020, 8 (08): : 91213 - 91226
[36] A New Corpus and Lexicon for Offensive Tamazight Language Detection
Abainia, Kheireddine
Kara, Kenza
Hamouni, Tassadit
PROCEEDINGS OF THE 7TH INTERNATIONAL WORKSHOP ON SOCIAL MEDIA WORLD SENSORS, SIDEWAYS 2022, 2022,
[37] SOD: A Corpus for Saudi Offensive Language Detection Classification
Asiri, Afefa
Saleh, Mostafa
COMPUTERS, 2024, 13 (08)
[38] BERT-based Approach to Arabic Hate Speech and Offensive Language Detection in Twitter: Exploiting Emojis and Sentiment Analysis
Althobaiti, Maha Jarallah
INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2022, 13 (05) : 972 - 980
[39] The Arabic language
Wright, O
BULLETIN OF THE SCHOOL OF ORIENTAL AND AFRICAN STUDIES-UNIVERSITY OF LONDON, 2002, 65 : 491 - 492
[40] The Arabic language
Kaye, AS
JOURNAL OF THE AMERICAN ORIENTAL SOCIETY, 2000, 120 (01) : 120 - 122

← 1 2 3 4 5 →