A Survey of Offensive Language Detection for the Arabic Language

被引：34

作者：

Husain, Fatemah ^{[1
]}

Uzuner, Ozlem ^{[2
]}

机构：

[1] Kuwait Univ, Sabah AlSalem Univ City Alshadadiya, Coll Life Sci, Informat Sci Dept, POB 5969, Safat 13060, Kuwait

[2] George Mason Univ, 4400 Univ Dr,5359 Nguyen Engn Bldg,MSN 1G8, Fairfax, VA 22030 USA

来源：

ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING | 2021年 / 20卷 / 01期

关键词：

Offensive language; literature review; natural language processing; Arabic language; machine learning; deep learning; ONLINE COMMUNICATION;

D O I：

10.1145/3421504

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The use of offensive language in user-generated content is a serious problem that needs to be addressed with the latest technology. The field of Natural Language Processing (NLP) can support the automatic detection of offensive language. In this survey, we review previous NLP studies that cover Arabic offensive language detection. This survey investigates the state-of-the-art in offensive language detection for the Arabic language, providing a structured overview of previous approaches, including core techniques, tools, resources, methods, and main features used. This work also discusses the limitations and gaps of the previous studies. Findings from this survey emphasize the importance of investing further effort in detecting Arabic offensive language, including the development of benchmark resources and the invention of novel preprocessing and feature extraction techniques.

引用

页数：44

共 50 条

[21] Offensive Language
Olson, Bennett
DOWN BEAT, 2010, 77 (04): : 10 - 10
[22] Offensive Language Detection in Nepali Social Media
Niraula, Nobal B.
Dulal, Saurab
Koirala, Diwa
WOAH 2021: THE 5TH WORKSHOP ON ONLINE ABUSE AND HARMS, 2021, : 67 - 75
[23] On the effects of machine translation on offensive language detection
Dmonte, Alphaeus
Satapara, Shrey
Alsudais, Rehab
Ranasinghe, Tharindu
Zampieri, Marcos
SOCIAL NETWORK ANALYSIS AND MINING, 2025, 14 (01)
[24] Offensive Language Detection Using Brown Clustering
Tian, Zuoyu
Kubler, Sandra
PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 5079 - 5087
[25] Domain Adaptation for Chinese Offensive Language Detection
Ying, Hao
Ou, Qiongrong
Fan, Chengjun
Mei, Lin
Zhang, Shuyu
Xu, Xu
NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, PT IV, NLPCC 2024, 2025, 15362 : 146 - 158
[26] Offensive Language and Hate Speech Detection for Danish
Sigurbergsson, Gudbjartur Ingi
Derczynski, Leon
PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 3498 - 3508
[27] Politeness strategies in translating Donald Trump's offensive language into Arabic
Abudayeh, Haneen
Dubbati, Barkuzar
PERSPECTIVES-STUDIES IN TRANSLATION THEORY AND PRACTICE, 2020, 28 (03): : 424 - 439
[28] An Automatic Approach for the Identification of Offensive Language in Perso-Arabic Urdu Language: Dataset Creation and Evaluation
Din, Salah Ud
Khusro, Shah
Khan, Farman Ali
Ahmad, Munir
Ali, Oualid
Ghazal, Taher M.
IEEE ACCESS, 2025, 13 : 19755 - 19769
[29] A comprehensive review on Arabic offensive language and hate speech detection on social media: methods, challenges and solutions
Abdelsamie, Mahmoud Mohamed
Azab, Shahira Shaaban
Hefny, Hesham A.
SOCIAL NETWORK ANALYSIS AND MINING, 2024, 14 (01)
[30] Advancing offensive language detection in Arabic social media: a BERT-based ensemble learning approach
Mazari, Ahmed Cherif
Benterkia, Asmaa
Takdenti, Zineb
SOCIAL NETWORK ANALYSIS AND MINING, 2024, 14 (01)

← 1 2 3 4 5 →