A Survey of Offensive Language Detection for the Arabic Language

被引:34
|
作者
Husain, Fatemah [1 ]
Uzuner, Ozlem [2 ]
机构
[1] Kuwait Univ, Sabah AlSalem Univ City Alshadadiya, Coll Life Sci, Informat Sci Dept, POB 5969, Safat 13060, Kuwait
[2] George Mason Univ, 4400 Univ Dr,5359 Nguyen Engn Bldg,MSN 1G8, Fairfax, VA 22030 USA
关键词
Offensive language; literature review; natural language processing; Arabic language; machine learning; deep learning; ONLINE COMMUNICATION;
D O I
10.1145/3421504
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The use of offensive language in user-generated content is a serious problem that needs to be addressed with the latest technology. The field of Natural Language Processing (NLP) can support the automatic detection of offensive language. In this survey, we review previous NLP studies that cover Arabic offensive language detection. This survey investigates the state-of-the-art in offensive language detection for the Arabic language, providing a structured overview of previous approaches, including core techniques, tools, resources, methods, and main features used. This work also discusses the limitations and gaps of the previous studies. Findings from this survey emphasize the importance of investing further effort in detecting Arabic offensive language, including the development of benchmark resources and the invention of novel preprocessing and feature extraction techniques.
引用
收藏
页数:44
相关论文
共 50 条
  • [41] Arabic Language
    Es'ad, Seydisehirli Mahmud
    Kizikli, Sadelestiren Zafer
    DINBILIMLERI AKADEMIK ARASTIRMA DERGISI-JOURNAL OF ACADEMIC RESEARCH IN RELIGIOUS SCIENCES, 2009, 9 (04): : 163 - 177
  • [42] Language resources for Maghrebi Arabic dialects’ NLP: a survey
    Jihene Younes
    Emna Souissi
    Hadhemi Achour
    Ahmed Ferchichi
    Language Resources and Evaluation, 2020, 54 : 1079 - 1142
  • [43] A Survey of OCR in Arabic Language: Applications, Techniques, and Challenges
    Faizullah, Safiullah
    Ayub, Muhammad Sohaib
    Hussain, Sajid
    Khan, Muhammad Asad
    APPLIED SCIENCES-BASEL, 2023, 13 (07):
  • [44] The Arabic language
    Ould Mohamed Baba, Ahmed Salem
    ANAQUEL DE ESTUDIOS ARABES, 2012, 23 : 147 - 148
  • [45] Language resources for Maghrebi Arabic dialects' NLP: a survey
    Younes, Jihene
    Souissi, Emna
    Achour, Hadhemi
    Ferchichi, Ahmed
    LANGUAGE RESOURCES AND EVALUATION, 2020, 54 (04) : 1079 - 1142
  • [46] Offensive Language Detection Using Soft Voting Ensemble Model
    Fieri B.
    Suhartono D.
    Mendel, 2023, 29 (01) : 1 - 6
  • [47] Offensive Language Detection Using Multi-level Classification
    Razavi, Amir H.
    Inkpen, Diana
    Uritsky, Sasha
    Matwin, Stan
    ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2010, 6085 : 16 - +
  • [48] Investigating cross-lingual training for offensive language detection
    Pelicon, Andraz
    Shekhar, Ravi
    Skrlj, Blaz
    Purver, Matthew
    Pollak, Senja
    PEERJ COMPUTER SCIENCE, 2021, 7 : 2 - 39
  • [49] Hate-Speech and Offensive Language Detection in Roman Urdu
    Rizwan, Hammad
    Shakeel, Muhammad Haroon
    Karim, Asim
    PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 2512 - 2522
  • [50] Towards Automatic Detection and Explanation of Hate Speech and Offensive Language
    Dorris, Wyatt
    Hu, Ruijia
    Vishwamitra, Nishant
    Luo, Feng
    Costello, Matthew
    PROCEEDINGS OF THE SIXTH INTERNATIONAL WORKSHOP ON SECURITY AND PRIVACY ANALYTICS (IWSPA'20), 2020, : 23 - 29