Arabic Cyberbullying Detection: A Comprehensive Review of Datasets and Methodologies

被引：0

作者：

Aljalaoud, Huda ^{[1
,2
]}

Dashtipour, Kia ^{[1
]}

Al-Dubai, Ahmed Y. ^{[1
]}

机构：

[1] Edinburgh Napier Univ, Sch Comp, Merchiston Campus, Edinburgh EH10 5DT, Scotland

[2] King Abdulaziz Univ, Fac Comp & Informat Technol, Dept Comp Sci, Jeddah 21589, Saudi Arabia

来源：

IEEE ACCESS | 2025年 / 13卷

关键词：

Cyberbullying; Internet; Surveys; Systematic literature review; Natural language processing; Databases; Linguistics; Focusing; Deep learning; Annotations; Arabic cyberbullying detection; Arabic cyberbullying dataset; deep learning; machine learning; transformers-based; TWEETS; BEHAVIOR; LANGUAGE;

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

The freedom of speech in online spaces has substantially promoted engagement on social media platforms, where cyberbullying has emerged as a significant consequence. While extensive research has been conducted on cyberbullying detection in English, efforts in the Arabic language remain limited. To address this gap, the current study provides a comprehensive, state-of-the-art review of datasets and methodologies specifically focused on Arabic cyberbullying detection. It systematically reviews different relevant studies from six academic databases, examining their methodologies, dataset characteristics, and performance in terms of classification accuracy and limitations. The paper critically evaluates existing Arabic cyberbullying datasets according to criteria such as dataset size, dialectal diversity, annotation processes, and accessibility. Additionally, this review identifies critical limitations, including dataset scarcity, dialectal imbalance, annotation subjectivity, and methodological constraints. By synthesizing current knowledge, identifying research gaps, and suggesting future directions, this review supports the development of more robust, effective, and linguistically inclusive analytical methods. Ultimately, this work contributes significantly to natural language processing research and advances the creation of safer online environments for Arabic-speaking users.

引用

页码：69021 / 69038

页数：18

共 79 条

[1]

Abdelali A., 2020, P 6 AR NAT LANG PROC, DOI [https://doi.org/10.48550/arXiv.2004.02192, DOI 10.48550/ARXIV.2004.02192, 10.48550/arXiv.2004.02192]

[2] A Study of Arabic Social Media UsersPosting Behavior and Author's Gender Prediction [J].

Al-Ghadir, Abdulrahman I. ;

Azmi, Aqil M. .

COGNITIVE COMPUTATION, 2019, 11 (01) :71-86

[3] Dataset Construction for the Detection of Anti-Social Behaviour in Online Communication in Arabic [J].

Alakrot, Azalden ;

Murray, Liam ;

Nikolov, Nikola S. .

ARABIC COMPUTATIONAL LINGUISTICS, 2018, 142 :174-181

[4]

Alam Kazi Saeed, 2021, Proceedings of the Third International Conference on Intelligent Communication Technologies and Virtual Mobile Networks (ICICV 2020), P710, DOI 10.1109/ICICV50876.2021.9388499

[5]

ALBayari Reem, 2021, Proceedings of the International Conference on Artificial Intelligence and Computer Vision (AICV2021). Advances in Intelligent Systems and Computing (AISC 1377), P375, DOI 10.1007/978-3-030-76346-6_35

[6] Cyberbullying Detection Model for Arabic Text Using Deep Learning [J].

Albayari, Reem ;

Abdallah, Sherief ;

Shaalan, Khaled .

JOURNAL OF INFORMATION & KNOWLEDGE MANAGEMENT, 2025, 24 (03)

[7] Instagram-Based Benchmark Dataset for Cyberbullying Detection in Arabic Text [J].

ALBayari, Reem ;

Abdallah, Sherief .

DATA, 2022, 7 (07)

[8] Detecting Arabic Cyberbullying Tweets Using Machine Learning [J].

Alduailaj, Alanoud Mohammed ;

Belghith, Aymen .

MACHINE LEARNING AND KNOWLEDGE EXTRACTION, 2023, 5 (01) :29-42

[9]

Alduailej AH, 2017, 2017 INTERNATIONAL CONFERENCE ON COMPUTER AND APPLICATIONS (ICCA), P389, DOI 10.1109/COMAPP.2017.8079791

[10]

AlFarah M. E., 2023, P INT S NETW COMP CO, P1, DOI [10.1109/isncc58260.2023.10323808, DOI 10.1109/ISNCC58260.2023.10323808]

← 1 2 3 4 5 6 7 8 →