Cyberbullying Detection Using PCA Extracted GLOVE Features and RoBERTaNet Transformer Learning Model

Cited by: 0

Authors:

Umer, Muhammad [1]

Alabdulqader, Ebtisam Abdullah [2]

Alarfaj, Aisha Ahmed [3]

Cascone, Lucia [4]

Nappi, Michele [4]

Affiliations:

[1] Islamia Univ Bahawalpur, Dept Comp Sci & Informat Technol, Bahawalpur 63100, Pakistan

[2] King Saud Univ, Coll Comp & Informat Sci, Dept Informat Technol, Riyadh 11421, Saudi Arabia

[3] Princess Nourah Bint Abdulrahman Univ, Coll Comp & Informat Sci, Dept Informat Syst, Riyadh 11671, Saudi Arabia

[4] Univ Salerno, Dept Comp Sci, I-84084 Fisciano, Italy

Source:

IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS | 2024

Keywords:

Cyberbullying; Principal component analysis; Accuracy; Feature extraction; Transformers; Radio frequency; Support vector machines; Cyberbullying detection; global vectors for word representation (GLOVE); natural language processing (NLP) for social media analysis; principal component analysis (PCA); robustly optimized bidirectional encoder representations from transformers approach (RoBERTa); transformer-based learning; CLASSIFICATION

DOI:

10.1109/TCSS.2024.3422185

Chinese Library Classification (CLC) number:

TP3 [Computing technology, computer technology]

Discipline classification code:

0812

Abstract:

Online platforms nurture social interaction, yet they have also led to the global proliferation of antisocial behaviors such as cyberbullying, trolling, and hate speech. Identifying hate speech and aggression has become indispensable in the fight against cyberbullying and online harassment. Cyberbullying encompasses the use of aggressive and offensive language, including rude, insulting, hateful, and teasing comments, to inflict harm on individuals through social media platforms. Human moderation is both slow and costly, which makes it impractical given the exponential growth of data. Consequently, automated detection systems are imperative to combat trolling effectively. This study addresses the challenge of automatically detecting cyberbullying in tweets sourced from a publicly available cyberbullying dataset. The proposed methodology leverages the robustly optimized bidirectional encoder representations from transformers approach (RoBERTa), integrating global vectors for word representation (GLOVE) word embedding features extracted with principal component analysis (PCA). Furthermore, the proposed approach is benchmarked against state-of-the-art machine learning, deep learning, and transformer-based methods that use the GLOVE word embedding technique. Statistical analyses show that the proposed model outperforms its counterparts, achieving an accuracy and recall of 0.98 and a precision and F1 score of 0.97 in detecting cyberbullying tweets. Results from k-fold cross validation further corroborate the superior performance of the proposed model.
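
The feature step named in the abstract (PCA applied to GLOVE embedding features) can be illustrated with a minimal sketch. This is not the authors' pipeline: the abstract does not say how tweets are turned into vectors, how many components are kept, or how the reduced features are passed to RoBERTa. The sketch assumes mean-pooled word vectors per tweet, uses a random toy table in place of pretrained GLOVE vectors, and the names `tweet_vector`, `pca_reduce`, the vocabulary, and the example tweets are all hypothetical.

```python
import numpy as np


def tweet_vector(tokens, embeddings, dim):
    """Average the vectors of the tokens found in the embedding table (assumed pooling)."""
    vectors = [embeddings[t] for t in tokens if t in embeddings]
    if not vectors:
        return np.zeros(dim)
    return np.mean(vectors, axis=0)


def pca_reduce(features, n_components):
    """Project row-wise features onto their top principal components via SVD."""
    centered = features - features.mean(axis=0)
    # Rows of vt are the principal directions, ordered by decreasing variance.
    _, singular_values, vt = np.linalg.svd(centered, full_matrices=False)
    explained = singular_values**2 / np.sum(singular_values**2)
    return centered @ vt[:n_components].T, explained[:n_components]


# Toy stand-in for a pretrained GLOVE table: random vectors, hypothetical vocabulary.
rng = np.random.default_rng(0)
DIM = 50
vocab = ["you", "are", "so", "stupid", "great", "game", "today", "nobody", "likes"]
embeddings = {word: rng.normal(size=DIM) for word in vocab}

# Hypothetical tokenized tweets, not taken from the paper's dataset.
tweets = [
    ["you", "are", "so", "stupid"],
    ["great", "game", "today"],
    ["nobody", "likes", "you"],
    ["you", "are", "great"],
]

features = np.vstack([tweet_vector(t, embeddings, DIM) for t in tweets])
reduced, explained = pca_reduce(features, n_components=2)
print(reduced.shape)  # one 2-dimensional feature row per tweet
```

In a real setting the random table would be replaced by pretrained GLOVE vectors, and the number of retained components would be chosen from the explained-variance ratios rather than fixed at 2.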


Pages: 10
