Cyberbullying Detection Using PCA Extracted GLOVE Features and RoBERTaNet Transformer Learning Model

被引:0
作者
Umer, Muhammad [1 ]
Alabdulqader, Ebtisam Abdullah [2 ]
Alarfaj, Aisha Ahmed [3 ]
Cascone, Lucia [4 ]
Nappi, Michele [4 ]
机构
[1] Islamia Univ Bahawalpur, Dept Comp Sci & Informat Technol, Bahawalpur 63100, Pakistan
[2] King Saud Univ, Coll Comp & Informat Sci, Dept Informat Technol, Riyadh 11421, Saudi Arabia
[3] Princess Nourah Bint Abdulrahman Univ, Coll Comp & Informat Sci, Dept Informat Syst, Riyadh 11671, Saudi Arabia
[4] Univ Salerno, Dept Comp Sci, I-84084 Fisciano, Italy
来源
IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS | 2024年
关键词
Cyberbullying; Principal component analysis; Accuracy; Feature extraction; Transformers; Radio frequency; Support vector machines; Cyberbullying detection; global vectors for word representation (GLOVE); natural language processing (NLP) for social media analysis; principle component analysis (PCA); robustly optimized bidirectional encoder representations from transformers approach (RoBERTa); transformer-based learning; CLASSIFICATION;
D O I
10.1109/TCSS.2024.3422185
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Online platforms are nurturing social interactions, yet regrettably, they have also led to the proliferation of antisocial behaviors such as cyberbullying, trolling, and hate speech on a global scale. The identification of hate speech and aggression has become indispensable in the fight against cyberbullying and online harassment. Cyberbullying encompasses the use of aggressive and offensive language, including rude, insulting, hateful, and teasing comments, to inflict harm on individuals through social media platforms. Human moderation is both sluggish and costly, rendering it impractical in light of the exponential growth of data. Consequently, automated detection systems are imperative to effectively combat trolling. This study addresses the challenge of automatically discerning cyberbullying in tweets sourced from a publicly available cyberbullying dataset. The proposed methodology leverages the robustly optimized bidirectional encoder representations from transformers approach (RoBERTa), integrating principle component analysis (PCA) extracted global vectors for word representation (GLOVE) word embedding features. Furthermore, our proposed approach is benchmarked against state-of-the-art machine learning, deep learning, and transformer-based methods, utilizing the GLOVE word embedding technique. Statistical analyses reveal that our proposed model outperforms its counterparts, achieving a 0.98 accuracy and recall rate with 0.97 of precision and F1 score in detecting cyberbullying tweets. Results from k-fold cross validation further corroborate the superior performance of our proposed model.
引用
收藏
页数:10
相关论文
共 50 条
  • [41] Arabic Cyberbullying Detection: Enhancing Performance by Using Ensemble Machine Learning
    Haidar, Batoul
    Chamoun, Maroun
    Serhrouchni, Ahmed
    2019 INTERNATIONAL CONFERENCE ON INTERNET OF THINGS (ITHINGS) AND IEEE GREEN COMPUTING AND COMMUNICATIONS (GREENCOM) AND IEEE CYBER, PHYSICAL AND SOCIAL COMPUTING (CPSCOM) AND IEEE SMART DATA (SMARTDATA), 2019, : 323 - 327
  • [42] Automatic detection of cyberbullying and threatening in Saudi tweets using machine learning
    Alghamdi, Deema
    Al-Motery, Rahaf
    Alma'abdi, Reem
    Alzamzami, Ohoud
    Babour, Amal
    INTERNATIONAL JOURNAL OF ADVANCED AND APPLIED SCIENCES, 2021, 8 (10): : 17 - 25
  • [43] Early Detection of Diabetic Retinopathy Using PCA-Firefly Based Deep Learning Model
    Gadekallu, Thippa Reddy
    Khare, Neelu
    Bhattacharya, Sweta
    Singh, Saurabh
    Maddikunta, Praveen Kumar Reddy
    Ra, In-Ho
    Alazab, Mamoun
    ELECTRONICS, 2020, 9 (02)
  • [44] ProTect: a hybrid deep learning model for proactive detection of cyberbullying on social media
    Harshitha, T. Nitya
    Prabu, M.
    Suganya, E.
    Sountharrajan, S.
    Bavirisetti, Durga Prasad
    Gadde, Navya
    Uppu, Lakshmi Sahithi
    FRONTIERS IN ARTIFICIAL INTELLIGENCE, 2024, 7
  • [45] An Ensemble Deep Learning Model for Oral Squamous Cell Carcinoma Detection Using Histopathological Image Analysis
    Das, Madhusmita
    Dash, Rasmita
    Kumar Mishra, Sambit
    Kumar Dalai, Asish
    IEEE ACCESS, 2024, 12 : 127185 - 127197
  • [46] A Deep Features Extraction Model Based on the Transfer Learning Model and Vision Transformer "TLMViT" for Plant Disease Classification
    Tabbakh, Amer
    Barpanda, Soubhagya Sankar
    IEEE ACCESS, 2023, 11 : 45377 - 45392
  • [47] An Automatic Defect Detection System for Synthetic Shuttlecocks Using Transformer Model
    Lin, Ching-Sheng
    Hsieh, Han-Yi
    IEEE ACCESS, 2022, 10 : 37412 - 37421
  • [48] A Novel Outlier Detection Model for Vibration Signals Using Transformer Networks
    Zhang, Ruiheng
    Zhou, Quan
    Tian, Lulu
    Bai, Libing
    Zhang, Jie
    IEEE ACCESS, 2022, 10 : 57234 - 57241
  • [49] Cyberbullying Detection on Social Media Using Stacking Ensemble Learning and Enhanced BERT
    Muneer, Amgad
    Alwadain, Ayed
    Ragab, Mohammed Gamal
    Alqushaibi, Alawi
    INFORMATION, 2023, 14 (08)
  • [50] A Hybrid Machine Learning-Based Framework for Data Injection Attack Detection in Smart Grids Using PCA and Stacked Autoencoders
    Tufail, Shahid
    Iqbal, Hasan
    Tariq, Mohd
    Sarwat, Arif I.
    IEEE ACCESS, 2025, 13 : 33783 - 33798