RoBERTaNET: Enhanced RoBERTa Transformer Based Model for Cyberbullying Detection With GloVe Features

被引:7
作者
Jamjoom, Arwa A. [1 ]
Karamti, Hanen [2 ]
Umer, Muhammad [3 ]
Alsubai, Shtwai [4 ]
Kim, Tai-Hoon [5 ]
Ashraf, Imran [6 ]
机构
[1] King Abdulaziz Univ, Fac Comp & Informat Technol, Dept Informat Syst, Jeddah 21589, Saudi Arabia
[2] Princess Nourah Bint Abdulrahman Univ, Coll Comp & Informat Sci, Dept Comp Sci, POB 84428, Riyadh 11671, Saudi Arabia
[3] Islamia Univ Bahawalpur, Dept Comp Sci & Informat Technol, Bahawalpur 63100, Pakistan
[4] Prince Sattam Bin Abdulaziz Univ, Coll Comp Engn & Sci, Dept Comp Sci, Al Kharj 11942, Saudi Arabia
[5] Chonnam Natl Univ, Sch Elect & Comp Engn, Yeosu Campus, Yeosu Si 59626, Jeollanam Do, South Korea
[6] Yeungnam Univ, Dept Informat & Commun Engn, Gyongsan 38541, South Korea
关键词
Cyberbullying; RoBERTa; GloVe; FastText; transformer based learning; SOCIAL MEDIA;
D O I
10.1109/ACCESS.2024.3386637
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Online platforms are fostering social interaction, but unfortunately, this has given rise to antisocial behaviors such as cyberbullying, trolling, and hate speech on a global scale. The detection of hate and aggression has become a vital aspect of combating cyberbullying and cyberharassment. Cyberbullying involves using aggressive and offensive language including rude, insulting, hateful, and teasing comments to harm individuals on social media platforms. Human moderation is both slow and expensive, making it impractical in the face of rapidly growing data. Automatic detection systems are essential to curb trolling effectively. This research deals with the challenge of automatically identifying cyberbullying in tweets from a publicly available cyberbullying dataset. This research work employs robustly optimized bidirectional encoder representations from the transformers approach (RoBERTa), utilizing global vectors for word representation (GloVe) word embedding features. The proposed approach is further compared with the state-of-the-art machine, deep, and transformer-based learning approaches with the FastText word embedding approach. Statistical results demonstrate that the proposed model outperforms others, achieving a 95% accuracy for detecting cyberbullying tweets. In addition, the model obtains 95%, 97%, and 96% for precision, recall, and F1 score, respectively. Results from k-fold cross-validation further affirm the supremacy of the proposed model with a mean accuracy of 95.07%.
引用
收藏
页码:58950 / 58959
页数:10
相关论文
共 39 条
[1]   Cervical Cancer Diagnosis Using Random Forest Classifier With SMOTE and Feature Reduction Techniques [J].
Abdoh, Sherif F. ;
Rizka, Mohamed Abo ;
Maghraby, Fahima A. .
IEEE ACCESS, 2018, 6 :59475-59485
[2]   Deep Learning for Detecting Cyberbullying Across Multiple Social Media Platforms [J].
Agrawal, Sweta ;
Awekar, Amit .
ADVANCES IN INFORMATION RETRIEVAL (ECIR 2018), 2018, 10772 :141-153
[3]   Industry 4.0 technologies and their applications in fighting COVID-19 pandemic using deep learning techniques [J].
Ahmad, Muhammad ;
Sadiq, Saima ;
Eshmawi, Ala' Abdulmajid ;
Alluhaidan, Ala Saleh ;
Umer, Muhammad ;
Ullah, Saleem ;
Nappi, Michele .
COMPUTERS IN BIOLOGY AND MEDICINE, 2022, 145
[4]   Predicting Cyberbullying on Social Media in the Big Data Era Using Machine Learning Algorithms: Review of Literature and Open Challenges [J].
Al-Garadi, Mohammed Ali ;
Hussain, Mohammad Rashid ;
Khan, Nawsher ;
Murtaza, Ghulam ;
Nweke, Henry Friday ;
Ali, Ihsan ;
Mujtaba, Ghulam ;
Chiroma, Haruna ;
Khattak, Hasan Ali ;
Gani, Abdullah .
IEEE ACCESS, 2019, 7 :70701-70718
[5]  
Alhloul A., 2023, Bullying tweets detection using CNN -attention
[6]  
AndrewMVD, 2022, Cyberbullying Classification Dataset
[7]  
[Anonymous], 2020, What is cyberbullying
[8]  
[Anonymous], 2007, Summary of Our Cyberbullying Research (2007-2019)
[9]  
Bokolo B. G., 2023, P IEEE C COMP COMM W, P1
[10]  
C. R. Center, 2019, Cyberbullying Data