ProTect: a hybrid deep learning model for proactive detection of cyberbullying on social media

被引:4
作者
Harshitha, T. Nitya [1 ]
Prabu, M. [1 ]
Suganya, E. [2 ]
Sountharrajan, S. [1 ]
Bavirisetti, Durga Prasad [3 ]
Gadde, Navya [1 ]
Uppu, Lakshmi Sahithi [1 ]
机构
[1] Amrita Vishwa Vidyapeetham, Amrita Sch Comp, Dept Comp Sci & Engn, Chennai, India
[2] Sri Sivasubramaniya Nadar Coll Engn, Dept Informat Technol, Chennai, India
[3] Norwegian Univ Sci & Technol NTNU, Dept Comp Sci, Trondheim, Norway
来源
FRONTIERS IN ARTIFICIAL INTELLIGENCE | 2024年 / 7卷
关键词
cyber bullying; deep learning; social media; text analysis; neural network; machine learning; data mining;
D O I
10.3389/frai.2024.1269366
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The emergence of social media has given rise to a variety of networking and communication opportunities, as well as the well-known issue of cyberbullying, which is continuously on the rise in the current world. Researchers have been actively addressing cyberbullying for a long time by applying machine learning and deep learning techniques. However, although these algorithms have performed well on artificial datasets, they do not provide similar results when applied to real-time datasets with high levels of noise and imbalance. Consequently, finding generic algorithms that can work on dynamic data available across several platforms is critical. This study used a unique hybrid random forest-based CNN model for text classification, combining the strengths of both approaches. Real-time datasets from Twitter and Instagram were collected and annotated to demonstrate the effectiveness of the proposed technique. The performance of various ML and DL algorithms was compared, and the RF-based CNN model outperformed them in accuracy and execution speed. This is particularly important for timely detection of bullying episodes and providing assistance to victims. The model achieved an accuracy of 96% and delivered results 3.4 seconds faster than standard CNN models.
引用
收藏
页数:11
相关论文
共 48 条
[1]  
Ahmed M. F., 2021, arXiv preprint arXiv:2106.04506
[2]  
Ahmed Md T., 2021, 2021 INT C ADV EL CO, DOI [10.1109/icaect49130.2021.9392608, DOI 10.1109/ICAECT49130.2021.9392608]
[3]  
Akhter A., 2023, Natural Language Processing Journal, V4, P027, DOI DOI 10.1016/J.NLP.2023.100027
[4]   Predicting Cyberbullying on Social Media in the Big Data Era Using Machine Learning Algorithms: Review of Literature and Open Challenges [J].
Al-Garadi, Mohammed Ali ;
Hussain, Mohammad Rashid ;
Khan, Nawsher ;
Murtaza, Ghulam ;
Nweke, Henry Friday ;
Ali, Ihsan ;
Mujtaba, Ghulam ;
Chiroma, Haruna ;
Khattak, Hasan Ali ;
Gani, Abdullah .
IEEE ACCESS, 2019, 7 :70701-70718
[5]  
Alam Kazi Saeed, 2021, Proceedings of the Third International Conference on Intelligent Communication Technologies and Virtual Mobile Networks (ICICV 2020), P710, DOI 10.1109/ICICV50876.2021.9388499
[6]  
Amali HMAI, 2020, MERCON 2020: 6TH INTERNATIONAL MULTIDISCIPLINARY MORATUWA ENGINEERING RESEARCH CONFERENCE (MERCON), P266, DOI [10.1109/mercon50084.2020.9185209, 10.1109/MERCon50084.2020.9185209]
[7]   Cyberbullying: Concepts, theories, and correlates informing evidence-based best practices for prevention [J].
Ansary, Nadia S. .
AGGRESSION AND VIOLENT BEHAVIOR, 2020, 50
[8]   Feature selection using an improved Chi-square for Arabic text classification [J].
Bahassine, Said ;
Madani, Abdellah ;
Al-Sarem, Mohammed ;
Kissi, Mohamed .
JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2020, 32 (02) :225-231
[9]  
Banerjee V, 2019, INT CONF ADVAN COMPU, P604, DOI [10.1109/ICACCS.2019.8728378, 10.1109/icaccs.2019.8728378]
[10]   SMOTE: Synthetic minority over-sampling technique [J].
Chawla, Nitesh V. ;
Bowyer, Kevin W. ;
Hall, Lawrence O. ;
Kegelmeyer, W. Philip .
2002, American Association for Artificial Intelligence (16)