Enhancing cyberbullying detection: a comparative study of ensemble CNN-SVM and BERT models

Cited by: 4
Authors
Saini, Hiteshi [1]
Mehra, Himashri [1]
Rani, Ritu [1]
Jaiswal, Garima [2]
Sharma, Arun [1]
Dev, Amita [1]
Affiliations
[1] Indira Gandhi Delhi Tech Univ Women, New Delhi, India
[2] Bennett Univ, Greater Noida, India
Keywords
Cyberbullying detection; Machine learning; Deep learning; Ensemble learning; Online social networking; Social media; Support vector machine; Naive Bayes; Convolutional neural network; CNN-SVM; BERT
DOI
10.1007/s13278-023-01158-w
Chinese Library Classification (CLC)
TP [Automation technology, computer technology]
Subject classification code
0812
Abstract
Technological improvements have increased the number of people who use online social networking sites, and cyberbullying has increased with them. Bullies can attack victims through a large network of online social networking platforms. Cyberbullying is an umbrella term encompassing a wide range of online abuse, including but not limited to harassment, doxing, and reputation attacks. These attacks frequently leave victims with persistent mental scars, leading to depression, self-harm, and suicidal thoughts. Given the effects of cyberbullying, there is an urgent need to prosecute and prevent such crimes. This paper gives a comprehensive review as well as an empirical analysis of machine learning, ensemble-based, and transformer-based models for cyberbullying detection. It proposes two architectures to efficiently detect cyberbullying patterns. The proposed ensemble model uses a CNN to extract the relevant features, and classification is performed by an SVM. The other proposed architecture uses the pre-trained BERT model to detect cyberbullying behavior on online platforms. Both proposed models were tested on two separate datasets and achieved maximum accuracies of 96.88% and 97.34% for the ensemble and BERT models, respectively. The paper also provides a thorough examination of the various methodologies used for cyberbullying detection and conducts an empirical and comparative analysis of the presented models against traditional and current algorithms.
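The ensemble idea in the abstract (a CNN extracts features, an SVM classifies them) can be illustrated with a minimal, dependency-free sketch. This is not the authors' implementation: the embeddings and convolution filters below are random and untrained, the four example texts and their labels are invented for illustration, and the linear SVM is fitted with a simple hinge-loss subgradient update. All names (`cnn_features`, `train_linear_svm`, `predict`) are hypothetical.

```python
import random

rng = random.Random(0)           # fixed seed so the sketch is repeatable
EMB_DIM, N_FILTERS, WIDTH = 8, 6, 2

# ASSUMPTION: random, untrained filters stand in for a trained CNN layer.
filters = [[[rng.uniform(-1, 1) for _ in range(EMB_DIM)]
            for _ in range(WIDTH)] for _ in range(N_FILTERS)]

vocab = {}

def embed(token):
    # ASSUMPTION: a fixed random vector per token replaces learned embeddings.
    if token not in vocab:
        vocab[token] = [rng.uniform(-1, 1) for _ in range(EMB_DIM)]
    return vocab[token]

def cnn_features(text):
    """1D convolution over token embeddings, ReLU, then max-over-time pooling."""
    vecs = [embed(t) for t in text.lower().split()]
    while len(vecs) < WIDTH:                 # zero-pad very short inputs
        vecs.append([0.0] * EMB_DIM)
    feats = []
    for f in filters:
        best = 0.0                           # starting at 0 applies the ReLU
        for i in range(len(vecs) - WIDTH + 1):
            act = sum(f[k][d] * vecs[i + k][d]
                      for k in range(WIDTH) for d in range(EMB_DIM))
            best = max(best, act)
        feats.append(best)
    return feats

def train_linear_svm(X, y, epochs=200, lr=0.05, lam=0.01):
    """Linear SVM via subgradient descent on the L2-regularized hinge loss."""
    w, b = [0.0] * len(X[0]), 0.0
    for _ in range(epochs):
        for xi, yi in zip(X, y):
            margin = yi * (sum(wj * xj for wj, xj in zip(w, xi)) + b)
            if margin < 1:                   # inside the margin: hinge is active
                w = [wj - lr * (lam * wj - yi * xj) for wj, xj in zip(w, xi)]
                b += lr * yi
            else:                            # only the regularizer contributes
                w = [wj - lr * lam * wj for wj in w]
    return w, b

def predict(w, b, x):
    score = sum(wj * xj for wj, xj in zip(w, x)) + b
    return 1 if score >= 0 else -1

# Invented toy data: +1 = bullying, -1 = not bullying.
data = [("you are so stupid", 1), ("nobody likes you", 1),
        ("great game last night", -1), ("thanks for the help", -1)]
X = [cnn_features(t) for t, _ in data]
y = [label for _, label in data]
w, b = train_linear_svm(X, y)
print([predict(w, b, x) for x in X])
```

In a real pipeline the convolutional layer would be trained on labeled data and its pooled activations passed to the SVM; the sketch only shows how the two stages connect.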
Pages: 18
Related papers
50 items in total
  • [31] Comparative Study of Model Optimization Techniques in Fine-Tuned CNN Models
    Poojary, Ramaprasad
    Pai, Akul
    2019 INTERNATIONAL CONFERENCE ON ELECTRICAL AND COMPUTING TECHNOLOGIES AND APPLICATIONS (ICECTA), 2019
  • [32] Gamma function based ensemble of CNN models for breast cancer detection in histopathology images
    Majumdar, Samriddha
    Pramanik, Payel
    Sarkar, Ram
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 213
  • [33] Adaptive ensemble techniques leveraging BERT based models for multilingual hate speech detection in Korean and English
    Seohyun Yoo
    Eunbae Jeon
    Joonseo Hyeon
    Jaehyuk Cho
    Scientific Reports, 15 (1)
  • [34] A Comparative Study of Different Pre-Trained Deep Learning Models and Custom CNN for Pancreatic Tumor Detection
    Zavalsiz, Muhammed Talha
    Alhajj, Sleiman
    Sailunaz, Kashfia
    Ozyer, Tansel
    Alhajj, Reda
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2023, 20 (3A) : 515 - 526
  • [35] Acute lymphoblastic leukemia detection using ensemble features from multiple deep CNN models
    Abul Hasanaath, Ahmed
    Mohammed, Abdul Sami
    Latif, Ghazanfar
    Abdelhamid, Sherif E.
    Alghazo, Jaafar
    Abul Hussain, Ahmed
    ELECTRONIC RESEARCH ARCHIVE, 2024, 32 (04) : 2407 - 2423
  • [36] Enhancing Automated Lung Disease Detection: An Approached Using Multi Network Features and ECOC-SVM Ensemble
    Wong, Wei Kitt
    Tan, Darryl Wen Shen
    Juwono, Filbert H.
    Chew, Ing Ming
    Tiong, Teck Chai
    JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2024, 40 (05) : 1005 - 1016
  • [37] A Comparative Study of Ensemble Models for Predicting Road Traffic Congestion
    Bokaba, Tebogo
    Doorsamy, Wesley
    Paul, Babu Sena
    APPLIED SCIENCES-BASEL, 2022, 12 (03)
  • [38] Image Captioning Encoder–Decoder Models Using CNN-RNN Architectures: A Comparative Study
    K. Revati Suresh
    Arun Jarapala
    P. V. Sudeep
    Circuits, Systems, and Signal Processing, 2022, 41 : 5719 - 5742
  • [39] Enhancing Diagnostic Accuracy for Skin Cancer and COVID-19 Detection: A Comparative Study Using a Stacked Ensemble Method
    Qayyum, Hafza
    Rizvi, Syed Tahir Hussain
    Naeem, Muddasar
    Khalid, Umamah bint
    Abbas, Musarat
    Coronato, Antonio
    TECHNOLOGIES, 2024, 12 (09)
  • [40] Pretrained Transformer Language Models Versus Pretrained Word Embeddings for the Detection of Accurate Health Information on Arabic Social Media: Comparative Study
    Albalawi, Yahya
    Nikolov, Nikola S.
    Buckley, Jim
    JMIR FORMATIVE RESEARCH, 2022, 6 (06)