Evaluation of Different Machine Learning and Deep Learning Techniques for Hate Speech Detection

被引:1
作者
Shawkat, Nabil [1 ]
Saquer, Jamil [1 ]
Shatnawi, Hazim [1 ]
机构
[1] Missouri State Univ, Springfield, MO 65897 USA
来源
PROCEEDINGS OF THE 2024 ACM SOUTHEAST CONFERENCE, ACMSE 2024 | 2024年
关键词
hate speech; machine learning; deep learning; BERT; text classification;
D O I
10.1145/3603287.3651218
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Detecting online hate speech is important for creating safer online spaces. In this paper, we evaluate the performance of several machine learning (ML) and deep learning (DL) models in detecting hate speech on three different datasets. We evaluate the performance of the traditional ML algorithms Support Vector Machines (SVM), Naive Bayes, Decision Trees, Random Forests, and Logistic Regression. We also evaluate the performance of deep learning Convolutional Neural Networks (CNN), Long Short Term Memory (LSTM), and the BERT pre-trained transformer model. Our experiments show that BERT outperformed all other models with F-1 scores of 90.6% on one dataset and 89.7% and 88.2% on the other two datasets. After that, CNN and LSTM outperformed the traditional ML algorithms with F1-scores over 80% on all three datasets. Among the traditional ML models, SVM performed best with the highest F1-score of 75.6%.
引用
收藏
页码:253 / 258
页数:6
相关论文
共 38 条
[1]   Transformer models for text-based emotion detection: a review of BERT-based approaches [J].
Acheampong, Francisca Adoma ;
Nunoo-Mensah, Henry ;
Chen, Wenyu .
ARTIFICIAL INTELLIGENCE REVIEW, 2021, 54 (08) :5789-5829
[2]   Deep Learning for Detecting Cyberbullying Across Multiple Social Media Platforms [J].
Agrawal, Sweta ;
Awekar, Amit .
ADVANCES IN INFORMATION RETRIEVAL (ECIR 2018), 2018, 10772 :141-153
[3]  
Alsharif Osama, 2023, Rise in Hate Speech Over Gaza a Defining Moment
[4]  
[Anonymous], 2012, Detecting Insults in Social Commentary Dataset
[5]   "A Virus Has No Religion": Analyzing Islamophobia on Twitter During the COVID-19 Outbreak [J].
Chandra, Mohit ;
Reddy, Manvith ;
Sehgal, Shradha ;
Gupta, Saurabh ;
Buduru, Arun Balaji ;
Kumaraguru, Ponnurangam .
PROCEEDINGS OF THE 32ND ACM CONFERENCE ON HYPERTEXT AND SOCIAL MEDIA (HT '21), 2021, :67-77
[6]   "Subverting the Jewtocracy": Online Antisemitism Detection Using Multimodal Deep Learning [J].
Chandra, Mohit ;
Pailla, Dheeraj ;
Bhatia, Himanshu ;
Sanchawala, Aadilmehdi ;
Gupta, Manish ;
Shrivastava, Manish ;
Kumaraguru, Ponnurangam .
PROCEEDINGS OF THE 13TH ACM WEB SCIENCE CONFERENCE, WEBSCI 2021, 2020, :148-157
[7]  
COOK S, 2023, Cyberbullying facts and statistics for 2018-2023
[8]  
Davidson T., 2017, P INT AAAI C WEB SOC, P512, DOI DOI 10.1609/ICWSM.V11I1.14955
[9]  
Ekma Magnus, 2021, Learning Deep Learning: Theory and Practice of Neural Networks, Computer Vision, Natural Language Processing, and Transformers Using TensorFlow
[10]  
ElSherief M., 2018, P INT AAAI C WEB SOC, V12