Comparing Machine Learning and Deep Learning Techniques for Text Analytics: Detecting the Severity of Hate Comments Online

被引：4

作者：

Marshan, Alaa ^{[1
]}

Nizar, Farah Nasreen Mohamed ^{[2
]}

Ioannou, Athina ^{[3
]}

Spanaki, Konstantina ^{[4
]}

机构：

[1] Univ Surrey, Dept Comp Sci, Guildford, England

[2] Brunel Univ, Dept Comp Sci, London, England

[3] Univ Surrey, Surrey Business Sch, Guildford, England

[4] Audencia Business Sch, Nantes, France

来源：

INFORMATION SYSTEMS FRONTIERS | 2023年

关键词：

Machine learning; Deep learning; Hate speech; Social media; Text pre-processing; Text representation; Text analytics; SPEECH DETECTION; SOCIAL MEDIA;

D O I：

10.1007/s10796-023-10446-x

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Social media platforms have become an increasingly popular tool for individuals to share their thoughts and opinions with other people. However, very often people tend to misuse social media posting abusive comments. Abusive and harassing behaviours can have adverse effects on people's lives. This study takes a novel approach to combat harassment in online platforms by detecting the severity of abusive comments, that has not been investigated before. The study compares the performance of machine learning models such as Naive Bayes, Random Forest, and Support Vector Machine, with deep learning models such as Convolutional Neural Network (CNN) and Bi-directional Long Short-Term Memory (Bi-LSTM). Moreover, in this work we investigate the effect of text pre-processing on the performance of the machine and deep learning models, the feature set for the abusive comments was made using unigrams and bigrams for the machine learning models and word embeddings for the deep learning models. The comparison of the models' performances showed that the Random Forest with bigrams achieved the best overall performance with an accuracy of (0.94), a precision of (0.91), a recall of (0.94), and an F1 score of (0.92). The study develops an efficient model to detect severity of abusive language in online platforms, offering important implications both to theory and practice.

引用

页数：19

共 50 条

[1] Performance Comparison of Machine Learning and Deep Learning Algorithms in Detecting Online Hate Speech
Shibly, F. H. A.
Sharma, Uzzal
Naleer, H. M. M.
INTERNATIONAL CONFERENCE ON INNOVATIVE COMPUTING AND COMMUNICATIONS, ICICC 2022, VOL 1, 2023, 473 : 695 - 706
[2] Detecting Hate Speech using Deep Learning Techniques
Paul, Chayan
Bora, Pronami
INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2021, 12 (02) : 619 - 623
[3] Detecting Cognitive Distortions Through Machine Learning Text Analytics
Simms, T.
Ramstedt, C.
Rich, M.
Richards, M.
Martinez, T.
Giraud-Carrier, C.
2017 IEEE INTERNATIONAL CONFERENCE ON HEALTHCARE INFORMATICS (ICHI), 2017, : 508 - 512
[4] Evaluation of Different Machine Learning and Deep Learning Techniques for Hate Speech Detection
Shawkat, Nabil
Saquer, Jamil
Shatnawi, Hazim
PROCEEDINGS OF THE 2024 ACM SOUTHEAST CONFERENCE, ACMSE 2024, 2024, : 253 - 258
[5] A Review on Text Sentiment Analysis With Machine Learning and Deep Learning Techniques
Mamani-Coaquira, Yonatan
Villanueva, Edwin
IEEE ACCESS, 2024, 12 : 193115 - 193130
[6] Anomaly detection in consumer review analytics for idea generation in product innovation: Comparing machine learning and deep learning techniques
Cui, Xiling
Zhu, Zhongshan
Liu, Libo
Zhou, Qiang
Liu, Qiang
TECHNOVATION, 2024, 134
[7] Healthcare predictive analytics using machine learning and deep learning techniques: a survey
Mohammed Badawy
Nagy Ramadan
Hesham Ahmed Hefny
Journal of Electrical Systems and Information Technology, 10 (1)
[8] Deep Learning Algorithms for Detecting Fake News in Online Text
Girgis, Sherry
Amer, Eslam
Gadallah, Mahmoud
PROCEEDINGS OF 2018 13TH INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING AND SYSTEMS (ICCES), 2018, : 93 - 97
[9] Applying machine learning to text analytics
Riemer, Matthew
IBM Data Management Magazine, 2014, (06):
[10] Comparing the Robustness of Classical and Deep Learning Techniques for Text Classification
Quynh Tran
Shpileuskaya, Krystsina
Zaunseder, Elaine
Putzar, Larissa
Blankenburg, Sven
2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,

← 1 2 3 4 5 →