Comparing Machine Learning and Deep Learning Techniques for Text Analytics: Detecting the Severity of Hate Comments Online

Cited by: 4
Authors
Marshan, Alaa [1 ]
Nizar, Farah Nasreen Mohamed [2 ]
Ioannou, Athina [3 ]
Spanaki, Konstantina [4 ]
Affiliations
[1] Univ Surrey, Dept Comp Sci, Guildford, England
[2] Brunel Univ, Dept Comp Sci, London, England
[3] Univ Surrey, Surrey Business Sch, Guildford, England
[4] Audencia Business Sch, Nantes, France
Keywords
Machine learning; Deep learning; Hate speech; Social media; Text pre-processing; Text representation; Text analytics; Speech detection
DOI
10.1007/s10796-023-10446-x
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Social media platforms have become an increasingly popular way for individuals to share their thoughts and opinions with others. However, people often misuse social media by posting abusive comments, and abusive and harassing behaviour can have adverse effects on people's lives. This study takes a novel approach to combating harassment on online platforms by detecting the severity of abusive comments, which has not been investigated before. The study compares the performance of machine learning models such as Naive Bayes, Random Forest, and Support Vector Machine with deep learning models such as Convolutional Neural Network (CNN) and Bi-directional Long Short-Term Memory (Bi-LSTM). It also investigates the effect of text pre-processing on the performance of the machine and deep learning models. The feature set for the abusive comments was built from unigrams and bigrams for the machine learning models and from word embeddings for the deep learning models. The comparison of the models' performances showed that Random Forest with bigrams achieved the best overall performance, with an accuracy of 0.94, a precision of 0.91, a recall of 0.94, and an F1 score of 0.92. The study develops an efficient model for detecting the severity of abusive language on online platforms, with important implications for both theory and practice.
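As a rough illustration of the unigram and bigram features mentioned in the abstract, the following minimal sketch counts n-grams in a comment using only the Python standard library. The function names, the whitespace tokenizer, and the sample comment are assumptions made for illustration; they are not the authors' pipeline. The last function computes F1 as the harmonic mean of precision and recall. The abstract does not say how the reported scores were averaged across severity classes, so the final check is only a plausibility check on the rounded figures.

```python
from collections import Counter


def ngrams(tokens, n):
    """Return every contiguous n-gram in a token list as a tuple."""
    return [tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def bag_of_ngrams(text, n):
    """Lowercase the text, split on whitespace, and count its n-grams.

    Whitespace tokenization is an assumption; real pre-processing would
    typically also handle punctuation, stop words, and so on.
    """
    tokens = text.lower().split()
    return Counter(ngrams(tokens, n))


def f1_score(precision, recall):
    """Harmonic mean of precision and recall (0.0 when both are zero)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Hypothetical sample comment, not taken from the study's data set.
comment = "You are a terrible person"
unigrams = bag_of_ngrams(comment, 1)
bigrams = bag_of_ngrams(comment, 2)

print(sorted(unigrams))
print(sorted(bigrams))
print(round(f1_score(0.91, 0.94), 2))
```

Count vectors of this kind can serve as input to classifiers such as Naive Bayes, Random Forest, or a Support Vector Machine. Bigrams keep some local word order that unigrams discard.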
Pages: 19
Related papers
50 in total
  • [1] Performance Comparison of Machine Learning and Deep Learning Algorithms in Detecting Online Hate Speech
    Shibly, F. H. A.
    Sharma, Uzzal
    Naleer, H. M. M.
    INTERNATIONAL CONFERENCE ON INNOVATIVE COMPUTING AND COMMUNICATIONS, ICICC 2022, VOL 1, 2023, 473 : 695 - 706
  • [2] Detecting Hate Speech using Deep Learning Techniques
    Paul, Chayan
    Bora, Pronami
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2021, 12 (02) : 619 - 623
  • [3] Detecting Cognitive Distortions Through Machine Learning Text Analytics
    Simms, T.
    Ramstedt, C.
    Rich, M.
    Richards, M.
    Martinez, T.
    Giraud-Carrier, C.
    2017 IEEE INTERNATIONAL CONFERENCE ON HEALTHCARE INFORMATICS (ICHI), 2017, : 508 - 512
  • [4] Evaluation of Different Machine Learning and Deep Learning Techniques for Hate Speech Detection
    Shawkat, Nabil
    Saquer, Jamil
    Shatnawi, Hazim
    PROCEEDINGS OF THE 2024 ACM SOUTHEAST CONFERENCE, ACMSE 2024, 2024, : 253 - 258
  • [5] A Review on Text Sentiment Analysis With Machine Learning and Deep Learning Techniques
    Mamani-Coaquira, Yonatan
    Villanueva, Edwin
    IEEE ACCESS, 2024, 12 : 193115 - 193130
  • [6] Anomaly detection in consumer review analytics for idea generation in product innovation: Comparing machine learning and deep learning techniques
    Cui, Xiling
    Zhu, Zhongshan
    Liu, Libo
    Zhou, Qiang
    Liu, Qiang
    TECHNOVATION, 2024, 134
  • [7] Healthcare predictive analytics using machine learning and deep learning techniques: a survey
    Mohammed Badawy
    Nagy Ramadan
    Hesham Ahmed Hefny
    Journal of Electrical Systems and Information Technology, 10 (1)
  • [8] Deep Learning Algorithms for Detecting Fake News in Online Text
    Girgis, Sherry
    Amer, Eslam
    Gadallah, Mahmoud
    PROCEEDINGS OF 2018 13TH INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING AND SYSTEMS (ICCES), 2018, : 93 - 97
  • [9] Applying machine learning to text analytics
    Riemer, Matthew
    IBM Data Management Magazine, 2014, (06):
  • [10] Comparing the Robustness of Classical and Deep Learning Techniques for Text Classification
    Quynh Tran
    Shpileuskaya, Krystsina
    Zaunseder, Elaine
    Putzar, Larissa
    Blankenburg, Sven
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,