A transfer learning approach for detecting offensive and hate speech on social media platforms

被引:10
|
作者
Priyadarshini, Ishaani [1 ]
Sahu, Sandipan [2 ]
Kumar, Raghvendra [3 ]
机构
[1] Univ Calif Berkeley, Sch Informat, Berkeley, CA USA
[2] Bengal Inst Technol, Dept Comp Sci & Engn, Kolkata, India
[3] GIET Univ, Dept Comp Sci & Engn, Gunupur, India
关键词
Hate speech; Transfer learning; Word2vec model; GloVe model; LSTM;
D O I
10.1007/s11042-023-14481-3
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Over the last few decades, the expansion of technology and the internet has led to the number of users proliferating on social media, with a simultaneous increase in hate speech. A critical concern is, hate speech is not only responsible for igniting violence and spreading hatred, but its detection also requires a considerable amount of computing resources and content monitoring by human experts and algorithms. While the research is an active area, and several artificial intelligence techniques have been proposed in the past to address the concern, the rise in the number of petabytes of the content generated calls for methods that will exhibit improved performance and reduced model development time. We propose a transfer learning approach for detecting hate and offensive speech on social media that deploys a pre-trained model for data analysis thereby promoting model reusability. We propose two transfer learning models, i.e. Google's Word2vec model using LSTM and GloVe Model using LSTM for the same and compare the performance of our proposed model against unigram and bigram language models for Naive Bayes (NB), Decision Trees (DT), and Support Vector Machines (SVM), which are also the baseline algorithms considered for analysis. The performance of the proposed models for classifying hate speech, offensive speech, and neutral speech is validated using metrics such as precision, recall, F-1 score, and support. The overall performance of the models across multiple datasets has been evaluated with respect to accuracy. In-depth experimental analysis and results depict that the proposed model is significantly robust for detecting hateful and offensive speech and also performs better than the considered baseline algorithms.
引用
收藏
页码:27473 / 27499
页数:27
相关论文
共 50 条
  • [41] Moralized language predicts hate speech on social media
    Solovev, Kirill
    Proellochs, Nicolas
    PNAS NEXUS, 2023, 2 (01):
  • [42] Multimodal Hate Speech Detection in Greek Social Media
    Perifanos, Konstantinos
    Goutsos, Dionysis
    MULTIMODAL TECHNOLOGIES AND INTERACTION, 2021, 5 (07)
  • [43] Legal Testing on Hate Speech Through Social Media
    Silambi, Erni Dwita
    Azis, Yuldiana Zesa
    Alputila, Marlyn Jane
    Septarini, Dina Fitri
    PROCEEDINGS OF THE 1ST INTERNATIONAL CONFERENCE ON SOCIAL SCIENCES (ICSS 2018), 2018, 226 : 1411 - 1414
  • [44] The eradication of hate speech on social media: a systematic review
    Gracia-Calandin, Javier
    Suarez-Montoya, Leonardo
    JOURNAL OF INFORMATION COMMUNICATION & ETHICS IN SOCIETY, 2023, 21 (04): : 406 - 421
  • [45] HATE SPEECH ON SOCIAL MEDIA: FREEDOM OF EXPRESSION AT A CROSSROADS
    Bueso, Laura Diez
    REVISTA CATALANA DE DRET PUBLIC, 2020, (61): : 50 - 64
  • [46] How Successful Is Transfer Learning for Detecting Anorexia on Social Media?
    Lopez-Ubeda, Pilar
    Plaza-del-Arco, Flor Miriam
    Diaz-Galiano, Manuel Carlos
    Martin-Valdivia, Maria-Teresa
    APPLIED SCIENCES-BASEL, 2021, 11 (04): : 1 - 16
  • [47] Context-Aware Deep Learning Model for Detection of Roman Urdu Hate Speech on Social Media Platform
    Bilal, Muhammad
    Khan, Atif
    Jan, Salman
    Musa, Shahrulniza
    IEEE ACCESS, 2022, 10 : 121133 - 121151
  • [48] Detecting offensive speech in conversational code-mixed dialogue on social media: A contextual dataset and benchmark experiments
    Madhu, Hiren
    Satapara, Shrey
    Modha, Sandip
    Mandl, Thomas
    Majumder, Prasenjit
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 215
  • [49] A Supervised Classification Approach for Detecting Hate Speech in English Tweets
    Kumar, N. Solomon Praveen
    Mythili, M. S.
    JOURNAL OF ADVANCED APPLIED SCIENTIFIC RESEARCH, 2023, 5 (04): : 55 - 66
  • [50] Detection of Hate Speech and Offensive Language CodeMix Text in Dravidian Languages Using Cost-Sensitive Learning Approach
    Sreelakshmi, K.
    Premjith, B.
    Chakravarthi, Bharathi Raja
    Soman, K. P.
    IEEE ACCESS, 2024, 12 : 20064 - 20090