A transfer learning approach for detecting offensive and hate speech on social media platforms

Cited by: 10

Authors:

Priyadarshini, Ishaani [1]

Sahu, Sandipan [2]

Kumar, Raghvendra [3]

Affiliations:

[1] Univ Calif Berkeley, Sch Informat, Berkeley, CA USA

[2] Bengal Inst Technol, Dept Comp Sci & Engn, Kolkata, India

[3] GIET Univ, Dept Comp Sci & Engn, Gunupur, India

Source:

MULTIMEDIA TOOLS AND APPLICATIONS | 2023 / Vol. 82 / Issue 18

Keywords:

Hate speech; Transfer learning; Word2vec model; GloVe model; LSTM

DOI:

10.1007/s11042-023-14481-3

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology]

Discipline classification code:

0812

Abstract:

Over the last few decades, the expansion of technology and the internet has led to a proliferation of social media users, with a simultaneous increase in hate speech. A critical concern is that hate speech not only ignites violence and spreads hatred, but its detection also requires considerable computing resources and content monitoring by human experts and algorithms. Although this is an active research area and several artificial intelligence techniques have been proposed to address the concern, the growing volume of generated content, now measured in petabytes, calls for methods that exhibit improved performance and reduced model development time. We propose a transfer learning approach for detecting hate and offensive speech on social media that deploys a pre-trained model for data analysis, thereby promoting model reusability. We propose two transfer learning models, namely Google's Word2vec model with LSTM and the GloVe model with LSTM, and compare their performance against unigram and bigram language models for Naive Bayes (NB), Decision Trees (DT), and Support Vector Machines (SVM), which serve as the baseline algorithms. The performance of the proposed models in classifying hate speech, offensive speech, and neutral speech is validated using metrics such as precision, recall, F1 score, and support. The overall performance of the models across multiple datasets is evaluated with respect to accuracy. In-depth experimental analysis shows that the proposed models are robust for detecting hateful and offensive speech and perform better than the considered baseline algorithms.
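The abstract names precision, recall, F1 score, and support as the per-class validation metrics, and accuracy as the overall metric. The sketch below shows how these quantities can be computed for a three-class problem (hate, offensive, neutral). It is an illustration only, not the authors' evaluation code: the function names per_class_report and accuracy, the label strings, and the toy y_true / y_pred lists are all assumptions made for this example and do not come from the paper.

```python
from collections import Counter

def per_class_report(y_true, y_pred, labels):
    """Return {label: (precision, recall, f1, support)} for each label."""
    tp, fp, fn = Counter(), Counter(), Counter()
    for t, p in zip(y_true, y_pred):
        if t == p:
            tp[t] += 1          # correct prediction for class t
        else:
            fp[p] += 1          # p was predicted but is wrong
            fn[t] += 1          # t was the true class but was missed
    report = {}
    for label in labels:
        predicted = tp[label] + fp[label]   # times the label was predicted
        support = tp[label] + fn[label]     # times the label truly occurs
        precision = tp[label] / predicted if predicted else 0.0
        recall = tp[label] / support if support else 0.0
        denom = precision + recall
        f1 = 2 * precision * recall / denom if denom else 0.0
        report[label] = (precision, recall, f1, support)
    return report

def accuracy(y_true, y_pred):
    """Fraction of examples whose predicted label equals the true label."""
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)

# Hypothetical toy labels, invented for illustration (not data from the paper).
LABELS = ["hate", "offensive", "neutral"]
y_true = ["hate", "hate", "offensive", "offensive", "neutral", "neutral"]
y_pred = ["hate", "offensive", "offensive", "offensive", "neutral", "hate"]

report = per_class_report(y_true, y_pred, LABELS)
for label in LABELS:
    precision, recall, f1, support = report[label]
    print(f"{label:10s} P={precision:.2f} R={recall:.2f} F1={f1:.2f} n={support}")
print(f"accuracy = {accuracy(y_true, y_pred):.2f}")
```

Support is simply the number of true examples of each class, which is why it is reported alongside the other three metrics: a high F1 score on a class with very small support carries less weight than the same score on a well-represented class.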


Pages: 27473-27499

Page count: 27
