Data Augmentation for Improving Explainability of Hate Speech Detection

Cited by: 0

Authors:

Ansari, Gunjan [1]

Kaur, Parmeet [2]

Saxena, Chandni [3]

Affiliations:

[1] JSS Acad Tech Educ, Dept Informat Technol, Noida, India

[2] Jaypee Inst Informat Technol, Dept Comp Sci & Informat Technol, Noida, India

[3] Chinese Univ Hong Kong, SAR, Hong Kong, Peoples R China

Source:

ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING | 2024 / Vol. 49 / Issue 03

Keywords:

Hate speech; Cyberbullying; Explainable AI; Data augmentation; LIME; Integrated gradient

DOI:

10.1007/s13369-023-08100-4

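Chinese Library Classification (CLC):

O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences]

Subject classification codes:

07; 0710; 09

Abstract: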

The paper presents a novel data augmentation-based approach to develop explainable, deep learning models for hate speech detection. Hate speech is widely prevalent on online social media but difficult to detect automatically due to challenges of natural language processing and complexity of hate speech. Further, the decisions of the existing solutions possess constrained explainability since limited annotated data are available for training and testing of models. Therefore, this work proposes the use of text-based data augmentation for improving the performance and explainability of deep learning models. Techniques based on easy data augmentation, bidirectional encoder representations from transformers and back translation have been utilized for data augmentation. Convolutional neural networks and long short-term memory models are trained with augmented data and evaluated on two publicly available datasets for hate speech detection. Methods of LIME and integrated gradients are used to retrieve explanations of the deep learning models. A diagnostic study is conducted on test samples to check for improvement in the models as a result of the data augmentation. The experimental results verify that the proposed approach improves the explainability as well as the accuracy of hate speech detection.
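As an illustration of the easy data augmentation family named in the abstract, the following is a minimal sketch of two of its token-level operations, random swap and random deletion. It is not the authors' implementation: the function names, parameter defaults, and whitespace tokenization are assumptions made for this example, and the synonym-based operations are omitted because they need an external lexical resource.

```python
import random


def random_swap(tokens, n_swaps, rng):
    """Return a copy of tokens with n_swaps random position pairs exchanged."""
    out = list(tokens)
    if len(out) < 2:
        return out
    for _ in range(n_swaps):
        i, j = rng.sample(range(len(out)), 2)
        out[i], out[j] = out[j], out[i]
    return out


def random_deletion(tokens, p, rng):
    """Drop each token independently with probability p, keeping at least one."""
    if len(tokens) <= 1:
        return list(tokens)
    kept = [t for t in tokens if rng.random() >= p]
    return kept if kept else [rng.choice(tokens)]


def augment(text, n_aug=4, p_delete=0.1, n_swaps=1, seed=0):
    """Produce n_aug variants of text, alternating swap and deletion.

    All defaults here are illustrative assumptions, not values from the paper.
    """
    rng = random.Random(seed)
    tokens = text.split()  # naive whitespace tokenization (assumption)
    variants = []
    for k in range(n_aug):
        if k % 2 == 0:
            new_tokens = random_swap(tokens, n_swaps, rng)
        else:
            new_tokens = random_deletion(tokens, p_delete, rng)
        variants.append(" ".join(new_tokens))
    return variants
```

Each variant would keep the label of the sentence it was derived from, so a call such as `augment("this is a short example sentence")` yields extra training samples for the same class. Operations like these can change or remove the words that carry the label, which is one reason to keep the deletion probability and the number of swaps small.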

Pages: 3609-3621

Page count: 13
