Data Augmentation for Improving Explainability of Hate Speech Detection

被引:0
|
作者
Ansari, Gunjan [1 ]
Kaur, Parmeet [2 ]
Saxena, Chandni [3 ]
机构
[1] JSS Acad Tech Educ, Dept Informat Technol, Noida, India
[2] Jaypee Inst Informat Technol, Dept Comp Sci & Informat Technol, Noida, India
[3] Chinese Univ Hong Kong, SAR, Hong Kong, Peoples R China
关键词
Hate speech; Cyberbullying; Explainable AI; Data augmentation; LIME; Integrated gradient;
D O I
10.1007/s13369-023-08100-4
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The paper presents a novel data augmentation-based approach to develop explainable, deep learning models for hate speech detection. Hate speech is widely prevalent on online social media but difficult to detect automatically due to challenges of natural language processing and complexity of hate speech. Further, the decisions of the existing solutions possess constrained explainability since limited annotated data are available for training and testing of models. Therefore, this work proposes the use of text-based data augmentation for improving the performance and explainability of deep learning models. Techniques based on easy data augmentation, bidirectional encoder representations from transformers and back translation have been utilized for data augmentation. Convolutional neural networks and long short-term memory models are trained with augmented data and evaluated on two publicly available datasets for hate speech detection. Methods of LIME and integrated gradients are used to retrieve explanations of the deep learning models. A diagnostic study is conducted on test samples to check for improvement in the models as a result of the data augmentation. The experimental results verify that the proposed approach improves the explainability as well as the accuracy of hate speech detection.
引用
收藏
页码:3609 / 3621
页数:13
相关论文
共 50 条
  • [31] A curated dataset for hate speech detection on social media text
    Mody, Devansh
    Huang, YiDong
    de Oliveira, Thiago Eustaquio Alves
    DATA IN BRIEF, 2023, 46
  • [32] DATA AUGMENTATION BASED ON VOWEL STRETCH FOR IMPROVING CHILDREN'S SPEECH RECOGNITION
    Nagano, Tohru
    Fukuda, Takashi
    Suzuki, Masayuki
    Kurata, Gakuto
    2019 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU 2019), 2019, : 502 - 508
  • [33] Improving Accented Speech Recognition using Data Augmentation based on Unsupervised Text-to-Speech Synthesis
    Cong-Thanh Do
    Imai, Shuhei
    Doddipatla, Rama
    Hain, Thomas
    32ND EUROPEAN SIGNAL PROCESSING CONFERENCE, EUSIPCO 2024, 2024, : 136 - 140
  • [34] Data augmentation for speech separation
    Alex, Ashish
    Wang, Lin
    Gastaldo, Paolo
    Cavallaro, Andrea
    SPEECH COMMUNICATION, 2023, 152
  • [35] Explainable hate speech detection using LIME
    Joan L. Imbwaga
    Nagaratna B. Chittaragi
    Shashidhar G. Koolagudi
    International Journal of Speech Technology, 2024, 27 (3) : 793 - 815
  • [36] The effect of gender bias on hate speech detection
    Furkan Şahinuç
    Eyup Halit Yilmaz
    Cagri Toraman
    Aykut Koç
    Signal, Image and Video Processing, 2023, 17 : 1591 - 1597
  • [37] Is hate speech detection the solution the world wants?
    Parker, Sara
    Ruths, Derek
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2023, 120 (10)
  • [38] A survey of explainable AI techniques for detection of fake news and hate speech on social media platforms
    Gongane, Vaishali U.
    Munot, Mousami V.
    Anuse, Alwin D.
    JOURNAL OF COMPUTATIONAL SOCIAL SCIENCE, 2024, 7 (01): : 587 - 623
  • [39] The effect of gender bias on hate speech detection
    Sahinuc, Furkan
    Yilmaz, Eyup Halit
    Toraman, Cagri
    Koc, Aykut
    SIGNAL IMAGE AND VIDEO PROCESSING, 2023, 17 (04) : 1591 - 1597
  • [40] A Survey on Automatic Detection of Hate Speech in Text
    Fortuna, Paula
    Nunes, Sergio
    ACM COMPUTING SURVEYS, 2018, 51 (04)