Classifying Offensive Speech of Bangla Text and Analysis Using Explainable AI

被引:6
|
作者
Aporna, Amena Akter [1 ]
Azad, Istinub [1 ]
Amlan, Nibraj Safwan [1 ]
Mehedi, Md Humaion Kabir [1 ]
Mahbub, Mohammed Julfikar Ali [1 ]
Rasel, Annajiat Alim [1 ]
机构
[1] Brac Univ, Dept Comp Sci & Engn, 66 Mohakhali, Dhaka 1212, Bangladesh
来源
ADVANCES IN COMPUTING AND DATA SCIENCES (ICACDS 2022), PT I | 2022年 / 1613卷
关键词
Bangla offensive speech classification; Explainable AI; NLP; CNN; DNN;
D O I
10.1007/978-3-031-12638-3_12
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The rapid rise of social networking websites and blogging sites not only provides freedom of expression or speech, but also allows people to express society-prohibited behaviors such as online harassment and cyberbullying, which are known as offensive speech or hate speech. Despite the fact that various research work has been done on detecting hate or abusive speech on social networking websites in the English language, the opportunities for research for detecting offensive or abusive speech in the Bengali language remain open due to the computational resource constraints or the lack of standard-labeled datasets for accurate or effective Natural Language Processing (NLP) of Bangla language. In this paper, an Explainable AI approach is used for analysis as well as for detecting offensive comments or speech in the Bengali language is proposed. Moreover, Convolutional Neural Network (CNN) model is used to extract and classify features. Since the Neural Network is time-consuming for extracting features from the dataset, our proposed approach allows people to save time and effort. In the dataset, we classified all user's comments from social media comment sections into four categories: religious, personal, geopolitical, and political. Our proposed model successfully detects Bangla offensive speeches from the dataset (Bengali Hate Speech Dataset) by evaluating Machine Learning algorithms like linear and tree-based models and Neural Networks like CNN, Bi-LSTM, Conv-LSTM, and SVM models. Moreover, we calculate scores for completeness and sufficiency to assess the quality of explanations in terms of fidelity, achieving the results with the accuracy of 78% score, significantly outperforming ML and DNN baselines.
引用
收藏
页码:133 / 144
页数:12
相关论文
共 50 条
  • [1] Interpretable Bangla Sarcasm Detection using BERT and Explainable AI
    Anan, Ramisa
    Apon, Tasnim Sakib
    Hossain, Zeba Tahsin
    Modhu, Elizabeth Antora
    Mondal, Sudipta
    Alam, Md. Golam Rabiul
    2023 IEEE 13TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE, CCWC, 2023, : 1280 - 1286
  • [2] Speech Synthesis for Bangla Text to Speech Conversion
    Arafat, Mohammad Yasir
    Fahrin, Sanjana
    Islam, Md. Jamirul
    Siddiquee, Md. Ashraf
    Khan, Afsana
    Kotwal, Mohammed Rokibul Alam
    Huda, Mohammad Nurul
    8TH INTERNATIONAL CONFERENCE ON SOFTWARE, KNOWLEDGE, INFORMATION MANAGEMENT AND APPLICATIONS (SKIMA 2014), 2014,
  • [3] BAAD: A multipurpose dataset for automatic Bangla offensive speech recognition
    Hossain, Md. Fahad
    Supto, Md. Al Abid
    Chowdhury, Zannat
    Chowdhury, Hana Sultan
    Abujar, Sheikh
    DATA IN BRIEF, 2023, 48
  • [4] Bangla text normalization for text-to-speech synthesizer using machine learning algorithms
    Islam, Md. Rezaul
    Ahmad, Arif
    Rahman, Mohammad Shahidur
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2024, 36 (01)
  • [5] Detection of Hate and Offensive Speech in Text
    Wani, Abid Hussain
    Molvi, Nahida Shafi
    Ashraf, Sheikh Ishrah
    INTELLIGENT HUMAN COMPUTER INTERACTION (IHCI 2019), 2020, 11886 : 87 - 93
  • [6] Phoneme based Bangla Text to Speech Conversion
    Uddin, Mir Ashraf
    Sakib, Nazmus
    Rupu, Esrat Farjana
    Hossain, Md. Afzal
    Huda, Md. Nurul
    2015 18th International Conference on Computer and Information Technology (ICCIT), 2015, : 531 - 533
  • [7] Sentiment Analysis on Bangla and Romanized Bangla Text using Deep Recurrent Models
    Hassan, Asif
    Amin, Mohammad Rashedul
    Al Azad, Abul Kalam
    Mohammed, Nabeel
    2016 INTERNATIONAL WORKSHOP ON COMPUTATIONAL INTELLIGENCE (IWCI), 2016, : 51 - 56
  • [8] Hate Speech Detection in Audio Using SHAP - An Explainable AI
    Imbwaga, Joan L.
    Chittaragi, Nagaratna B.
    Koolagudi, Shashidhar G.
    ADVANCED NETWORK TECHNOLOGIES AND INTELLIGENT COMPUTING, ANTIC 2023, PT II, 2024, 2091 : 289 - 304
  • [9] Duration Modeling for Bangla Text to Speech Synthesis System
    Roy, Rajib
    Basu, Tulika
    Saha, Arup
    Basu, Joyanta
    Das Mandal, Shyamal Kr
    RECENT ADVANCES OF ASIAN LANGUAGE PROCESSING TECHNOLOGIES, 2008, : 199 - 204
  • [10] Text normalization and diphone preparation for Bangla speech synthesis
    Rashid M.M.
    Hussain A.
    Rahman M.S.
    Journal of Multimedia, 2010, 5 (06): : 551 - 559