Classifying Offensive Speech of Bangla Text and Analysis Using Explainable AI

Cited by: 6

Authors:

Aporna, Amena Akter [1]

Azad, Istinub [1]

Amlan, Nibraj Safwan [1]

Mehedi, Md Humaion Kabir [1]

Mahbub, Mohammed Julfikar Ali [1]

Rasel, Annajiat Alim [1]

Affiliations:

[1] Brac Univ, Dept Comp Sci & Engn, 66 Mohakhali, Dhaka 1212, Bangladesh

Source:

ADVANCES IN COMPUTING AND DATA SCIENCES (ICACDS 2022), PT I | 2022 / Vol. 1613

Keywords:

Bangla offensive speech classification; Explainable AI; NLP; CNN; DNN

DOI:

10.1007/978-3-031-12638-3_12

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory]

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

The rapid rise of social networking and blogging sites not only provides freedom of expression, but also enables socially prohibited behaviors such as online harassment and cyberbullying, commonly referred to as offensive speech or hate speech. Although considerable research has addressed the detection of hate or abusive speech in English on social networking websites, detecting offensive or abusive speech in Bengali remains an open research problem, owing to computational resource constraints and the lack of standard labeled datasets for accurate and effective Natural Language Processing (NLP) of the Bangla language. In this paper, an Explainable AI approach for analyzing and detecting offensive comments or speech in the Bengali language is proposed. Moreover, a Convolutional Neural Network (CNN) model is used to extract and classify features. Since extracting features from the dataset with a Neural Network is time-consuming, our proposed approach allows people to save time and effort. In the dataset, we classified users' comments from social media comment sections into four categories: religious, personal, geopolitical, and political. Our proposed model successfully detects Bangla offensive speech in the dataset (Bengali Hate Speech Dataset) by evaluating Machine Learning algorithms such as linear, tree-based, and SVM models and Neural Networks such as CNN, Bi-LSTM, and Conv-LSTM. Moreover, we calculate completeness and sufficiency scores to assess the quality of explanations in terms of fidelity, achieving an accuracy of 78% and significantly outperforming ML and DNN baselines.

Pages: 133-144

Number of pages: 12
