Towards safer online communities: Deep learning and explainable AI for hate speech detection and classification

被引：9

作者：

Kibriya, Hareem ^{[1
]}

Siddiqa, Ayesha ^{[1
]}

Khan, Wazir Zada ^{[1
]}

Khan, Muhammad Khurram ^{[2
]}

机构：

[1] Univ Wah, Dept Comp Sci, Wah Cantt 47040, Pakistan

[2] King Saud Univ, Ctr Excellence Informat Assurance, Riyadh 11451, Saudi Arabia

来源：

COMPUTERS & ELECTRICAL ENGINEERING | 2024年 / 116卷

关键词：

Hate speech detection; Social media; Deep learning; Explainable Artificial Intelligence; Machine learning; Toxic comments; Hate speech;

D O I：

10.1016/j.compeleceng.2024.109153

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

The internet and social media facilitate widespread idea sharing but also contribute to cybercrimes and harmful behaviors, notably the dissemination of abusive and hateful speech, which poses a significant threat to societal cohesion. Hence, prompt and accurate detection of such harmful content is crucial. To address this issue, our study introduces a fully automated end-toend model for hate speech detection and classification using Natural Language Processing and Deep Learning techniques. The proposed architecture comprising embedding, Convolutional, bidirectional Recurrent Neural Network, and bidirectional Long Short Term Memory layers, achieved the highest accuracy of 98.5%. Additionally, we employ explainable AI techniques, such as SHapley Additive exPlanations (SHAP) and Local Interpretable Model-agnostic Explanations (LIME), to gain insights into the performance of the proposed framework. This comprehensive approach meets the pressing demand for swift and precise detection and categorization of harmful online content.

引用

页数：15

共 24 条

[1]

abalegalfactcheck, 2023, Hate speech - ABA legal fact check - American bar association

[2] Social media content classification and community detection using deep learning and graph analytics [J].

Ali, Mohsan ;

Hassan, Mehdi ;

Kifayat, Kashif ;

Kim, Jin Young ;

Hakak, Saqib ;

Khan, Muhammad Khurram .

TECHNOLOGICAL FORECASTING AND SOCIAL CHANGE, 2023, 188

[3]

Andraz Pelicon, 2019, P 13 INT WORKSHOP SE, P604

[4]

[Anonymous], 2023, Introduction to recurrent neural network - GeeksforGeeks

[5]

[Anonymous], 2023, Supplemental 2021 hate crime statistics

[6] Model-Agnostic Meta-Learning for Multilingual Hate Speech Detection [J].

Awal, Md Rabiul ;

Lee, Roy Ka-Wei ;

Tanwar, Eshaan ;

Garg, Tanmay ;

Chakraborty, Tanmoy .

IEEE TRANSACTIONS ON COMPUTATIONAL SOCIAL SYSTEMS, 2024, 11 (01) :1086-1095

[7]

Basile Valerio, 2019, P 13 INT WORKSH SEM, P54, DOI [DOI 10.18653/V1/S19-2007, DOI 10.18653/V1/S19-2007.HTTPS://ACLANTHOLOGY.ORG/S19-2007]

[8]

Davidson T., 2017, P 11 INT AAAI C WEB, V11, DOI DOI 10.1609/ICWSM.V11I1.14955

[9] Attentional Multi-Channel Convolution With Bidirectional LSTM Cell Toward Hate Speech Prediction [J].

Fazil, Mohd ;

Khan, Shakir ;

Albahlal, Bader M. M. ;

Alotaibi, Reemiah Muneer ;

Siddiqui, Tamanna ;

Shah, Mohd Asif .

IEEE ACCESS, 2023, 11 :16801-16811

[10] Improving hate speech detection using Cross-Lingual Learning [J].

Firmino, Anderson Almeida ;

Baptista, Claudio de Souza ;

de Paiva, Anselmo Cardoso .

EXPERT SYSTEMS WITH APPLICATIONS, 2024, 235

← 1 2 3 →