Abusive Comment Detection from Bengali-English Code-Mixed Social Media Texts Using Ensemble of Deep Learning

被引：0

作者：

Fahim, Iftekhar ^{[1
]}

Ahsan, Shawly ^{[1
]}

Hoque, Mohammed Moshiul ^{[1
]}

机构：

[1] Chittagong Univ Engn & Technol, Chattogram 4349, Bangladesh

来源：

ARTIFICIAL INTELLIGENCE AND KNOWLEDGE PROCESSING, AIKP 2024 | 2025年 / 2228卷

关键词：

Natural language processing; Code-mixing; Deep learning; Text processing; Abusive content detection; AGREEMENT;

D O I：

10.1007/978-3-031-73477-9_18

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Code-mixing, which involves seamlessly combining multiple languages within a single text, has become increasingly common on social media platforms. The pervasiveness of aggressive content and offensive language on social media presents significant challenges, necessitating the development of automatic detection methods. This problem becomes more complex when dealing with code-mixed text owing to the cultural nuances of different languages. Although efforts to identify abusive content in code-mixed text have primarily concentrated on high-resource languages, research on resource-constrained languages, such as Bengali mixed with English, still needs to be completed. Some studies have aimed at detecting abusive content in transliterated Bengali texts. However, there is a notable absence of research addressing the detection of abusive content in Bengali-English code-mixed texts. To address this gap, this paper presents a custom-built Bengali-English code-mixed dataset containing 2700 annotated comments categorized as abusive and non-abusive. To facilitate research in this area, this work proposes an ensemble of deep learning (DL) models: CNN (using GloVe embeddings), LSTM (implemented with Keras), and BiLSTM (utilizing FastText embeddings). The ensemble approach attained the most elevated weighted f1-score of 0.81. This research aims to tackle the growing issue of abusive content in code-mixed data, creating safer and more inclusive online environments.

引用

页码：252 / 267

页数：16

共 50 条

[31] Sinhala Hate Speech Detection in Social Media Using Machine Learning and Deep Learning
Fernando, W. S. S.
Weerasinghe, Ruvan
Bandara, E. R. A. D.
2022 22ND INTERNATIONAL CONFERENCE ON ADVANCES IN ICT FOR EMERGING REGIONS (ICTER), 2022,
[32] Sentiment Analysis of Code-mixed Social Media Data on Philippine UAQTE using Fine-tuned mBERT Model
Maceda, Lany L.
Satuito, Arlene A.
Abisado, Mideth B.
INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (07) : 706 - 711
[33] Detecting Traffic Information From Social Media Texts With Deep Learning Approaches
Chen, Yuanyuan
Lv, Yisheng
Wang, Xiao
Li, Lingxi
Wang, Fei-Yue
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2019, 20 (08) : 3049 - 3058
[34] An ensemble deep learning technique for detecting suicidal ideation from posts in social media platforms
Renjith, Shini
Abraham, Annie
Jyothi, Surya B.
Chandran, Lekshmi
Thomson, Jincy
JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2022, 34 (10) : 9564 - 9575
[35] Comparative Analysis of Social Media Hate Detection over Code Mixed Hindi-English Language
Pareek, Kapil
Choudhary, Arjun
Tripathi, Ashish
Mishra, K. K.
ADVANCES IN DATA AND INFORMATION SCIENCES, 2022, 318 : 551 - 561
[36] Cyberbullying Detection on Social Media Using Stacking Ensemble Learning and Enhanced BERT
Muneer, Amgad
Alwadain, Ayed
Ragab, Mohammed Gamal
Alqushaibi, Alawi
INFORMATION, 2023, 14 (08)
[37] Ensemble Deep Learning on Time-Series Representation of Tweets for Rumor Detection in Social Media
Kotteti, Chandra Mouli Madhav
Dong, Xishuang
Qian, Lijun
APPLIED SCIENCES-BASEL, 2020, 10 (21): : 1 - 21
[38] Running a Sustainable Social Media Business: The Use of Deep Learning Methods in Online-Comment Short Texts
Lin, Weibin
Zhang, Qian
Wu, Yenchun Jim
Chen, Tsung-Chun
SUSTAINABILITY, 2023, 15 (11)
[39] Detection of Suicide Ideation in Social Media Forums Using Deep Learning
Tadesse, Michael Mesfin
Lin, Hongfei
Xu, Bo
Yang, Liang
ALGORITHMS, 2020, 13 (01)
[40] Towards Explainability in Using Deep Learning for the Detection of Anorexia in Social Media
Amini, Hessam
Kosseim, Leila
NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS (NLDB 2020), 2020, 12089 : 225 - 235

← 1 2 3 4 5 →