Abusive Comment Detection from Bengali-English Code-Mixed Social Media Texts Using Ensemble of Deep Learning

被引:0
|
作者
Fahim, Iftekhar [1 ]
Ahsan, Shawly [1 ]
Hoque, Mohammed Moshiul [1 ]
机构
[1] Chittagong Univ Engn & Technol, Chattogram 4349, Bangladesh
来源
ARTIFICIAL INTELLIGENCE AND KNOWLEDGE PROCESSING, AIKP 2024 | 2025年 / 2228卷
关键词
Natural language processing; Code-mixing; Deep learning; Text processing; Abusive content detection; AGREEMENT;
D O I
10.1007/978-3-031-73477-9_18
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Code-mixing, which involves seamlessly combining multiple languages within a single text, has become increasingly common on social media platforms. The pervasiveness of aggressive content and offensive language on social media presents significant challenges, necessitating the development of automatic detection methods. This problem becomes more complex when dealing with code-mixed text owing to the cultural nuances of different languages. Although efforts to identify abusive content in code-mixed text have primarily concentrated on high-resource languages, research on resource-constrained languages, such as Bengali mixed with English, still needs to be completed. Some studies have aimed at detecting abusive content in transliterated Bengali texts. However, there is a notable absence of research addressing the detection of abusive content in Bengali-English code-mixed texts. To address this gap, this paper presents a custom-built Bengali-English code-mixed dataset containing 2700 annotated comments categorized as abusive and non-abusive. To facilitate research in this area, this work proposes an ensemble of deep learning (DL) models: CNN (using GloVe embeddings), LSTM (implemented with Keras), and BiLSTM (utilizing FastText embeddings). The ensemble approach attained the most elevated weighted f1-score of 0.81. This research aims to tackle the growing issue of abusive content in code-mixed data, creating safer and more inclusive online environments.
引用
收藏
页码:252 / 267
页数:16
相关论文
共 50 条
  • [31] Sinhala Hate Speech Detection in Social Media Using Machine Learning and Deep Learning
    Fernando, W. S. S.
    Weerasinghe, Ruvan
    Bandara, E. R. A. D.
    2022 22ND INTERNATIONAL CONFERENCE ON ADVANCES IN ICT FOR EMERGING REGIONS (ICTER), 2022,
  • [32] Sentiment Analysis of Code-mixed Social Media Data on Philippine UAQTE using Fine-tuned mBERT Model
    Maceda, Lany L.
    Satuito, Arlene A.
    Abisado, Mideth B.
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2023, 14 (07) : 706 - 711
  • [33] Detecting Traffic Information From Social Media Texts With Deep Learning Approaches
    Chen, Yuanyuan
    Lv, Yisheng
    Wang, Xiao
    Li, Lingxi
    Wang, Fei-Yue
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2019, 20 (08) : 3049 - 3058
  • [34] An ensemble deep learning technique for detecting suicidal ideation from posts in social media platforms
    Renjith, Shini
    Abraham, Annie
    Jyothi, Surya B.
    Chandran, Lekshmi
    Thomson, Jincy
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2022, 34 (10) : 9564 - 9575
  • [35] Comparative Analysis of Social Media Hate Detection over Code Mixed Hindi-English Language
    Pareek, Kapil
    Choudhary, Arjun
    Tripathi, Ashish
    Mishra, K. K.
    ADVANCES IN DATA AND INFORMATION SCIENCES, 2022, 318 : 551 - 561
  • [36] Cyberbullying Detection on Social Media Using Stacking Ensemble Learning and Enhanced BERT
    Muneer, Amgad
    Alwadain, Ayed
    Ragab, Mohammed Gamal
    Alqushaibi, Alawi
    INFORMATION, 2023, 14 (08)
  • [37] Ensemble Deep Learning on Time-Series Representation of Tweets for Rumor Detection in Social Media
    Kotteti, Chandra Mouli Madhav
    Dong, Xishuang
    Qian, Lijun
    APPLIED SCIENCES-BASEL, 2020, 10 (21): : 1 - 21
  • [38] Running a Sustainable Social Media Business: The Use of Deep Learning Methods in Online-Comment Short Texts
    Lin, Weibin
    Zhang, Qian
    Wu, Yenchun Jim
    Chen, Tsung-Chun
    SUSTAINABILITY, 2023, 15 (11)
  • [39] Detection of Suicide Ideation in Social Media Forums Using Deep Learning
    Tadesse, Michael Mesfin
    Lin, Hongfei
    Xu, Bo
    Yang, Liang
    ALGORITHMS, 2020, 13 (01)
  • [40] Towards Explainability in Using Deep Learning for the Detection of Anorexia in Social Media
    Amini, Hessam
    Kosseim, Leila
    NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS (NLDB 2020), 2020, 12089 : 225 - 235