Cyberbullying Detection using BERT for Telugu Language

被引:0
|
作者
Talasila, Sri Lakshmi [1 ]
Kothuri, Dharani Priya [1 ]
Manchiraju, Savithri Jahnavi [1 ]
Mallavalli, Mutyala Sai Sasank [1 ]
Dande, Lourdu Gnana Harshith [1 ]
机构
[1] Prasad V Potluri Siddhartha Inst Technol, Comp Sci & Engn, Vijayawada, India
来源
2024 4TH INTERNATIONAL CONFERENCE ON PERVASIVE COMPUTING AND SOCIAL NETWORKING, ICPCSN 2024 | 2024年
关键词
Cyberbullying; Telugu; Bidirectional Encoder Representations from Transformers (BERT); Bullying Preprocessing; Harassment; Language; Social Media;
D O I
10.1109/ICPCSN62568.2024.00077
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The rapid proliferation of online communication has introduced cyberbullying as a significant concern affecting individuals' well-being. Existing research employs various techniques like Tf-Idf, XLM-RoBERTa, and machine learning algorithms such as Logistic Regression, Random Forest, and Naive Bayes to detect cyberbullying across mixed and bilingual languages. However, these approaches often struggle with accuracy and fail to effectively discern cyberbullying instances due to language nuances and context misinterpretation. Key challenges faced by previous systems include limited linguistic coverage, contextual understanding, and nuanced interpretation of cyberbullying. The new advancement to address these challenges is the implementation of BERT (Bidirectional Encoder Representations from Transformers) architecture by leveraging bidirectional context understanding, allowing it to capture subtle linguistic nuances and contextual cues, thereby improving accuracy and contextual understanding. The proposed model is advancing further by integrating specialized models like IndicBERT, specifically tailored for languages like Telugu. By focusing on contextual nuances, our model aims to improve precision and accuracy of cyberbullying detection for a local language, Telugu content. This study has developed a local language, Telugu dataset comprising 27,000 sentences and achieve an accuracy rate of 90%, highlighting the efficacy of our approach in overcoming these challenges and contributing to online safety.
引用
收藏
页码:454 / 461
页数:8
相关论文
共 50 条
  • [1] Telugu named entity recognition using bert
    Gorla, SaiKiranmai
    Tangeda, Sai Sharan
    Neti, Lalita Bhanu Murthy
    Malapati, Aruna
    INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS, 2022, 14 (02) : 127 - 140
  • [2] Telugu named entity recognition using bert
    SaiKiranmai Gorla
    Sai Sharan Tangeda
    Lalita Bhanu Murthy Neti
    Aruna Malapati
    International Journal of Data Science and Analytics, 2022, 14 : 127 - 140
  • [3] Cyberbullying Detection Using Bidirectional Encoder Representations from Transformers (BERT)
    Sujud, Razan
    Fahs, Walid
    Khatoun, Rida
    Chbib, Fadlallah
    2024 IEEE INTERNATIONAL MEDITERRANEAN CONFERENCE ON COMMUNICATIONS AND NETWORKING, MEDITCOM 2024, 2024, : 257 - 262
  • [4] CyberBERT: BERT for cyberbullying identification BERT for cyberbullying identification
    Paul, Sayanta
    Saha, Sriparna
    MULTIMEDIA SYSTEMS, 2022, 28 (06) : 1897 - 1904
  • [5] Cyberbullying Detection on Social Media Using Stacking Ensemble Learning and Enhanced BERT
    Muneer, Amgad
    Alwadain, Ayed
    Ragab, Mohammed Gamal
    Alqushaibi, Alawi
    INFORMATION, 2023, 14 (08)
  • [6] CyberBERT: BERT for cyberbullying identificationBERT for cyberbullying identification
    Sayanta Paul
    Sriparna Saha
    Multimedia Systems, 2022, 28 : 1897 - 1904
  • [7] Does BERT Pay Attention to Cyberbullying?
    Elsafoury, Fatma
    Katsigiannis, Stamos
    Wilson, Steven R.
    Ramzan, Naeem
    SIGIR '21 - PROCEEDINGS OF THE 44TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2021, : 1900 - 1904
  • [8] Cyberbullying detection system focusing on the isiXhosa language
    Matomela, Vuyokazi
    Henney, Andre J.
    2022 CONFERENCE ON INFORMATION COMMUNICATIONS TECHNOLOGY AND SOCIETY (ICTAS), 2022, : 93 - 98
  • [9] Automatic detection of cyberbullying behaviour on social media using Stacked Bi-Gru attention with BERT model
    Mali, Mohan K.
    Pawar, Ranjeet R.
    Shinde, Sandeep A.
    Kale, Satish D.
    Mulik, Sameer, V
    Jagtap, Asmita A.
    Tambewagh, Pratibha A.
    Rajput, Punam U.
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 262
  • [10] Enhancing cyberbullying detection: a comparative study of ensemble CNN-SVM and BERT models
    Saini, Hiteshi
    Mehra, Himashri
    Rani, Ritu
    Jaiswal, Garima
    Sharma, Arun
    Dev, Amita
    SOCIAL NETWORK ANALYSIS AND MINING, 2023, 14 (01)