Cyberbullying Detection using BERT for Telugu Language

被引:0
|
作者
Talasila, Sri Lakshmi [1 ]
Kothuri, Dharani Priya [1 ]
Manchiraju, Savithri Jahnavi [1 ]
Mallavalli, Mutyala Sai Sasank [1 ]
Dande, Lourdu Gnana Harshith [1 ]
机构
[1] Prasad V Potluri Siddhartha Inst Technol, Comp Sci & Engn, Vijayawada, India
来源
2024 4TH INTERNATIONAL CONFERENCE ON PERVASIVE COMPUTING AND SOCIAL NETWORKING, ICPCSN 2024 | 2024年
关键词
Cyberbullying; Telugu; Bidirectional Encoder Representations from Transformers (BERT); Bullying Preprocessing; Harassment; Language; Social Media;
D O I
10.1109/ICPCSN62568.2024.00077
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The rapid proliferation of online communication has introduced cyberbullying as a significant concern affecting individuals' well-being. Existing research employs various techniques like Tf-Idf, XLM-RoBERTa, and machine learning algorithms such as Logistic Regression, Random Forest, and Naive Bayes to detect cyberbullying across mixed and bilingual languages. However, these approaches often struggle with accuracy and fail to effectively discern cyberbullying instances due to language nuances and context misinterpretation. Key challenges faced by previous systems include limited linguistic coverage, contextual understanding, and nuanced interpretation of cyberbullying. The new advancement to address these challenges is the implementation of BERT (Bidirectional Encoder Representations from Transformers) architecture by leveraging bidirectional context understanding, allowing it to capture subtle linguistic nuances and contextual cues, thereby improving accuracy and contextual understanding. The proposed model is advancing further by integrating specialized models like IndicBERT, specifically tailored for languages like Telugu. By focusing on contextual nuances, our model aims to improve precision and accuracy of cyberbullying detection for a local language, Telugu content. This study has developed a local language, Telugu dataset comprising 27,000 sentences and achieve an accuracy rate of 90%, highlighting the efficacy of our approach in overcoming these challenges and contributing to online safety.
引用
收藏
页码:454 / 461
页数:8
相关论文
共 50 条
  • [31] Cyberbullying Detection
    Haidar, Batoul
    Chamoun, Maroun
    Yamout, Fadi
    UKSIM-AMSS 10TH EUROPEAN MODELLING SYMPOSIUM ON COMPUTER MODELLING AND SIMULATION (EMS), 2016, : 161 - 171
  • [32] A HIERARCHICAL APPROACH FOR TIMELY CYBERBULLYING DETECTION
    Nazar, Imara
    Zois, Daphney-Stavroula
    Yao, Mengfan
    2019 IEEE DATA SCIENCE WORKSHOP (DSW), 2019, : 190 - 195
  • [33] Cyberbullying detection: an ensemble learning approach
    Roy, Pradeep Kumar
    Singh, Ashish
    Tripathy, Asis Kumar
    Das, Tapan Kumar
    INTERNATIONAL JOURNAL OF COMPUTATIONAL SCIENCE AND ENGINEERING, 2022, 25 (03) : 315 - 324
  • [34] Cyberbullying Detection and Classification Using Information Retrieval Algorithm
    Nandhini, B. Sri
    Sheeba, J., I
    ICARCSET'15: PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON ADVANCED RESEARCH IN COMPUTER SCIENCE ENGINEERING & TECHNOLOGY (ICARCSET - 2015), 2015,
  • [35] Classification of Cyberbullying Sinhala Language Comments on Social Media
    Amali, H. M. A. Ishara
    Jayalal, Shantha
    MERCON 2020: 6TH INTERNATIONAL MULTIDISCIPLINARY MORATUWA ENGINEERING RESEARCH CONFERENCE (MERCON), 2020, : 266 - 271
  • [36] Using Fuzzy Fingerprints for Cyberbullying Detection in Social Networks
    Rosa, Hugo
    Carvalho, Joao P.
    Calado, Pavel
    Martins, Bruno
    Ribeiro, Ricardo
    Coheur, Luisa
    2018 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE), 2018,
  • [37] Automatic Detection of Cyberbullying and Abusive Language in Arabic Content on Social Networks: A Survey
    Khairy, Marwa
    Mahmoud, Tarek M.
    Abd-El-Hafeez, Tarek
    AI IN COMPUTATIONAL LINGUISTICS, 2021, 189 : 156 - 166
  • [38] Comparative performance of ensemble machine learning for Arabic cyberbullying and offensive language detection
    Khairy, Marwa
    Mahmoud, Tarek M. M.
    Omar, Ahmed
    Abd El-Hafeez, Tarek
    LANGUAGE RESOURCES AND EVALUATION, 2024, 58 (02) : 695 - 712
  • [39] Cyberbullying detection from tweets using deep learning
    Bharti, Shubham
    Yadav, Arun Kumar
    Kumar, Mohit
    Yadav, Divakar
    KYBERNETES, 2022, 51 (09) : 2695 - 2711
  • [40] Cyberbullying Detection by Using Artificial Neural Network Models
    Bozyigit, Alican
    Utku, Semih
    Nasiboglu, Efendi
    2019 4TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ENGINEERING (UBMK), 2019, : 520 - 524