Cyberbullying Detection using BERT for Telugu Language

被引：0

作者：

Talasila, Sri Lakshmi ^{[1
]}

Kothuri, Dharani Priya ^{[1
]}

Manchiraju, Savithri Jahnavi ^{[1
]}

Mallavalli, Mutyala Sai Sasank ^{[1
]}

Dande, Lourdu Gnana Harshith ^{[1
]}

机构：

[1] Prasad V Potluri Siddhartha Inst Technol, Comp Sci & Engn, Vijayawada, India

来源：

2024 4TH INTERNATIONAL CONFERENCE ON PERVASIVE COMPUTING AND SOCIAL NETWORKING, ICPCSN 2024 | 2024年

关键词：

Cyberbullying; Telugu; Bidirectional Encoder Representations from Transformers (BERT); Bullying Preprocessing; Harassment; Language; Social Media;

D O I：

10.1109/ICPCSN62568.2024.00077

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The rapid proliferation of online communication has introduced cyberbullying as a significant concern affecting individuals' well-being. Existing research employs various techniques like Tf-Idf, XLM-RoBERTa, and machine learning algorithms such as Logistic Regression, Random Forest, and Naive Bayes to detect cyberbullying across mixed and bilingual languages. However, these approaches often struggle with accuracy and fail to effectively discern cyberbullying instances due to language nuances and context misinterpretation. Key challenges faced by previous systems include limited linguistic coverage, contextual understanding, and nuanced interpretation of cyberbullying. The new advancement to address these challenges is the implementation of BERT (Bidirectional Encoder Representations from Transformers) architecture by leveraging bidirectional context understanding, allowing it to capture subtle linguistic nuances and contextual cues, thereby improving accuracy and contextual understanding. The proposed model is advancing further by integrating specialized models like IndicBERT, specifically tailored for languages like Telugu. By focusing on contextual nuances, our model aims to improve precision and accuracy of cyberbullying detection for a local language, Telugu content. This study has developed a local language, Telugu dataset comprising 27,000 sentences and achieve an accuracy rate of 90%, highlighting the efficacy of our approach in overcoming these challenges and contributing to online safety.

引用

页码：454 / 461

页数：8

共 50 条

[31] Cyberbullying Detection
Haidar, Batoul
Chamoun, Maroun
Yamout, Fadi
UKSIM-AMSS 10TH EUROPEAN MODELLING SYMPOSIUM ON COMPUTER MODELLING AND SIMULATION (EMS), 2016, : 161 - 171
[32] A HIERARCHICAL APPROACH FOR TIMELY CYBERBULLYING DETECTION
Nazar, Imara
Zois, Daphney-Stavroula
Yao, Mengfan
2019 IEEE DATA SCIENCE WORKSHOP (DSW), 2019, : 190 - 195
[33] Cyberbullying detection: an ensemble learning approach
Roy, Pradeep Kumar
Singh, Ashish
Tripathy, Asis Kumar
Das, Tapan Kumar
INTERNATIONAL JOURNAL OF COMPUTATIONAL SCIENCE AND ENGINEERING, 2022, 25 (03) : 315 - 324
[34] Cyberbullying Detection and Classification Using Information Retrieval Algorithm
Nandhini, B. Sri
Sheeba, J., I
ICARCSET'15: PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON ADVANCED RESEARCH IN COMPUTER SCIENCE ENGINEERING & TECHNOLOGY (ICARCSET - 2015), 2015,
[35] Classification of Cyberbullying Sinhala Language Comments on Social Media
Amali, H. M. A. Ishara
Jayalal, Shantha
MERCON 2020: 6TH INTERNATIONAL MULTIDISCIPLINARY MORATUWA ENGINEERING RESEARCH CONFERENCE (MERCON), 2020, : 266 - 271
[36] Using Fuzzy Fingerprints for Cyberbullying Detection in Social Networks
Rosa, Hugo
Carvalho, Joao P.
Calado, Pavel
Martins, Bruno
Ribeiro, Ricardo
Coheur, Luisa
2018 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE), 2018,
[37] Automatic Detection of Cyberbullying and Abusive Language in Arabic Content on Social Networks: A Survey
Khairy, Marwa
Mahmoud, Tarek M.
Abd-El-Hafeez, Tarek
AI IN COMPUTATIONAL LINGUISTICS, 2021, 189 : 156 - 166
[38] Comparative performance of ensemble machine learning for Arabic cyberbullying and offensive language detection
Khairy, Marwa
Mahmoud, Tarek M. M.
Omar, Ahmed
Abd El-Hafeez, Tarek
LANGUAGE RESOURCES AND EVALUATION, 2024, 58 (02) : 695 - 712
[39] Cyberbullying detection from tweets using deep learning
Bharti, Shubham
Yadav, Arun Kumar
Kumar, Mohit
Yadav, Divakar
KYBERNETES, 2022, 51 (09) : 2695 - 2711
[40] Cyberbullying Detection by Using Artificial Neural Network Models
Bozyigit, Alican
Utku, Semih
Nasiboglu, Efendi
2019 4TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ENGINEERING (UBMK), 2019, : 520 - 524

← 1 2 3 4 5 →