Addressing cyberbullying in Urdu tweets: a comprehensive dataset and detection system

被引:0
|
作者
Adeeba F. [1 ]
Yousuf M.I. [1 ]
Anwer I. [2 ]
Tariq S.U. [1 ]
Ashfaq A. [1 ]
Naqeeb M. [1 ]
机构
[1] Department of Computer Science, University of Engineering and Technology Lahore, Punjab, Lahore
[2] Department of Transportation Engineering and Management, University of Engineering and Technology Lahore, Punjab, Lahore
关键词
Artificial Intelligence; Cyberbullying annotation guidelines; Natural Language and Speech; Network Science and Online Social Networks; Sentiment Analysis; Text Mining; Urdu cyberbullying detection; Urdu sentiment analysis; Urdu tweets dataset;
D O I
10.7717/PEERJ-CS.1963
中图分类号
学科分类号
摘要
The prevalence of cyberbullying has reached an alarming rate, affecting approximately 54% of teenagers who experience various forms of cyberbullying, including offensive hate speech, threats, and racism. This research introduces a comprehensive dataset and system for cyberbullying detection in Urdu tweets, leveraging a spectrum of machine learning approaches including traditional models and advanced deep learning techniques. The objectives of this study are threefold. Firstly, a dataset consisting of 12,500 annotated tweets in Urdu is created, and it is made publicly available to the research community. Secondly, annotation guidelines for Urdu text with appropriate labels for cyberbullying detection are developed. Finally, a series of experiments is conducted to assess the performance of machine learning and deep learning techniques in detecting cyberbullying. The results indicate that fastText deep learning models outperform other models in cyberbullying detection. This study demonstrates its efficacy in effectively detecting and classifying cyberbullying incidents in Urdu tweets, contributing to the broader effort of creating a safer digital environment. © 2024 Adeeba et al. Distributed under Creative Commons CC-BY 4.0. All Rights Reserved.
引用
收藏
相关论文
共 14 条
  • [11] Secure Bluetooth Communication in Smart Healthcare Systems: A Novel Community Dataset and Intrusion Detection System
    Zubair, Mohammed
    Ghubaish, Ali
    Unal, Devrim
    Al-Ali, Abdulla
    Reimann, Thomas
    Alinier, Guillaume
    Hammoudeh, Mohammad
    Qadir, Junaid
    SENSORS, 2022, 22 (21)
  • [12] Frailty Insights Detection System (FIDS)-A Comprehensive and Intuitive Dashboard Using Artificial Intelligence and Web Technologies
    Ciubotaru, Bogdan-Iulian
    Sasu, Gabriel-Vasilica
    Goga, Nicolae
    Vasilateanu, Andrei
    Marin, Iuliana
    Pavaloiu, Ionel-Bujorel
    Gligore, Claudiu Teodor Ion
    APPLIED SCIENCES-BASEL, 2024, 14 (16):
  • [13] Deep learning for crescent detection and recognition: Implementation of Mask R-CNN to the observational Lunar dataset collected with the Robotic Lunar Telescope System
    Muztaba, R.
    Malasan, H. L.
    Djamal, M.
    ASTRONOMY AND COMPUTING, 2023, 45
  • [14] Quantum-Inspired Interpretable AI-Empowered Decision Support System for Detection of Early-Stage Rheumatoid Arthritis in Primary Care Using Scarce Dataset
    Rahimi, Samira Abbasgholizadeh
    Kolahdoozi, Mojtaba
    Mitra, Arka
    Salmeron, Jose L.
    Navali, Amir Mohammad
    Sadeghpour, Alireza
    Mir Mohammadi, Seyed Amir
    MATHEMATICS, 2022, 10 (03)