Addressing cyberbullying in Urdu tweets: a comprehensive dataset and detection system

被引:0
|
作者
Adeeba F. [1 ]
Yousuf M.I. [1 ]
Anwer I. [2 ]
Tariq S.U. [1 ]
Ashfaq A. [1 ]
Naqeeb M. [1 ]
机构
[1] Department of Computer Science, University of Engineering and Technology Lahore, Punjab, Lahore
[2] Department of Transportation Engineering and Management, University of Engineering and Technology Lahore, Punjab, Lahore
关键词
Artificial Intelligence; Cyberbullying annotation guidelines; Natural Language and Speech; Network Science and Online Social Networks; Sentiment Analysis; Text Mining; Urdu cyberbullying detection; Urdu sentiment analysis; Urdu tweets dataset;
D O I
10.7717/PEERJ-CS.1963
中图分类号
学科分类号
摘要
The prevalence of cyberbullying has reached an alarming rate, affecting approximately 54% of teenagers who experience various forms of cyberbullying, including offensive hate speech, threats, and racism. This research introduces a comprehensive dataset and system for cyberbullying detection in Urdu tweets, leveraging a spectrum of machine learning approaches including traditional models and advanced deep learning techniques. The objectives of this study are threefold. Firstly, a dataset consisting of 12,500 annotated tweets in Urdu is created, and it is made publicly available to the research community. Secondly, annotation guidelines for Urdu text with appropriate labels for cyberbullying detection are developed. Finally, a series of experiments is conducted to assess the performance of machine learning and deep learning techniques in detecting cyberbullying. The results indicate that fastText deep learning models outperform other models in cyberbullying detection. This study demonstrates its efficacy in effectively detecting and classifying cyberbullying incidents in Urdu tweets, contributing to the broader effort of creating a safer digital environment. © 2024 Adeeba et al. Distributed under Creative Commons CC-BY 4.0. All Rights Reserved.
引用
收藏
相关论文
共 14 条
  • [1] Addressing cyberbullying in Urdu tweets: a comprehensive dataset and detection system
    Adeeba, Farah
    Yousuf, Muhammad Irfan
    Anwer, Izza
    Tariq, Sardar Umair
    Ashfaq, Abdullah
    Naqeeb, Malik
    PEERJ COMPUTER SCIENCE, 2024, 10
  • [2] Assessing Urdu Language Processing Tools via Statistical and Outlier Detection Methods on Urdu Tweets
    Zoya
    Latif, Seemab
    Latif, Rabia
    Majeed, Hammad
    Jamail, Nor Shahida Mohd
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2023, 22 (10)
  • [3] Improving Hate Speech Detection of Urdu Tweets Using Sentiment Analysis
    Ali, Muhammad Z.
    Ehsan-Ul-Haq
    Rauf, Sahar
    Javed, Kashif
    Hussain, Sarmad
    IEEE ACCESS, 2021, 9 : 84296 - 84305
  • [4] Automatic detection of cyberbullying and threatening in Saudi tweets using machine learning
    Alghamdi, Deema
    Al-Motery, Rahaf
    Alma'abdi, Reem
    Alzamzami, Ohoud
    Babour, Amal
    INTERNATIONAL JOURNAL OF ADVANCED AND APPLIED SCIENCES, 2021, 8 (10): : 17 - 25
  • [5] Detection of Sarcasm in Urdu Tweets Using Deep Learning and Transformer Based Hybrid Approaches
    Hassan, Muhammad Ehtisham
    Hussain, Masroor
    Maab, Iffat
    Habib, Usman
    Khan, Muhammad Attique
    Masood, Anum
    IEEE ACCESS, 2024, 12 : 61542 - 61555
  • [6] Cyberbullying Detection by Sentiment Analysis of Tweets' Contents Written in Arabic in Saudi Arabia Society
    Almutairi, Amjad Rasmi
    Al-Hagery, Muhammad Abdullah
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2021, 21 (03): : 112 - 119
  • [7] BulliShield: A Smart Cyberbullying Detection and Reporting System
    Tahmid, Farhan Ishrak
    Akbar, Farhana
    Rahman, Ahsanur
    PROCEEDINGS 2024 SEVENTH INTERNATIONAL WOMEN IN DATA SCIENCE CONFERENCE AT PRINCE SULTAN UNIVERSITY, WIDS-PSU 2024, 2024, : 198 - 203
  • [8] Improving Text Emotion Detection Through Comprehensive Dataset Quality Analysis
    Langure, Alejandro de Leon
    Zareei, Mahdi
    IEEE ACCESS, 2024, 12 : 166512 - 166536
  • [9] A comprehensive review of AI based intrusion detection system
    Sowmya T.
    Mary Anita E.A.
    Measurement: Sensors, 2023, 28
  • [10] A comprehensive review on fault detection and analysis in the pumping system
    Dutta N.
    Kaliannan P.
    Paramasivam S.
    International Journal of Ambient Energy, 2022, 43 (01) : 6878 - 6898