Identifying Cyberbullying Roles in Social Media

被引:0
作者
Sandoval, Manuel [1 ]
Abuhamad, Mohammed [1 ]
Furman, Patrick [1 ]
Nazari, Mujtaba [1 ]
Hall, Deborah L. [2 ]
Silva, Yasin N. [1 ]
机构
[1] Loyola Chicago Univ, Chicago, IL 60660 USA
[2] Arizona State Univ, Glendale, AZ 85306 USA
来源
SOCIAL NETWORKS ANALYSIS AND MINING, ASONAM 2024, PT III | 2025年 / 15213卷
关键词
cyberbullying; role detection; social media; LLM; PARTICIPANT ROLES;
D O I
10.1007/978-3-031-78548-1_26
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Social media has revolutionized communication, allowing people worldwide to connect and interact instantly. However, it has also led to increases in cyberbullying, which poses a significant threat to children and adolescents globally, affecting their mental health and well-being. It is critical to accurately detect the roles of individuals involved in cyberbullying incidents to effectively address the issue on a large scale. This study explores the use of machine learning models to detect the roles involved in cyberbullying interactions. After examining the AMiCA dataset and addressing class imbalance issues, we evaluate the performance of various models built with four underlying LLMs (i.e. BERT, RoBERTa, T5, and GPT-2) for role detection. Our analysis shows that oversampling techniques help improve model performance. The best model, a fine-tuned RoBERTa using oversampled data, achieved an overall F1 score of 83.5%, increasing to 89.3% after applying a prediction threshold. The top-2 F1 score without thresholding was 95.7%. Our method outperforms previously proposed models. After investigating the per-class model performance and confidence scores, we show that the models perform well in classes with more samples and less contextual confusion (e.g. Bystander Other), but struggle with classes with fewer samples (e.g. Bystander Assistant) and more contextual ambiguity (e.g. Harasser and Victim). This work highlights current strengths and limitations in the development of accurate models with limited data and complex scenarios.
引用
收藏
页码:355 / 370
页数:16
相关论文
共 26 条
[1]   'Can I afford to help?' How affordances of communication modalities guide bystanders' helping intentions towards harassment on social network sites [J].
Bastiaensens, Sara ;
Vandebosch, Heidi ;
Poels, Karolien ;
Van Cleemput, Katrien ;
DeSmet, Ann ;
De Bourdeaudhuij, Ilse .
BEHAVIOUR & INFORMATION TECHNOLOGY, 2015, 34 (04) :425-435
[2]  
Cheng L, 2019, PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P5829
[3]  
Cheng L, 2019, Data Min, P235
[4]  
Dadvar M., 2012, P 12 DUTCH BELG INF, P23
[5]  
Dadvar M, 2018, Arxiv, DOI arXiv:1812.08046
[6]   Me and Others Around: The Roles of Personal and Social Norms in Chinese Adolescent Bystanders' Responses Toward Cyberbullying [J].
Dang, Jianning ;
Liu, Li .
JOURNAL OF INTERPERSONAL VIOLENCE, 2022, 37 (9-10) :NP6329-NP6354
[7]  
Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
[8]  
Hamlett M., 2022, P INT AAAI C WEB SOC, V16, P1251, DOI 10.1609/icwsm.v16i1.19376
[9]   ADASYN: Adaptive Synthetic Sampling Approach for Imbalanced Learning [J].
He, Haibo ;
Bai, Yang ;
Garcia, Edwardo A. ;
Li, Shutao .
2008 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-8, 2008, :1322-1328
[10]   Automatic classification of participant roles in cyberbullying: Can we detect victims, bullies, and bystanders in social media text? [J].
Jacobs, Gilles ;
Van Hee, Cynthia ;
Hoste, Veronique .
NATURAL LANGUAGE ENGINEERING, 2022, 28 (02) :141-166