Experiments in Modeling Disagreement

被引:0
作者
Tahaei, Narjes [1 ]
Bergler, Sabine [1 ]
机构
[1] Concordia Univ, CLaC Lab, Montreal, PQ, Canada
来源
ARTIFICIAL NEURAL NETWORKS IN PATTERN RECOGNITION, ANNPR 2024 | 2024年 / 15154卷
关键词
demographics; sexism; disagreement; CLASSIFICATION;
D O I
10.1007/978-3-031-71602-7_21
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This research examines how annotators' features influence the labeling of sexist content in social media datasets, with a specific focus on the EXIST dataset, which includes direct sexist messages, reports, and descriptions of sexist experiences and stereotypes. By comparing the use of gold labels derived by majority vote with individual annotator labels, we found that incorporating annotator labels into the input tokens enhances model performance in predicting gold labels. Our study further investigates the impact of additionally integrating annotators' demographic information into BERT models to enhance performance in subjective natural language processing tasks. We find that integrating such demographic data into the input leads to improved model performance.
引用
收藏
页码:245 / 255
页数:11
相关论文
共 24 条
[1]  
Alm C.O., 2011, P 49 ANN M ASS COMP, V2, P107
[2]  
Almanea D, 2022, LREC 2022: THIRTEEN INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, P2282
[3]   Automatic Identification and Classification of Misogynistic Language on Twitter [J].
Anzovino, Maria ;
Fersini, Elisabetta ;
Rosso, Paolo .
NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS (NLDB 2018), 2018, 10859 :57-64
[4]  
Chiril P, 2020, 58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), P4055
[5]   Dealing with Disagreements: Looking Beyond the Majority Vote in Subjective Annotations [J].
Davani, Aida Mostafazadeh ;
Diaz, Mark ;
Prabhakaran, Vinodkumar .
TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2022, 10 :92-110
[6]  
Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
[7]   Jury Learning: Integrating Dissenting Voices into Machine Learning Models [J].
Gordon, Mitchell L. ;
Lam, Michelle S. ;
Park, Joon Sung ;
Patel, Kayur ;
Hancock, Jeff T. ;
Hashimoto, Tatsunori ;
Bernstein, Michael S. .
PROCEEDINGS OF THE 2022 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS (CHI' 22), 2022,
[8]  
Guest E, 2021, 16TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (EACL 2021), P1336
[9]   Understanding international perceptions of the severity of harmful content online [J].
Jiang, Jialun Aaron ;
Scheuerman, Morgan Klaus ;
Fiesler, Casey ;
Brubaker, Jed R. .
PLOS ONE, 2021, 16 (08)
[10]   Parting Crowds: Characterizing Divergent Interpretations in Crowdsourced Annotation Tasks [J].
Kairam, Sanjay ;
Heer, Jeffrey .
ACM CONFERENCE ON COMPUTER-SUPPORTED COOPERATIVE WORK AND SOCIAL COMPUTING (CSCW 2016), 2016, :1637-1648