Is my stance the same as your stance? A cross validation study of stance detection datasets

Cited by: 8
Authors
Ng, Lynnette Hui Xian [1]
Carley, Kathleen M. [1]
Affiliations
[1] Carnegie Mellon Univ, CASOS, Inst Software Res, 5000 Forbes Ave, Pittsburgh, PA 15213 USA
Funding
Andrew W. Mellon Foundation, USA
Keywords
Stance detection; Natural language processing; Cross validation; Machine learning; Twitter
DOI
10.1016/j.ipm.2022.103070
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Stance detection identifies a person's evaluation of a subject and is a crucial component of many downstream applications. In practice, stance detection requires training a machine learning model on one annotated dataset and applying it to another to predict the stances of text snippets. This cross-dataset generalization poses three central questions, which we investigate using stance classification models on 7 publicly available English Twitter datasets ranging from 297 to 48,284 instances. (1) Are stance classification models generalizable across datasets? We construct a single-dataset model to train and test dataset-against-dataset, and find that models do not generalize well (avg F1=0.33). (2) Can we improve generalizability by aggregating datasets? We find that a multi-dataset model built on the aggregation of datasets performs better (avg F1=0.69). (3) Given a model built on multiple datasets, how much additional data is required to fine-tune it? We find it challenging to ascertain a minimum number of data points because performance shows no clear pattern. Investigating possible reasons for the choppy model performance, we find that texts are not easily differentiable by stance, and that annotations are not consistent within or across datasets. Our observations emphasize the need for an aggregated dataset as well as consistent labels for the generalizability of models.
Pages: 15
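
The dataset-against-dataset evaluation described in the abstract can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the toy datasets are invented, `train_majority` is a hypothetical stand-in for a real stance classifier, and macro-averaged F1 is used because the abstract does not say which F1 variant was reported. It is not the authors' implementation.

```python
from collections import Counter
from itertools import permutations
from statistics import mean


def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1 over all labels seen in y_true or y_pred."""
    labels = sorted(set(y_true) | set(y_pred))
    scores = []
    for label in labels:
        tp = sum(t == label and p == label for t, p in zip(y_true, y_pred))
        fp = sum(t != label and p == label for t, p in zip(y_true, y_pred))
        fn = sum(t == label and p != label for t, p in zip(y_true, y_pred))
        denom = 2 * tp + fp + fn
        scores.append(2 * tp / denom if denom else 0.0)
    return mean(scores)


def train_majority(examples):
    """Hypothetical stand-in model: always predicts the most frequent training label."""
    majority = Counter(label for _, label in examples).most_common(1)[0][0]
    return lambda text: majority


def cross_dataset_f1(datasets, train_fn=train_majority):
    """Train on each dataset, test on every other one; return {(train, test): F1}."""
    results = {}
    for train_name, test_name in permutations(datasets, 2):
        model = train_fn(datasets[train_name])
        texts, gold = zip(*datasets[test_name])
        preds = [model(text) for text in texts]
        results[(train_name, test_name)] = macro_f1(list(gold), preds)
    return results


# Toy data, invented for illustration only (not from the paper's datasets).
toy = {
    "A": [("I support this", "favor"), ("great idea", "favor"), ("bad idea", "against")],
    "B": [("no way", "against"), ("terrible", "against"), ("love it", "favor")],
}
scores = cross_dataset_f1(toy)
average_f1 = mean(scores.values())
```

Swapping `train_fn` for a function that fits a real classifier would keep the same train-on-one, test-on-another loop; an aggregated (multi-dataset) variant would instead pool the training examples from several datasets before fitting.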