Zero-shot stance detection via multi-perspective contrastive with unlabeled data

被引:7
作者
Jiang, Yan [1 ,2 ]
Gao, Jinhua [1 ]
Shen, Huawei [1 ,2 ]
Cheng, Xueqi [2 ,3 ]
机构
[1] Chinese Acad Sci, Inst Comp Technol, Data Intelligence Syst Res Ctr, Beijing, Peoples R China
[2] Univ Chinese Acad Sci, Beijing, Peoples R China
[3] Chinese Acad Sci, Inst Comp Technol, Key Lab Network Data Sci & Technol, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Stance detection; Contrastive learning; Unlabeled data; Zero-shot;
D O I
10.1016/j.ipm.2023.103361
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Stance detection is to distinguish whether the text's author supports, opposes, or maintains a neutral stance towards a given target. In most real-world scenarios, stance detection needs to work in a zero-shot manner, i.e., predicting stances for unseen targets without labeled data. One critical challenge of zero-shot stance detection is the absence of contextual information on the targets. Current works mostly concentrate on introducing external knowledge to supplement information about targets, but the noisy schema-linking process hinders their performance in practice. To combat this issue, we argue that previous studies have ignored the extensive target -related information inhabited in the unlabeled data during the training phase, and propose a simple yet efficient Multi-Perspective Contrastive Learning Framework for zero-shot stance detection. Our framework is capable of leveraging information not only from labeled data but also from extensive unlabeled data. To this end, we design target-oriented contrastive learning and label-oriented contrastive learning to capture more comprehensive target representation and more distinguishable stance features. We conduct extensive experiments on three widely adopted datasets (from 4870 to 33,090 instances), namely SemEval-2016, WT-WT, and VAST. Our framework achieves 53.6%, 77.1%, and 72.4% macro-average F1 scores on these three datasets, showing 2.71% and 0.25% improvements over state-of-the-art baselines on the SemEval-2016 and WT-WT datasets and comparable results on the more challenging VAST dataset.
引用
收藏
页数:15
相关论文
共 43 条
  • [1] Stance detection on social media: State of the art and trends
    ALDayel, Abeer
    Magdy, Walid
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2021, 58 (04)
  • [2] Allaway E, 2021, 2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL-HLT 2021), P4756
  • [3] Allaway E, 2020, PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), P8913
  • [4] Augenstein I., 2016, P 2016 C EMP METH NA, P876, DOI 10.18653/v1/d16-1084
  • [5] Bordes Antoine, 2013, PROCADV NEURAL INF P, V26
  • [6] Chen QB, 2022, Arxiv, DOI arXiv:2201.08702
  • [7] Clark T., 2021, INTEGRATING TRANSFOR, P304, DOI [DOI 10.18653/V1/2021.WNUT-1.34, 10.18653/v1/2021.wnut-1.34]
  • [8] Conforti C, 2020, 58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), P1715
  • [9] Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
  • [10] Commonsense Knowledge Enhanced Memory Network for Stance Classification
    Du, Jiachen
    Gui, Lin
    Xu, Ruifeng
    Xia, Yunqing
    Wang, Xuan
    [J]. IEEE INTELLIGENT SYSTEMS, 2020, 35 (04) : 102 - 109