Zero-shot stance detection via multi-perspective contrastive with unlabeled data

被引：7

作者：

Jiang, Yan ^{[1
,2
]}

Gao, Jinhua ^{[1
]}

Shen, Huawei ^{[1
,2
]}

Cheng, Xueqi ^{[2
,3
]}

机构：

[1] Chinese Acad Sci, Inst Comp Technol, Data Intelligence Syst Res Ctr, Beijing, Peoples R China

[2] Univ Chinese Acad Sci, Beijing, Peoples R China

[3] Chinese Acad Sci, Inst Comp Technol, Key Lab Network Data Sci & Technol, Beijing, Peoples R China

来源：

INFORMATION PROCESSING & MANAGEMENT | 2023年 / 60卷 / 04期

基金：

中国国家自然科学基金;

关键词：

Stance detection; Contrastive learning; Unlabeled data; Zero-shot;

D O I：

10.1016/j.ipm.2023.103361

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Stance detection is to distinguish whether the text's author supports, opposes, or maintains a neutral stance towards a given target. In most real-world scenarios, stance detection needs to work in a zero-shot manner, i.e., predicting stances for unseen targets without labeled data. One critical challenge of zero-shot stance detection is the absence of contextual information on the targets. Current works mostly concentrate on introducing external knowledge to supplement information about targets, but the noisy schema-linking process hinders their performance in practice. To combat this issue, we argue that previous studies have ignored the extensive target -related information inhabited in the unlabeled data during the training phase, and propose a simple yet efficient Multi-Perspective Contrastive Learning Framework for zero-shot stance detection. Our framework is capable of leveraging information not only from labeled data but also from extensive unlabeled data. To this end, we design target-oriented contrastive learning and label-oriented contrastive learning to capture more comprehensive target representation and more distinguishable stance features. We conduct extensive experiments on three widely adopted datasets (from 4870 to 33,090 instances), namely SemEval-2016, WT-WT, and VAST. Our framework achieves 53.6%, 77.1%, and 72.4% macro-average F1 scores on these three datasets, showing 2.71% and 0.25% improvements over state-of-the-art baselines on the SemEval-2016 and WT-WT datasets and comparable results on the more challenging VAST dataset.

引用

页数：15

共 43 条

[1] Stance detection on social media: State of the art and trends
ALDayel, Abeer
Magdy, Walid
[J]. INFORMATION PROCESSING & MANAGEMENT, 2021, 58 (04)
[2] Allaway E, 2021, 2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL-HLT 2021), P4756
[3] Allaway E, 2020, PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), P8913
[4] Augenstein I., 2016, P 2016 C EMP METH NA, P876, DOI 10.18653/v1/d16-1084
[5] Bordes Antoine, 2013, PROCADV NEURAL INF P, V26
[6] Chen QB, 2022, Arxiv, DOI arXiv:2201.08702
[7] Clark T., 2021, INTEGRATING TRANSFOR, P304, DOI [DOI 10.18653/V1/2021.WNUT-1.34, 10.18653/v1/2021.wnut-1.34]
[8] Conforti C, 2020, 58TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2020), P1715
[9] Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
[10] Commonsense Knowledge Enhanced Memory Network for Stance Classification
Du, Jiachen
Gui, Lin
Xu, Ruifeng
Xia, Yunqing
Wang, Xuan
[J]. IEEE INTELLIGENT SYSTEMS, 2020, 35 (04) : 102 - 109

← 1 2 3 4 5 →