Empowering Language Understanding with Counterfactual Reasoning

被引：0

作者：

Feng, Fuli ^{[1
,2
]}

Zhang, Jizhi ^{[3
]}

He, Xiangnan ^{[3
]}

Zhang, Hanwang ^{[4
]}

Chua, Tat-Seng ^{[2
]}

机构：

[1] NExT Sea Joint Lab, Singapore, Singapore

[2] Natl Univ Singapore, Singapore, Singapore

[3] Univ Sci & Technol China, Hefei, Anhui, Peoples R China

[4] Nanyang Technol Univ, Singapore, Singapore

来源：

FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-IJCNLP 2021 | 2021年

基金：

中国国家自然科学基金;

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Present language understanding methods have demonstrated extraordinary ability of recognizing patterns in texts via machine learning. However, existing methods indiscriminately use the recognized patterns in the testing phase that is inherently different from us humans who have counterfactual thinking, e.g., to scrutinize for the hard testing samples. Inspired by this, we propose a Counterfactual Reasoning Model, which mimics the counterfactual thinking by learning from few counterfactual samples. In particular, we devise a generation module to generate representative counterfactual samples for each factual sample, and a retrospective module to retrospect the model prediction by comparing the counterfactual and factual samples. Extensive experiments on sentiment analysis (SA) and natural language inference (NLI) validate the effectiveness of our method.

引用

页码：2226 / 2236

页数：11

共 43 条

[1] Confidence Predictions Affect Performance Confidence and Neural Preparation in Perceptual Decision Making
Boldt, Annika
Schiffer, Anne-Marike
Waszak, Florian
Yeung, Nick
[J]. SCIENTIFIC REPORTS, 2019, 9 (1)
[2] Bowman S. R., 2015, P 2015 C EMP METH NA, P632, DOI 10.18653/v1/D15-1075
[3] Chen L., 2020, P IEEE CVF C COMP VI, P10800, DOI 10.1109/CVPR42600.2020.01081
[4] Chomsky N., 2002, Syntactic structures
[5] Corbiere Charles, 2019, 33rd Conference on Neural Information Processing Systems (NeurIPS 2019), P2898
[6] Daniel K., 2017, Thinking, fast and slow
[7] Dattorro J., 2010, Convex optimization and Euclidean distance geometry
[8] Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
[9] Techniques and Applications for Sentiment Analysis
Feldman, Ronen
[J]. COMMUNICATIONS OF THE ACM, 2013, 56 (04) : 82 - 89
[10] Should Graph Convolution Trust Neighbors? A Simple Causal Inference Method
Feng, Fuli
Huang, Weiran
He, Xiangnan
Xin, Xin
Wang, Qifan
Chua, Tat-Seng
[J]. SIGIR '21 - PROCEEDINGS OF THE 44TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2021, : 1208 - 1218

← 1 2 3 4 5 →