Empowering Language Understanding with Counterfactual Reasoning

被引:0
作者
Feng, Fuli [1 ,2 ]
Zhang, Jizhi [3 ]
He, Xiangnan [3 ]
Zhang, Hanwang [4 ]
Chua, Tat-Seng [2 ]
机构
[1] NExT Sea Joint Lab, Singapore, Singapore
[2] Natl Univ Singapore, Singapore, Singapore
[3] Univ Sci & Technol China, Hefei, Anhui, Peoples R China
[4] Nanyang Technol Univ, Singapore, Singapore
来源
FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-IJCNLP 2021 | 2021年
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Present language understanding methods have demonstrated extraordinary ability of recognizing patterns in texts via machine learning. However, existing methods indiscriminately use the recognized patterns in the testing phase that is inherently different from us humans who have counterfactual thinking, e.g., to scrutinize for the hard testing samples. Inspired by this, we propose a Counterfactual Reasoning Model, which mimics the counterfactual thinking by learning from few counterfactual samples. In particular, we devise a generation module to generate representative counterfactual samples for each factual sample, and a retrospective module to retrospect the model prediction by comparing the counterfactual and factual samples. Extensive experiments on sentiment analysis (SA) and natural language inference (NLI) validate the effectiveness of our method.
引用
收藏
页码:2226 / 2236
页数:11
相关论文
共 43 条
  • [1] Confidence Predictions Affect Performance Confidence and Neural Preparation in Perceptual Decision Making
    Boldt, Annika
    Schiffer, Anne-Marike
    Waszak, Florian
    Yeung, Nick
    [J]. SCIENTIFIC REPORTS, 2019, 9 (1)
  • [2] Bowman S. R., 2015, P 2015 C EMP METH NA, P632, DOI 10.18653/v1/D15-1075
  • [3] Chen L., 2020, P IEEE CVF C COMP VI, P10800, DOI 10.1109/CVPR42600.2020.01081
  • [4] Chomsky N., 2002, Syntactic structures
  • [5] Corbiere Charles, 2019, 33rd Conference on Neural Information Processing Systems (NeurIPS 2019), P2898
  • [6] Daniel K., 2017, Thinking, fast and slow
  • [7] Dattorro J., 2010, Convex optimization and Euclidean distance geometry
  • [8] Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
  • [9] Techniques and Applications for Sentiment Analysis
    Feldman, Ronen
    [J]. COMMUNICATIONS OF THE ACM, 2013, 56 (04) : 82 - 89
  • [10] Should Graph Convolution Trust Neighbors? A Simple Causal Inference Method
    Feng, Fuli
    Huang, Weiran
    He, Xiangnan
    Xin, Xin
    Wang, Qifan
    Chua, Tat-Seng
    [J]. SIGIR '21 - PROCEEDINGS OF THE 44TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2021, : 1208 - 1218