Counterfactual reasoning: Testing language models' understanding of hypothetical scenarios

被引：0

作者：

Li, Jiaxuan ^{[1
]}

Yu, Lang ^{[2
]}

Ettinger, Allyson ^{[3
]}

机构：

[1] Univ Calif Irvine, Irvine, CA 92617 USA

[2] Meta, Seattle, WA 98109 USA

[3] Univ Chicago, Chicago, IL 60637 USA

来源：

61ST CONFERENCE OF THE THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 2 | 2023年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Current pre-trained language models have enabled remarkable improvements in downstream tasks, but it remains difficult to distinguish effects of statistical correlation from more systematic logical reasoning grounded on the understanding of real world. We tease these factors apart by leveraging counterfactual conditionals, which force language models to predict unusual consequences based on hypothetical propositions. We introduce a set of tests from psycholinguistic experiments, as well as larger-scale controlled datasets, to probe counterfactual predictions from five pre-trained language models. We find that models are consistently able to override real-world knowledge in counterfactual scenarios, and that this effect is more robust in case of stronger baseline world knowledge-however, we also find that for most models this effect appears largely to be driven by simple lexical cues. When we mitigate effects of both world knowledge and lexical cues to test knowledge of linguistic nuances of counterfactuals, we find that only GPT-3 shows sensitivity to these nuances, though this sensitivity is also non-trivially impacted by lexical associative factors.

引用

页码：804 / 815

页数：12

共 19 条

[1] Brown TB, 2020, ADV NEUR IN, V33
[2] Devlin J, 2019, 2019 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL HLT 2019), VOL. 1, P4171
[3] Amnesic Probing: Behavioral Explanation with Amnesic Counterfactuals
Elazar, Yanai
Ravfogel, Shauli
Jacovi, Alon
Goldberg, Yoav
[J]. TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2021, 9 : 160 - 175
[4] Anomalies in real and counterfactual worlds: An eye-movement investigation
Ferguson, Heather J.
Sanford, Anthony J.
[J]. JOURNAL OF MEMORY AND LANGUAGE, 2008, 58 (03) : 609 - 626
[5] Eye movements reveal rapid concurrent access to factual and counterfactual interpretations of the world
Ferguson, Heather J.
[J]. QUARTERLY JOURNAL OF EXPERIMENTAL PSYCHOLOGY, 2012, 65 (05) : 939 - 961
[6] Predicting Pragmatic Reasoning in Language Games
Frank, Michael C.
Goodman, Noah D.
[J]. SCIENCE, 2012, 336 (6084) : 998 - 998
[7] Frohberg J, 2022, Arxiv, DOI arXiv:2112.11941
[8] Liu YH, 2019, Arxiv, DOI [arXiv:1907.11692, 10.48550/arXiv.1907.11692,abs/1907.11692,arXiv-1907, DOI 10.48550/ARXIV.1907.11692,ABS/1907.11692,ARXIV-1907, DOI 10.48550/ARXIV.1907.11692]
[9] McKenzie Ian, 2022, The inverse scaling prize
[10] Meng K, 2022, Arxiv, DOI arXiv:2202.05262

← 1 2 →