Causal Analysis of Syntactic Agreement Mechanisms in Neural Language Models

被引:0
作者
Finlayson, Matthew [1 ]
Mueller, Aaron [2 ]
Gehrmann, Sebastian [3 ]
Shieber, Stuart [1 ]
Linzen, Tal [4 ]
Belinkov, Yonatan [5 ]
机构
[1] Harvard Univ, Cambridge, MA 02138 USA
[2] Johns Hopkins Univ, Baltimore, MD USA
[3] Google Res, New York, NY USA
[4] NYU, New York, NY USA
[5] Technion IIT, Haifa, Israel
来源
59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 1 (ACL-IJCNLP 2021) | 2021年
基金
以色列科学基金会; 美国国家科学基金会;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Targeted syntactic evaluations have demonstrated the ability of language models to perform subject-verb agreement given difficult contexts. To elucidate the mechanisms by which the models accomplish this behavior, this study applies causal mediation analysis to pre-trained neural language models. We investigate the magnitude of models' preferences for grammatical inflections, as well as whether neurons process subject-verb agreement similarly across sentences with different syntactic structures. We uncover similarities and differences across architectures and model sizes-notably, that larger models do not necessarily learn stronger preferences. We also observe two distinct mechanisms for producing subject-verb agreement depending on the syntactic structure of the input sentence. Finally, we find that language models rely on similar sets of neurons when given sentences with similar syntactic structure.
引用
收藏
页码:1828 / 1843
页数:16
相关论文
共 42 条
  • [1] Belinkov Y, 2021, Computational Linguistics
  • [2] Belinkov Y., 2019, C N AM CHAPTER ASS C
  • [3] Belinkov Yonatan, 2018, THESIS MASSACHUSETTS
  • [4] Chi E. A., 2020, P 58 ANN M ASS COMP, P5564, DOI [DOI 10.18653/V1/2020.ACL-MAIN.493, 10.18653/v1/2020.acl-main.493]
  • [5] Dai ZH, 2019, 57TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2019), P2978
  • [6] Devlin J, 2018, ARXIV
  • [7] Futrell R., 2019, P 2019 C N AM CHAPTE, V1, P32, DOI DOI 10.18653/V1/N19-1004
  • [8] Giulianelli Mario, 2018, P 2018 EMNLP WORKSHO, P240
  • [9] Goldberg Y., 2019, Assessing BERTs Syntactic Abilities
  • [10] Gulordava K., 2018, P 2018 C N AM CHAPTE, V1, P1195, DOI [10.18653/v1/N18-1108, DOI 10.18653/V1/N18-1108]