Surprisal From Language Models Can Predict ERPs in Processing Predicate-Argument Structures Only if Enriched by an Agent Preference Principle

Cited by: 9
Authors
Huber, Eva [1, 2]
Sauppe, Sebastian [1, 2, 3]
Isasi-Isasmendi, Arrate [1, 2]
Bornkessel-Schlesewsky, Ina [4]
Merlo, Paola [5, 6]
Bickel, Balthasar [1, 2]
Affiliations
[1] Univ Zurich, Dept Comparat Language Sci, Zurich, Switzerland
[2] Univ Zurich, Ctr Interdisciplinary Study Language Evolut, Zurich, Switzerland
[3] Univ Zurich, Dept Psychol, Zurich, Switzerland
[4] Univ South Australia, Australian Res Ctr Interact & Virtual Environm, Cognit Neurosci Lab, Adelaide, Australia
[5] Univ Geneva, Dept Linguist, Geneva, Switzerland
[6] Univ Geneva, Univ Ctr Comp Sci, Geneva, Switzerland
Source
NEUROBIOLOGY OF LANGUAGE | 2024 / Vol. 5 / Issue 01
Funding
Swiss National Science Foundation; Australian Research Council;
Keywords
artificial neural networks; computational modeling; event cognition; ERP; sentence processing; surprisal; large language models (LLMs); R PACKAGE; BRAIN; COMPREHENSION; EVENTS; ROLES; ORDER; INFORMATION; REANALYSIS; SPEAKERS; ANIMACY;
DOI
10.1162/nol_a_00121
Chinese Library Classification number
H0 [Linguistics];
Discipline classification codes
030303 ; 0501 ; 050102 ;
Abstract
Language models based on artificial neural networks increasingly capture key aspects of how humans process sentences. Most notably, model-based surprisals predict event-related potentials such as N400 amplitudes during parsing. Assuming that these models represent realistic estimates of human linguistic experience, their success in modeling language processing raises the possibility that the human processing system relies on no other principles than the general architecture of language models and on sufficient linguistic input. Here, we test this hypothesis on N400 effects observed during the processing of verb-final sentences in German, Basque, and Hindi. By stacking Bayesian generalised additive models, we show that, in each language, N400 amplitudes and topographies in the region of the verb are best predicted when model-based surprisals are complemented by an Agent Preference principle that transiently interprets initial role-ambiguous noun phrases as agents, leading to reanalysis when this interpretation fails. Our findings demonstrate the need for this principle independently of usage frequencies and structural differences between languages. The principle has an unequal force, however. Compared to surprisal, its effect is weakest in German, stronger in Hindi, and still stronger in Basque. This gradient is correlated with the extent to which grammars allow unmarked NPs to be patients, a structural feature that boosts reanalysis effects. We conclude that language models gain more neurobiological plausibility by incorporating an Agent Preference. Conversely, theories of human processing profit from incorporating surprisal estimates in addition to principles like the Agent Preference, which arguably have distinct evolutionary roots.
Pages: 167-200
Page count: 34