Surprisal From Language Models Can Predict ERPs in Processing Predicate-Argument Structures Only if Enriched by an Agent Preference Principle

被引：9

作者：

Huber, Eva ^{[1
,2
]}

Sauppe, Sebastian ^{[1
,2
,3
]}

Isasi-Isasmendi, Arrate ^{[1
,2
]}

Bornkessel-Schlesewsky, Ina ^{[4
]}

Merlo, Paola ^{[5
,6
]}

Bickel, Balthasar ^{[1
,2
]}

机构：

[1] Univ Zurich, Dept Comparat Language Sci, Zurich, Switzerland

[2] Univ Zurich, Ctr Interdisciplinary Study Language Evolut, Zurich, Switzerland

[3] Univ Zurich, Dept Psychol, Zurich, Switzerland

[4] Univ South Australia, Australian Res Ctr Interact & Virtual Environm, Cognit Neurosci Lab, Adelaide, Australia

[5] Univ Geneva, Dept Linguist, Geneva, Switzerland

[6] Univ Geneva, Univ Ctr Comp Sci, Geneva, Switzerland

来源：

NEUROBIOLOGY OF LANGUAGE | 2024年 / 5卷 / 01期

基金：

瑞士国家科学基金会; 澳大利亚研究理事会;

关键词：

artificial neural networks; computational modeling; event cognition; ERP; sentence processing; surprisal; large language models (LLMs); R PACKAGE; BRAIN; COMPREHENSION; EVENTS; ROLES; ORDER; INFORMATION; REANALYSIS; SPEAKERS; ANIMACY;

D O I：

10.1162/nol_a_00121

中图分类号：

H0 [语言学];

学科分类号：

030303 ; 0501 ; 050102 ;

摘要：

Language models based on artificial neural networks increasingly capture key aspects of how humans process sentences. Most notably, model-based surprisals predict event-related potentials such as N400 amplitudes during parsing. Assuming that these models represent realistic estimates of human linguistic experience, their success in modeling language processing raises the possibility that the human processing system relies on no other principles than the general architecture of language models and on sufficient linguistic input. Here, we test this hypothesis on N400 effects observed during the processing of verb-final sentences in German, Basque, and Hindi. By stacking Bayesian generalised additive models, we show that, in each language, N400 amplitudes and topographies in the region of the verb are best predicted when model-based surprisals are complemented by an Agent Preference principle that transiently interprets initial role-ambiguous noun phrases as agents, leading to reanalysis when this interpretation fails. Our findings demonstrate the need for this principle independently of usage frequencies and structural differences between languages. The principle has an unequal force, however. Compared to surprisal, its effect is weakest in German, stronger in Hindi, and still stronger in Basque. This gradient is correlated with the extent to which grammars allow unmarked NPs to be patients, a structural feature that boosts reanalysis effects. We conclude that language models gain more neurobiological plausibility by incorporating an Agent Preference. Conversely, theories of human processing profit from incorporating surprisal estimates in addition to principles like the Agent Preference, which arguably have distinct evolutionary roots.

引用

页码：167 / 200

页数：34

共 143 条

[1]

Agerri R, 2020, PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), P4781

[2]

Arehalli S, 2020, PsyArXiv, DOI [10.31234/osf.io/97qcg, DOI 10.31234/OSF.IO/97QCG]

[3]

Arehalli Suhas, 2022, P 26 C COMPUTATIONAL, P301, DOI 10.18653/v1/2022.conll-1.20

[4] Probabilistic language models in cognitive neuroscience: Promises and pitfalls [J].

Armeni, Kristijan ;

Willems, Roel M. ;

Frank, Stefan L. .

NEUROSCIENCE AND BIOBEHAVIORAL REVIEWS, 2017, 83 :579-588

[5]

Aurnhammer C., 2018, PsyArXiv, DOI [10.31234/osf.io/wec74, DOI 10.31234/OSF.IO/WEC74]

[6] Subject-object ambiguities in German embedded clauses: An across-the-board comparison [J].

Bader, M ;

Meng, M .

JOURNAL OF PSYCHOLINGUISTIC RESEARCH, 1999, 28 (02) :121-143

[7] Word order in German: A corpus study [J].

Bader, Markus ;

Haeussler, Jana .

LINGUA, 2010, 120 (03) :717-762

[8] The Entropy of WordsLearnability and Expressivity across More than 1000 Languages [J].

Bentz, Christian ;

Alikaniotis, Dimitrios ;

Cysouw, Michael ;

Ferrer-i-Cancho, Ramon .

ENTROPY, 2017, 19 (06)

[9] Referential density in discourse and syntactic typology [J].

Bickel, B .

LANGUAGE, 2003, 79 (04) :708-736

[10] The Neurophysiology of Language Processing Shapes the Evolution of Grammar: Evidence from Case Marking [J].

Bickel, Balthasar ;

Witzlack-Makarevich, Alena ;

Choudhary, Kamal K. ;

Schlesewsky, Matthias ;

Bornkessel-Schlesewsky, Ina .

PLOS ONE, 2015, 10 (08)

← 1 2 3 4 5 6 7 8 9 10 →