Modeling Structure-Building in the Brain With CCG Parsing and Large Language Models

被引:13
作者
Stanojevic, Milos [1 ]
Brennan, Jonathan R. R. [2 ]
Dunagan, Donald [3 ]
Steedman, Mark [4 ]
Hale, John T. T. [1 ,3 ]
机构
[1] Google DeepMind, London, England
[2] Univ Michigan, Dept Linguist, Lorch Hall,611 Tappen Ave, Ann Arbor, MI 48109 USA
[3] Univ Georgia, Dept Linguist, Athens, GA USA
[4] Univ Edinburgh, Sch Informat, Edinburgh, Scotland
基金
美国国家科学基金会;
关键词
Syntax; Parsing; Grammar; fMRI; Neural networks; Language modeling; Surprisal; SENTENCE COMPREHENSION; NEURAL BASIS; MEMORY; FMRI; INFORMATION; SYNTAX;
D O I
10.1111/cogs.13312
中图分类号
B84 [心理学];
学科分类号
04 ; 0402 ;
摘要
To model behavioral and neural correlates of language comprehension in naturalistic environments, researchers have turned to broad-coverage tools from natural-language processing and machine learning. Where syntactic structure is explicitly modeled, prior work has relied predominantly on context-free grammars (CFGs), yet such formalisms are not sufficiently expressive for human languages. Combinatory categorial grammars (CCGs) are sufficiently expressive directly compositional models of grammar with flexible constituency that affords incremental interpretation. In this work, we evaluate whether a more expressive CCG provides a better model than a CFG for human neural signals collected with functional magnetic resonance imaging (fMRI) while participants listen to an audiobook story. We further test between variants of CCG that differ in how they handle optional adjuncts. These evaluations are carried out against a baseline that includes estimates of next-word predictability from a transformer neural network language model. Such a comparison reveals unique contributions of CCG structure-building predominantly in the left posterior temporal lobe: CCG-derived measures offer a superior fit to neural signals compared to those derived from a CFG. These effects are spatially distinct from bilateral superior temporal effects that are unique to predictability. Neural effects for structure-building are thus separable from predictability during naturalistic listening, and those effects are best characterized by a grammar whose expressive power is motivated on independent linguistic grounds.
引用
收藏
页数:39
相关论文
共 109 条
[11]  
Berwick RobertC. Amy S. Weinberg., 1984, The Grammatical Basis of Linguistic Performance: Language Use and Acquisition
[12]   Localising memory retrieval and syntactic composition: an fMRI study of naturalistic language comprehension [J].
Bhattasali, Shohini ;
Fabre, Murielle ;
Luh, Wen-Ming ;
Al Saied, Hazem ;
Constant, Mathieu ;
Pallier, Christophe ;
Brennan, Jonathan R. ;
Spreng, R. Nathan ;
Hale, John .
LANGUAGE COGNITION AND NEUROSCIENCE, 2019, 34 (04) :491-510
[13]  
Bies A., 1995, MSCIS950607 U PENNS
[14]  
Boersma Paul, 2021, Praat: doing phonetics by computer [Computer program]. Version 6.2.10, DOI DOI 10.1097/AUD.0B013E31821473F7
[15]   Naturalistic Sentence Comprehension in the Brain [J].
Brennan, Jonathan .
LANGUAGE AND LINGUISTICS COMPASS, 2016, 10 (07) :299-313
[16]   Syntactic structure building in the anterior temporal lobe during natural story listening [J].
Brennan, Jonathan ;
Nir, Yuval ;
Hasson, Uri ;
Malach, Rafael ;
Heeger, David J. ;
Pylkkaenen, Liina .
BRAIN AND LANGUAGE, 2012, 120 (02) :163-173
[17]   Localizing syntactic predictions using recurrent neural network grammars [J].
Brennan, Jonathan R. ;
Dyer, Chris ;
Kuncoro, Adhiguna ;
Hale, John T. .
NEUROPSYCHOLOGIA, 2020, 146
[18]   Hierarchical structure guides rapid linguistic predictions during naturalistic listening [J].
Brennan, Jonathan R. ;
Hale, John T. .
PLOS ONE, 2019, 14 (01)
[19]   MEG Evidence for Incremental Sentence Composition in the Anterior Temporal Lobe [J].
Brennan, Jonathan R. ;
Pylkkanen, Liina .
COGNITIVE SCIENCE, 2017, 41 :1515-1531
[20]   Abstract linguistic structure correlates with temporal activity during naturalistic comprehension [J].
Brennan, Jonathan R. ;
Stabler, Edward P. ;
Van Wagenen, Sarah E. ;
Luh, Wen-Ming ;
Hale, John T. .
BRAIN AND LANGUAGE, 2016, 157 :81-94