Crosslinguistic word order variation reflects evolutionary pressures of dependency and information locality

被引:8
作者
Hahn, Michael [1 ,2 ]
Xu, Yang [3 ]
机构
[1] Stanford Univ, Dept Linguist, Stanford, CA 94305 USA
[2] Saarland Univ, Collaborat Res Ctr 1102, D-66041 Saarbrucken, Germany
[3] Univ Toronto, Dept Comp Sci, Cognit Sci Program, Toronto, ON M5S 3G8, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
language evolution; crosslinguistic variation; word order; coadaptation; efficient communications; LANGUAGE; PHYLOGENIES; CATEGORIES; INFERENCE; DISTANCE; GRAMMAR;
D O I
10.1073/pnas.2122604119
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Languages vary considerably in syntactic structure. About 40% of the world's languages have subject-verb-object order, and about 40% have subject-object-verb order. Extensive work has sought to explain this word order variation across languages. However, the existing approaches are not able to explain coherently the frequency distribution and evolution of word order in individual languages. We propose that variation in word order reflects different ways of balancing competing pressures of dependency locality and information locality, whereby languages favor placing elements together when they are syntactically related or contextually informative about each other. Using data from 80 languages in 17 language families and phylogenetic modeling, we demonstrate that languages evolve to balance these pressures, such that word order change is accompanied by change in the frequency distribution of the syntactic structures that speakers communicate to maintain overall efficiency. Variability in word order thus reflects different ways in which languages resolve these evolutionary pressures. We identify relevant characteristics that result from this joint optimization, particularly the frequency with which subjects and objects are expressed together for the same verb. Our findings suggest that syntactic structure and usage across languages coadapt to support efficient communication under limited cognitive resources.
引用
收藏
页数:10
相关论文
共 106 条
[1]  
[Anonymous], 2010, LANGUAGE USAGE COGNI
[2]  
[Anonymous], 2008, The Limits of syntactic variation
[3]  
[Anonymous], 2001, The Atoms of Language
[4]  
Bech K., 2014, ISWOC CORPUS
[5]  
Behaghel Otto., 1932, DTSCH SYNTAX GESCHIC, V4
[6]   Bayesian inference for Markov processes with diffusion and discrete components [J].
Blackwell, PG .
BIOMETRIKA, 2003, 90 (03) :613-627
[7]   brms: An R Package for Bayesian Multilevel Models Using Stan [J].
Buerkner, Paul-Christian .
JOURNAL OF STATISTICAL SOFTWARE, 2017, 80 (01) :1-28
[8]  
Bybee J., 1994, The Evolution of Grammar: Tense, Aspect and Modality in the Languages of the World
[9]   From usage to grammar: The mind's response to repetition [J].
Bybee, Joan .
LANGUAGE, 2006, 82 (04) :711-733
[10]   Stan: A Probabilistic Programming Language [J].
Carpenter, Bob ;
Gelman, Andrew ;
Hoffman, Matthew D. ;
Lee, Daniel ;
Goodrich, Ben ;
Betancourt, Michael ;
Brubaker, Marcus A. ;
Guo, Jiqiang ;
Li, Peter ;
Riddell, Allen .
JOURNAL OF STATISTICAL SOFTWARE, 2017, 76 (01) :1-29