Large Language Models Demonstrate the Potential of Statistical Learning in Language

被引:37
作者
Contreras Kallens, Pablo [1 ]
Kristensen-McLachlan, Ross Deans [2 ,3 ,4 ]
Christiansen, Morten H. [1 ,3 ,4 ,5 ,6 ]
机构
[1] Cornell Univ, Dept Psychol, Ithaca, NY USA
[2] Aarhus Univ, Ctr Humanities Comp, Aarhus, Denmark
[3] Aarhus Univ, Interacting Minds Ctr, Aarhus, Denmark
[4] Aarhus Univ, Sch Commun & Culture, Aarhus, Denmark
[5] Haskins Labs Inc, New Haven, CT USA
[6] Cornell Univ, Dept Psychol, 228 Uris Hall, Ithaca, NY 14853 USA
关键词
Large language models; Artificial intelligence; Language acquisition; Statistical learning; Grammar; Innateness; Linguistic experience; PRINCIPLES;
D O I
10.1111/cogs.13256
中图分类号
B84 [心理学];
学科分类号
04 ; 0402 ;
摘要
To what degree can language be acquired from linguistic input alone? This question has vexed scholars for millennia and is still a major focus of debate in the cognitive science of language. The complexity of human language has hampered progress because studies of language-especially those involving computational modeling-have only been able to deal with small fragments of our linguistic skills. We suggest that the most recent generation of Large Language Models (LLMs) might finally provide the computational tools to determine empirically how much of the human language ability can be acquired from linguistic experience. LLMs are sophisticated deep learning architectures trained on vast amounts of natural language data, enabling them to perform an impressive range of linguistic tasks. We argue that, despite their clear semantic and pragmatic limitations, LLMs have already demonstrated that human-like grammatical language can be acquired without the need for a built-in grammar. Thus, while there is still much to learn about how humans acquire and use language, LLMs provide full-fledged computational models for cognitive scientists to empirically evaluate just how far statistical learning might take us in explaining the full complexity of human language.
引用
收藏
页数:6
相关论文
共 42 条
  • [1] [Anonymous], 1980, RULES REPRESENTATION
  • [2] [Anonymous], 1985, Learning internal representations by error propagation, DOI 10.1016/b978-1-4832-1446-7.50035-2
  • [3] [Anonymous], 1996, Rethinking innateness: A con- nectionist perspective on development
  • [4] Arehalli S, 2023, Arxiv, DOI arXiv:2210.12187
  • [5] On the Dangers of Stochastic Parrots: Can Language Models Be Too Big?
    Bender, Emily M.
    Gebru, Timnit
    McMillan-Major, Angelina
    Shmitchell, Shmargaret
    [J]. PROCEEDINGS OF THE 2021 ACM CONFERENCE ON FAIRNESS, ACCOUNTABILITY, AND TRANSPARENCY, FACCT 2021, 2021, : 610 - 623
  • [6] BigScience Workshop, 2022, BLOOM HUGG FAC
  • [7] Brown TB, 2020, ADV NEUR IN, V33
  • [8] VERBAL-BEHAVIOR - SKINNER,BF
    CHOMSKY, N
    [J]. LANGUAGE, 1959, 35 (01) : 26 - 58
  • [9] The language capacity: architecture and evolution
    Chomsky, Noam
    [J]. PSYCHONOMIC BULLETIN & REVIEW, 2017, 24 (01) : 200 - 203
  • [10] Chomsky Noam, 2000, Step by Step. Essays on Minimalist Syntax in Honor of Howard Lasnik, P89