Large Language Models Demonstrate the Potential of Statistical Learning in Language

被引：46

作者：

Contreras Kallens, Pablo ^{[1
]}

Kristensen-McLachlan, Ross Deans ^{[2
,3
,4
]}

Christiansen, Morten H. ^{[1
,3
,4
,5
,6
]}

机构：

[1] Cornell Univ, Dept Psychol, Ithaca, NY USA

[2] Aarhus Univ, Ctr Humanities Comp, Aarhus, Denmark

[3] Aarhus Univ, Interacting Minds Ctr, Aarhus, Denmark

[4] Aarhus Univ, Sch Commun & Culture, Aarhus, Denmark

[5] Haskins Labs Inc, New Haven, CT USA

[6] Cornell Univ, Dept Psychol, 228 Uris Hall, Ithaca, NY 14853 USA

来源：

COGNITIVE SCIENCE | 2023年 / 47卷 / 03期

关键词：

Large language models; Artificial intelligence; Language acquisition; Statistical learning; Grammar; Innateness; Linguistic experience; PRINCIPLES;

D O I：

10.1111/cogs.13256

中图分类号：

B84 [心理学];

学科分类号：

04 ; 0402 ;

摘要：

To what degree can language be acquired from linguistic input alone? This question has vexed scholars for millennia and is still a major focus of debate in the cognitive science of language. The complexity of human language has hampered progress because studies of language-especially those involving computational modeling-have only been able to deal with small fragments of our linguistic skills. We suggest that the most recent generation of Large Language Models (LLMs) might finally provide the computational tools to determine empirically how much of the human language ability can be acquired from linguistic experience. LLMs are sophisticated deep learning architectures trained on vast amounts of natural language data, enabling them to perform an impressive range of linguistic tasks. We argue that, despite their clear semantic and pragmatic limitations, LLMs have already demonstrated that human-like grammatical language can be acquired without the need for a built-in grammar. Thus, while there is still much to learn about how humans acquire and use language, LLMs provide full-fledged computational models for cognitive scientists to empirically evaluate just how far statistical learning might take us in explaining the full complexity of human language.

引用

页数：6

共 42 条

[1]

[Anonymous], 1980, RULES REPRESENTATION

[2]

Arehalli S, 2023, Arxiv, DOI arXiv:2210.12187

[3] On the Dangers of Stochastic Parrots: Can Language Models Be Too Big? [J].

Bender, Emily M. ;

Gebru, Timnit ;

McMillan-Major, Angelina ;

Shmitchell, Shmargaret .

PROCEEDINGS OF THE 2021 ACM CONFERENCE ON FAIRNESS, ACCOUNTABILITY, AND TRANSPARENCY, FACCT 2021, 2021, :610-623

[4]

BigScience Workshop, 2022, BLOOM HUGG FAC

[5]

Brown TB, 2020, ADV NEUR IN, V33

[6] VERBAL-BEHAVIOR - SKINNER,BF [J].

CHOMSKY, N .

LANGUAGE, 1959, 35 (01) :26-58

[7]

Chomsky N., 1995, The minimalist program

[8] The language capacity: architecture and evolution [J].

Chomsky, Noam .

PSYCHONOMIC BULLETIN & REVIEW, 2017, 24 (01) :200-203

[9]

Chowdhery A, 2022, Arxiv, DOI [arXiv:2204.02311, 10.48550/arXiv.2204.02311, DOI 10.48550/ARXIV.2204.02311]

[10]

Christiansen M. H., 2022, LANGUAGE GAME IMPROV

← 1 2 3 4 5 →