Large language models are better than theoretical linguists at theoretical linguistics

被引：0

作者：

Ambridge, Ben ^{[1
,2
]}

Blything, Liam ^{[1
,2
]}

机构：

[1] Univ Manchester, Manchester, England

[2] ESRCInternat Ctr Language & Commun Dev LuCiD, Manchester, England

来源：

THEORETICAL LINGUISTICS | 2024年 / 50卷 / 1-2期

基金：

英国经济与社会研究理事会; 欧洲研究理事会;

关键词：

large language models; causatives; grammaticality judgments; VERB SEMANTICS; ENTRENCHMENT; RETREAT; CONSTRAINTS; PREEMPTION; ERRORS; ROLES; SAY;

D O I：

10.1515/tl-2024-2002

中图分类号：

H0 [语言学];

学科分类号：

030303 ; 0501 ; 050102 ;

摘要：

Large language models are better than theoretical linguists at theoretical linguistics, at least in the domain of verb argument structure; explaining why (for example), we can say both The ball rolled and Someone rolled the ball, but not both The man laughed and *Someone laughed the man. Verbal accounts of this phenomenon either do not make precise quantitative predictions at all, or do so only with the help of ancillary assumptions and by-hand data processing. Large language models, on the other hand (taking text-davinci-002 as an example), predict human acceptability ratings for these types of sentences with correlations of around r = 0.9, and themselves constitute theories of language acquisition and representation; theories that instantiate exemplar-, input- and construction-based approaches, though only very loosely. Indeed, large language models succeed where these verbal (i.e., non-computational) linguistic theories fail, precisely because the latter insist - in the service of intuitive interpretability - on simple yet empirically inadequate (over)generalizations.

引用

页码：33 / 48

页数：16

共 50 条

[41] Large Language Models: Their Success and Impact
Makridakis, Spyros
Petropoulos, Fotios
Kang, Yanfei
FORECASTING, 2023, 5 (03): : 536 - 549
[42] Large language models for chemistry robotics
Yoshikawa, Naruki
Skreta, Marta
Darvish, Kourosh
Arellano-Rubach, Sebastian
Ji, Zhi
Kristensen, Lasse Bjorn
Li, Andrew Zou
Zhao, Yuchi
Xu, Haoping
Kuramshin, Artur
Aspuru-Guzik, Alan
Shkurti, Florian
Garg, Animesh
AUTONOMOUS ROBOTS, 2023, 47 (08) : 1057 - 1086
[43] Large Language Models and Computer Security
Iyengar, Arun
Kundu, Ashish
2023 5TH IEEE INTERNATIONAL CONFERENCE ON TRUST, PRIVACY AND SECURITY IN INTELLIGENT SYSTEMS AND APPLICATIONS, TPS-ISA, 2023, : 307 - 313
[44] A survey on large language models for recommendation
Wu, Likang
Zheng, Zhi
Qiu, Zhaopeng
Wang, Hao
Gu, Hongchao
Shen, Tingjia
Qin, Chuan
Zhu, Chen
Zhu, Hengshu
Liu, Qi
Xiong, Hui
Chen, Enhong
WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2024, 27 (05):
[45] Sentiment trading with large language models
Kirtac, Kemal
Germano, Guido
FINANCE RESEARCH LETTERS, 2024, 62
[46] Flying Into the Future With Large Language Models
Kanjilal, Sanjat
CLINICAL INFECTIOUS DISEASES, 2024, 78 (04) : 867 - 869
[47] A Surgical Perspective on Large Language Models
Miller, Robert
ANNALS OF SURGERY, 2023, 278 (02) : E211 - E213
[48] Foundation Models, Generative AI, and Large Language Models
Ross, Angela
McGrow, Kathleen
Zhi, Degui
Rasmy, Laila
CIN-COMPUTERS INFORMATICS NURSING, 2024, 42 (05) : 377 - 387
[49] Generative Large Language Models Explained
Yan, Xueming
Xiao, Yan
Jin, Yaochu
IEEE COMPUTATIONAL INTELLIGENCE MAGAZINE, 2024, 19 (04) : 45 - 46
[50] Large Language Models as Kuwaiti Annotators
Alostad, Hana
BIG DATA AND COGNITIVE COMPUTING, 2025, 9 (02)

← 1 2 3 4 5 →