Large language models are better than theoretical linguists at theoretical linguistics

被引:0
作者
Ambridge, Ben [1 ,2 ]
Blything, Liam [1 ,2 ]
机构
[1] Univ Manchester, Manchester, England
[2] ESRCInternat Ctr Language & Commun Dev LuCiD, Manchester, England
基金
英国经济与社会研究理事会; 欧洲研究理事会;
关键词
large language models; causatives; grammaticality judgments; VERB SEMANTICS; ENTRENCHMENT; RETREAT; CONSTRAINTS; PREEMPTION; ERRORS; ROLES; SAY;
D O I
10.1515/tl-2024-2002
中图分类号
H0 [语言学];
学科分类号
030303 ; 0501 ; 050102 ;
摘要
Large language models are better than theoretical linguists at theoretical linguistics, at least in the domain of verb argument structure; explaining why (for example), we can say both The ball rolled and Someone rolled the ball, but not both The man laughed and *Someone laughed the man. Verbal accounts of this phenomenon either do not make precise quantitative predictions at all, or do so only with the help of ancillary assumptions and by-hand data processing. Large language models, on the other hand (taking text-davinci-002 as an example), predict human acceptability ratings for these types of sentences with correlations of around r = 0.9, and themselves constitute theories of language acquisition and representation; theories that instantiate exemplar-, input- and construction-based approaches, though only very loosely. Indeed, large language models succeed where these verbal (i.e., non-computational) linguistic theories fail, precisely because the latter insist - in the service of intuitive interpretability - on simple yet empirically inadequate (over)generalizations.
引用
收藏
页码:33 / 48
页数:16
相关论文
共 50 条
  • [41] Large Language Models: Their Success and Impact
    Makridakis, Spyros
    Petropoulos, Fotios
    Kang, Yanfei
    FORECASTING, 2023, 5 (03): : 536 - 549
  • [42] Large language models for chemistry robotics
    Yoshikawa, Naruki
    Skreta, Marta
    Darvish, Kourosh
    Arellano-Rubach, Sebastian
    Ji, Zhi
    Kristensen, Lasse Bjorn
    Li, Andrew Zou
    Zhao, Yuchi
    Xu, Haoping
    Kuramshin, Artur
    Aspuru-Guzik, Alan
    Shkurti, Florian
    Garg, Animesh
    AUTONOMOUS ROBOTS, 2023, 47 (08) : 1057 - 1086
  • [43] Large Language Models and Computer Security
    Iyengar, Arun
    Kundu, Ashish
    2023 5TH IEEE INTERNATIONAL CONFERENCE ON TRUST, PRIVACY AND SECURITY IN INTELLIGENT SYSTEMS AND APPLICATIONS, TPS-ISA, 2023, : 307 - 313
  • [44] A survey on large language models for recommendation
    Wu, Likang
    Zheng, Zhi
    Qiu, Zhaopeng
    Wang, Hao
    Gu, Hongchao
    Shen, Tingjia
    Qin, Chuan
    Zhu, Chen
    Zhu, Hengshu
    Liu, Qi
    Xiong, Hui
    Chen, Enhong
    WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2024, 27 (05):
  • [45] Sentiment trading with large language models
    Kirtac, Kemal
    Germano, Guido
    FINANCE RESEARCH LETTERS, 2024, 62
  • [46] Flying Into the Future With Large Language Models
    Kanjilal, Sanjat
    CLINICAL INFECTIOUS DISEASES, 2024, 78 (04) : 867 - 869
  • [47] A Surgical Perspective on Large Language Models
    Miller, Robert
    ANNALS OF SURGERY, 2023, 278 (02) : E211 - E213
  • [48] Foundation Models, Generative AI, and Large Language Models
    Ross, Angela
    McGrow, Kathleen
    Zhi, Degui
    Rasmy, Laila
    CIN-COMPUTERS INFORMATICS NURSING, 2024, 42 (05) : 377 - 387
  • [49] Generative Large Language Models Explained
    Yan, Xueming
    Xiao, Yan
    Jin, Yaochu
    IEEE COMPUTATIONAL INTELLIGENCE MAGAZINE, 2024, 19 (04) : 45 - 46
  • [50] Large Language Models as Kuwaiti Annotators
    Alostad, Hana
    BIG DATA AND COGNITIVE COMPUTING, 2025, 9 (02)