Large language models are better than theoretical linguists at theoretical linguistics

被引:0
作者
Ambridge, Ben [1 ,2 ]
Blything, Liam [1 ,2 ]
机构
[1] Univ Manchester, Manchester, England
[2] ESRCInternat Ctr Language & Commun Dev LuCiD, Manchester, England
基金
英国经济与社会研究理事会; 欧洲研究理事会;
关键词
large language models; causatives; grammaticality judgments; VERB SEMANTICS; ENTRENCHMENT; RETREAT; CONSTRAINTS; PREEMPTION; ERRORS; ROLES; SAY;
D O I
10.1515/tl-2024-2002
中图分类号
H0 [语言学];
学科分类号
030303 ; 0501 ; 050102 ;
摘要
Large language models are better than theoretical linguists at theoretical linguistics, at least in the domain of verb argument structure; explaining why (for example), we can say both The ball rolled and Someone rolled the ball, but not both The man laughed and *Someone laughed the man. Verbal accounts of this phenomenon either do not make precise quantitative predictions at all, or do so only with the help of ancillary assumptions and by-hand data processing. Large language models, on the other hand (taking text-davinci-002 as an example), predict human acceptability ratings for these types of sentences with correlations of around r = 0.9, and themselves constitute theories of language acquisition and representation; theories that instantiate exemplar-, input- and construction-based approaches, though only very loosely. Indeed, large language models succeed where these verbal (i.e., non-computational) linguistic theories fail, precisely because the latter insist - in the service of intuitive interpretability - on simple yet empirically inadequate (over)generalizations.
引用
收藏
页码:33 / 48
页数:16
相关论文
共 50 条
  • [1] Large Language Models and theoretical linguistics
    Fox, Danny
    Katzir, Roni
    THEORETICAL LINGUISTICS, 2024, 50 (1-2) : 71 - 76
  • [2] Can Large Language Models Generate Middle School Mathematics Explanations Better Than Human Teachers?
    Wang, Allison
    Prihar, Ethan
    Haim, Aaron
    Heffernan, Neil
    ARTIFICIAL INTELLIGENCE IN EDUCATION: POSTERS AND LATE BREAKING RESULTS, WORKSHOPS AND TUTORIALS, INDUSTRY AND INNOVATION TRACKS, PRACTITIONERS, DOCTORAL CONSORTIUM AND BLUE SKY, AIED 2024, 2024, 2151 : 242 - 250
  • [3] Toward a Better Understanding of the Emotional Dynamics of Negotiation with Large Language Models
    Lin, Eleanor
    Hale, James
    Gratch, Jonathan
    PROCEEDINGS OF THE 2023 INTERNATIONAL SYMPOSIUM ON THEORY, ALGORITHMIC FOUNDATIONS, AND PROTOCOL DESIGN FOR MOBILE NETWORKS AND MOBILE COMPUTING, MOBIHOC 2023, 2023, : 545 - 550
  • [4] THEORETICAL ASPECTS OF QUINTOM MODELS
    Qiu, Taotao
    MODERN PHYSICS LETTERS A, 2010, 25 (11-12) : 909 - 921
  • [5] Theoretical aspects of the phonological models
    Hernando Cuadrado, Luis Alberto
    DICENDA-CUADERNOS DE FILOLOGIA HISPANICA, 2007, 25 : 105 - 123
  • [6] THEORETICAL MODELS OF DARK ENERGY
    Yoo, Jaewon
    Watanabe, Yuki
    INTERNATIONAL JOURNAL OF MODERN PHYSICS D, 2012, 21 (12):
  • [7] Generative AI in Psychological Therapy: Perspectives on Computational Linguistics and Large Language Models in Written Behaviour Monitoring
    Bird, Jordan J.
    Wright, David
    Sumich, Alexander
    Lotfi, Ahmad
    17TH ACM INTERNATIONAL CONFERENCE ON PERVASIVE TECHNOLOGIES RELATED TO ASSISTIVE ENVIRONMENTS, PETRA 2024, 2024, : 322 - 328
  • [8] Better Results Through Ambiguity Resolution: Large Language Models that Ask Clarifying Questions
    Tix, Bernadette
    Binsted, Kim
    AUGMENTED COGNITION, PT II, AC 2024, 2024, 14695 : 72 - 87
  • [9] Large language models are less effective at clinical prediction tasks than locally trained machine learning models
    Brown, Katherine E.
    Yan, Chao
    Li, Zhuohang
    Zhang, Xinmeng
    Collins, Benjamin X.
    Chen, You
    Clayton, Ellen Wright
    Kantarcioglu, Murat
    Vorobeychik, Yevgeniy
    Malin, Bradley A.
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2025, 32 (05) : 811 - 822
  • [10] Imitation and Large Language Models
    Boisseau, Eloise
    MINDS AND MACHINES, 2024, 34 (04)