Emergent analogical reasoning in large language models

被引:101
作者
Webb, Taylor [1 ]
Holyoak, Keith J. [1 ]
Lu, Hongjing [1 ,2 ]
机构
[1] Univ Calif Los Angeles, Dept Psychol, Los Angeles, CA 90095 USA
[2] Univ Calif Los Angeles, Dept Stat, Los Angeles 90095, CA USA
关键词
RELATIONAL COMPLEXITY; INTELLIGENCE; SIMILARITY;
D O I
10.1038/s41562-023-01659-w
中图分类号
B84 [心理学];
学科分类号
04 ; 0402 ;
摘要
Webb et al. show that new artificial intelligence language models, such as Generative Pre-trained Transformer 3, are able to solve analogical reasoning problems at a human-like level of performance. The recent advent of large language models has reinvigorated debate over whether human cognitive capacities might emerge in such generic models given sufficient training data. Of particular interest is the ability of these models to reason about novel problems zero-shot, without any direct training. In human cognition, this capacity is closely tied to an ability to reason by analogy. Here we performed a direct comparison between human reasoners and a large language model (the text-davinci-003 variant of Generative Pre-trained Transformer (GPT)-3) on a range of analogical tasks, including a non-visual matrix reasoning task based on the rule structure of Raven's Standard Progressive Matrices. We found that GPT-3 displayed a surprisingly strong capacity for abstract pattern induction, matching or even surpassing human capabilities in most settings; preliminary tests of GPT-4 indicated even better performance. Our results indicate that large language models such as GPT-3 have acquired an emergent ability to find zero-shot solutions to a broad range of analogy problems.
引用
收藏
页码:1526 / 1541
页数:16
相关论文
共 70 条
  • [41] Mahowald K, 2023, Arxiv, DOI [arXiv:2301.06627, DOI 10.48550/ARXIV.2301.06627]
  • [42] Marcus G., 2001, ALGEBRAIC MIND INTEG
  • [43] Spatial Alignment Facilitates Visual Comparison
    Matlen, Bryan J.
    Gentner, Dedre
    Franconeri, Steven L.
    [J]. JOURNAL OF EXPERIMENTAL PSYCHOLOGY-HUMAN PERCEPTION AND PERFORMANCE, 2020, 46 (05) : 443 - 457
  • [44] Recreating Raven's: Software for systematically generating large numbers of Raven-like matrix problems with normed properties
    Matzen, Laura E.
    Benz, Zachary O.
    Dixon, Kevin R.
    Posey, Jamie
    Kroger, James K.
    Speed, Ann E.
    [J]. BEHAVIOR RESEARCH METHODS, 2010, 42 (02) : 525 - 541
  • [45] Placing language in an integrated understanding system: Next steps toward human-level performance in neural language models
    McClelland, James L.
    Hill, Felix
    Rudolph, Maja
    Baldridge, Jason
    Schutze, Hinrich
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2020, 117 (42) : 25966 - 25974
  • [46] Mitchell M., 1993, Analogy-Making as Perception
  • [48] Mondal S.S., 2023, 11 INT C LEARN REPR
  • [49] ELEMENTS OF A THEORY OF HUMAN PROBLEM-SOLVING
    NEWELL, A
    SHAW, JC
    SIMON, HA
    [J]. PSYCHOLOGICAL REVIEW, 1958, 65 (03) : 151 - 166
  • [50] Ouyang L, 2022, ADV NEUR IN