Emergent analogical reasoning in large language models

被引:101
作者
Webb, Taylor [1 ]
Holyoak, Keith J. [1 ]
Lu, Hongjing [1 ,2 ]
机构
[1] Univ Calif Los Angeles, Dept Psychol, Los Angeles, CA 90095 USA
[2] Univ Calif Los Angeles, Dept Stat, Los Angeles 90095, CA USA
关键词
RELATIONAL COMPLEXITY; INTELLIGENCE; SIMILARITY;
D O I
10.1038/s41562-023-01659-w
中图分类号
B84 [心理学];
学科分类号
04 ; 0402 ;
摘要
Webb et al. show that new artificial intelligence language models, such as Generative Pre-trained Transformer 3, are able to solve analogical reasoning problems at a human-like level of performance. The recent advent of large language models has reinvigorated debate over whether human cognitive capacities might emerge in such generic models given sufficient training data. Of particular interest is the ability of these models to reason about novel problems zero-shot, without any direct training. In human cognition, this capacity is closely tied to an ability to reason by analogy. Here we performed a direct comparison between human reasoners and a large language model (the text-davinci-003 variant of Generative Pre-trained Transformer (GPT)-3) on a range of analogical tasks, including a non-visual matrix reasoning task based on the rule structure of Raven's Standard Progressive Matrices. We found that GPT-3 displayed a surprisingly strong capacity for abstract pattern induction, matching or even surpassing human capabilities in most settings; preliminary tests of GPT-4 indicated even better performance. Our results indicate that large language models such as GPT-3 have acquired an emergent ability to find zero-shot solutions to a broad range of analogy problems.
引用
收藏
页码:1526 / 1541
页数:16
相关论文
共 70 条
  • [1] Achiam OJ, 2023, Arxiv, DOI [arXiv:2303.08774, DOI 10.48550/ARXIV.2303.08774]
  • [2] [Anonymous], 2012, Oxford Handbook of Thinking and Reasoning
  • [3] [Anonymous], 1971, Abilities: Their structure, growth, and action
  • [4] Barrett DGT, 2018, PR MACH LEARN RES, V80
  • [5] Using cognitive psychology to understand GPT-3
    Binz, Marcel
    Schulz, Eric
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2023, 120 (06)
  • [6] Brown TB, 2020, ARXIV, DOI DOI 10.48550/ARXIV.2005.14165
  • [7] WHAT ONE INTELLIGENCE-TEST MEASURES - A THEORETICAL ACCOUNT OF THE PROCESSING IN THE RAVEN PROGRESSIVE MATRICES TEST
    CARPENTER, PA
    JUST, MA
    SHELL, P
    [J]. PSYCHOLOGICAL REVIEW, 1990, 97 (03) : 404 - 431
  • [8] Chalmers D. J., 1992, Journal of Experimental and Theoretical Artificial Intelligence, V4, P185, DOI 10.1080/09528139208953747
  • [9] Chan SCY, 2022, ADV NEUR IN
  • [10] Chen Mark, 2021, arXiv, DOI DOI 10.48550/ARXIV.2107.03374