Large language models without grounding recover non-sensorimotor but not sensorimotor features of human concepts

被引:1
作者
Xu, Qihui [1 ]
Peng, Yingying [2 ]
Nastase, Samuel A. [3 ,4 ]
Chodorow, Martin [5 ,6 ]
Wu, Minghua [2 ]
Li, Ping [2 ,7 ]
机构
[1] Ohio State Univ, Dept Psychol, Columbus, OH 43210 USA
[2] Hong Kong Polytech Univ, Fac Humanities, Dept Chinese & Bilingual Studies, Hong Kong, Peoples R China
[3] Princeton Univ, Dept Psychol, Princeton, NJ USA
[4] Princeton Univ, Princeton Neurosci Inst, Princeton, NJ USA
[5] CUNY Hunter Coll, Dept Psychol, New York, NY USA
[6] CUNY, Grad Ctr, Dept Psychol, New York, NY USA
[7] PolyU Hangzhou Technol & Innovat Res Inst, Hangzhou, Peoples R China
关键词
RATINGS; KNOWLEDGE; BLIND; NORMS; REPRESENTATIONS; CONCRETENESS; ACQUISITION; ATTENTION;
D O I
10.1038/s41562-025-02203-8
中图分类号
B84 [心理学];
学科分类号
04 ; 0402 ;
摘要
To what extent can language give rise to complex conceptual representation? Is multisensory experience essential? Recent large language models (LLMs) challenge the necessity of grounding for concept formation: whether LLMs without grounding nevertheless exhibit human-like representations. Here we compare multidimensional representations of similar to 4,442 lexical concepts between humans (the Glasgow Norms(1), N = 829; and the Lancaster Norms(2), N = 3,500) and state-of-the-art LLMs with and without visual learning, across non-sensorimotor, sensory and motor domains. We found that (1) the similarity between model and human representations decreases from non-sensorimotor to sensory domains and is minimal in motor domains, indicating a systematic divergence, and (2) models with visual learning exhibit enhanced similarity with human representations in visual-related dimensions. These results highlight the potential limitations of language in isolation for LLMs and that the integration of diverse modalities can potentially enhance alignment with human conceptual representation.
引用
收藏
页数:22
相关论文
共 77 条
[1]   Convergence of visual and tactile shape processing in the human lateral occipital complex [J].
Amedi, A ;
Jacobson, G ;
Hendler, T ;
Malach, R ;
Zohary, E .
CEREBRAL CORTEX, 2002, 12 (11) :1202-1212
[2]   Perceptual and motor attribute ratings for 559 object concepts [J].
Amsel, Ben D. ;
Urbach, Thomas P. ;
Kutas, Marta .
BEHAVIOR RESEARCH METHODS, 2012, 44 (04) :1028-1041
[3]  
Anil R., 2023, ARXIV, DOI DOI 10.48550/ARXIV.2305.10403
[4]  
[Anonymous], 2023, GPT-4 Technical Report, DOI DOI 10.48550/ARXIV.2303.08774
[5]  
[Anonymous], 2022, OpenAI
[6]   Subjective frequency estimates for 2,938 monosyllabic words [J].
Balota, DA ;
Pilotti, M ;
Cortese, MJ .
MEMORY & COGNITION, 2001, 29 (04) :639-647
[7]   Multi-dimensional sensorimotor grounding of concrete and abstract categories [J].
Banks, Briony ;
Connell, Louise .
PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES, 2023, 378 (1870)
[8]   Grounded cognition [J].
Barsalou, Lawrence W. .
ANNUAL REVIEW OF PSYCHOLOGY, 2008, 59 :617-645
[9]  
Benjamini Y, 2001, ANN STAT, V29, P1165
[10]   Dual coding of knowledge in the human brain [J].
Bi, Yanchao .
TRENDS IN COGNITIVE SCIENCES, 2021, 25 (10) :883-895