Human Shape Representations Are Not an Emergent Property of Learning to Classify Objects

被引:1
|
作者
Malhotra, Gaurav [1 ,3 ]
Dujmovic, Marin [1 ]
Hummel, John [2 ]
Bowers, Jeffrey S. [1 ]
机构
[1] Univ Bristol, Sch Psychol Sci, Bristol, England
[2] Univ Illinois, Dept Psychol, Champaign, IL USA
[3] Univ Bristol, Sch Psychol Sci, 12A Priory Rd, Bristol BS8 1TU, England
基金
欧洲研究理事会;
关键词
vision; convolutional neural networks; object recognition; shape representation; relational representation; RECOGNITION; SURFACE; MODELS; PERCEPTION; INVARIANCE; NETWORK;
D O I
10.1037/xge0001440
中图分类号
B84 [心理学];
学科分类号
04 ; 0402 ;
摘要
Humans are particularly sensitive to relationships between parts of objects. It remains unclear why this is. One hypothesis is that relational features are highly diagnostic of object categories and emerge as a result of learning to classify objects. We tested this by analyzing the internal representations of supervised convolutional neural networks (CNNs) trained to classify large sets of objects. We found that CNNs do not show the same sensitivity to relational changes as previously observed for human participants. Furthermore, when we precisely controlled the deformations to objects, human behavior was best predicted by the number of relational changes while CNNs were equally sensitive to all changes. Even changing the statistics of the learning environment by making relations uniquely diagnostic did not make networks more sensitive to relations in general. Our results show that learning to classify objects is not sufficient for the emergence of human shape representations. Instead, these results suggest that humans are selectively sensitive to relational changes because they build representations of distal objects from their retinal images and interpret relational changes as changes to these distal objects. This inferential process makes human shape representations qualitatively different from those of artificial neural networks optimized to perform image classification.
引用
收藏
页码:3380 / 3402
页数:23
相关论文
共 39 条
  • [1] Learning separate visual representations of independently rotating objects
    Tromans, James Matthew
    Page, Hector J. I.
    Stringer, Simon M.
    NETWORK-COMPUTATION IN NEURAL SYSTEMS, 2012, 23 (1-2) : 1 - 23
  • [2] Learning illumination- and orientation-invariant representations of objects through temporal association
    Wallis, Guy
    Backus, Benjamin T.
    Langer, Michael
    Huebner, Gesche
    Buelthoff, Heinrich
    JOURNAL OF VISION, 2009, 9 (07):
  • [3] Skeletal representations of shape in the human visual cortex
    Ayzenberg, Vladislav
    Kamps, Frederik S.
    Dilks, Daniel D.
    Lourenco, Stella F.
    NEUROPSYCHOLOGIA, 2022, 164
  • [4] Object recognition learning differentiates the representations of objects at the ERP component N1
    Wang, G.
    Suemitsu, K.
    CLINICAL NEUROPHYSIOLOGY, 2007, 118 (02) : 372 - 380
  • [5] Learning Invariant Visual Shape Representations from Physics
    Franzius, Mathias
    Wersing, Heiko
    ARTIFICIAL NEURAL NETWORKS (ICANN 2010), PT III, 2010, 6354 : 298 - 302
  • [6] Collective Rhythm as an Emergent Property During Human Social Coordination
    Farrera, Arodi
    Ramos-Fernandez, Gabriel
    FRONTIERS IN PSYCHOLOGY, 2022, 12
  • [7] Learning spatiotemporal representations for human fall detection in surveillance video
    Kong, Yongqiang
    Huang, Jianhui
    Huang, Shanshan
    Wei, Zhengang
    Wang, Shengke
    JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2019, 59 : 215 - 230
  • [8] Learning descriptive and distinctive parts of objects with a part-based shape similarity measure
    Lakämper, R
    Latecki, LJ
    Megalooikonomou, V
    Wang, Q
    Wang, XZ
    PROCEEDINGS OF THE SIXTH IASTED INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING, 2004, : 672 - 677
  • [10] Manipulating objects during learning shrinks the global scale of spatial representations in memory: a virtual reality study
    Lhuillier, S.
    Dutriaux, L.
    Nicolas, S.
    Gyselinck, V.
    SCIENTIFIC REPORTS, 2024, 14 (01)