Teaching deep networks to see shape: Lessons from a simplified visual world

Times cited: 0
Authors
Jarvers, Christian [1]
Neumann, Heiko [1]
Affiliations
[1] Ulm Univ, Inst Neural Informat Proc, Ulm, Germany
Keywords
NEURAL DYNAMICS; PERCEPTION; MODELS; INFORMATION; FRAMEWORK
DOI
10.1371/journal.pcbi.1012019
Chinese Library Classification number
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
Deep neural networks have been remarkably successful as models of the primate visual system. One crucial problem is that they fail to account for the strong shape-dependence of primate vision. Whereas humans base their judgements of category membership to a large extent on shape, deep networks rely much more strongly on other features such as color and texture. While this problem has been widely documented, the underlying reasons remain unclear. We design simple, artificial image datasets in which shape, color, and texture features can be used to predict the image class. By training networks from scratch to classify images with single features and feature combinations, we show that some network architectures are unable to learn to use shape features, whereas others are able to use shape in principle but are biased towards the other features. We show that the bias can be explained by the interactions between the weight updates for many images in mini-batch gradient descent. This suggests that different learning algorithms with sparser, more local weight changes are required to make networks more sensitive to shape and improve their capability to describe human vision.
Pages: 32