Teaching deep networks to see shape: Lessons from a simplified visual world

被引：0

作者：

Jarvers, Christian ^{[1
]}

Neumann, Heiko ^{[1
]}

机构：

[1] Ulm Univ, Inst Neural Informat Proc, Ulm, Germany

来源：

PLOS COMPUTATIONAL BIOLOGY | 2024年 / 20卷 / 11期

关键词：

NEURAL DYNAMICS; PERCEPTION; MODELS; INFORMATION; FRAMEWORK;

D O I：

10.1371/journal.pcbi.1012019

中图分类号：

Q5 [生物化学];

学科分类号：

071010 ; 081704 ;

摘要：

Deep neural networks have been remarkably successful as models of the primate visual system. One crucial problem is that they fail to account for the strong shape-dependence of primate vision. Whereas humans base their judgements of category membership to a large extent on shape, deep networks rely much more strongly on other features such as color and texture. While this problem has been widely documented, the underlying reasons remain unclear. We design simple, artificial image datasets in which shape, color, and texture features can be used to predict the image class. By training networks from scratch to classify images with single features and feature combinations, we show that some network architectures are unable to learn to use shape features, whereas others are able to use shape in principle but are biased towards the other features. We show that the bias can be explained by the interactions between the weight updates for many images in mini-batch gradient descent. This suggests that different learning algorithms with sparser, more local weight changes are required to make networks more sensitive to shape and improve their capability to describe human vision.

引用

页数：32

共 79 条

[1] Amini S, 2024, Arxiv, DOI arXiv:2406.05927
[2] Atanasov A, 2021, INT C LEARN REPR
[3] Does the brain's ventral visual pathway compute object shape?
Ayzenberg, Vladislav
Behrmann, Marlene
[J]. TRENDS IN COGNITIVE SCIENCES, 2022, 26 (12) : 1119 - 1132
[4] Ba J., 2016, arXiv
[5] Deep learning models fail to capture the configural nature of human shape perception
Baker, Nicholas
Elder, James H.
[J]. ISCIENCE, 2022, 25 (09)
[6] Local features and global shape information in object classification by deep convolutional neural networks
Baker, Nicholas
Lu, Hongjing
Erlikhman, Gennady
Kellman, Philip J.
[J]. VISION RESEARCH, 2020, 172 : 46 - 61
[7] Deep convolutional networks do not classify based on global object shape
Baker, Nicholas
Lu, Hongjing
Erlikhman, Gennady
Kellman, Philip J.
[J]. PLOS COMPUTATIONAL BIOLOGY, 2018, 14 (12)
[8] NEUROSCIENCE Neural population control via deep image synthesis
Bashivan, Pouya
Kar, Kohitij
DiCarlo, James J.
[J]. SCIENCE, 2019, 364 (6439) : 453 - +
[9] Deep problems with neural network models of human vision
Bowers, Jeffrey S.
Malhotra, Gaurav
Dujmovic, Marin
Llera Montero, Milton
Tsvetkov, Christian
Biscione, Valerio
Puebla, Guillermo
Adolfi, Federico
Hummel, John E.
Heaton, Rachel F.
Evans, Benjamin D.
Mitchell, Jeffrey
Blything, Ryan
[J]. BEHAVIORAL AND BRAIN SCIENCES, 2022, 46
[10] Computing with a Canonical Neural Circuits Model with Pool Normalization and Modulating Feedback
Brosch, Tobias
Neumann, Heiko
[J]. NEURAL COMPUTATION, 2014, 26 (12) : 2735 - 2789

← 1 2 3 4 5 6 7 8 →