Deep Neural Networks as a Computational Model for Human Shape Sensitivity

被引:185
作者
Kubilius, Jonas [1 ]
Bracci, Stefania [1 ]
Op de Beeck, Hans P. [1 ]
机构
[1] Univ Leuven, KU Leuven, Brain & Cognit, Leuven, Belgium
基金
欧洲研究理事会;
关键词
GREATER SENSITIVITY; OBJECT REPRESENTATIONS; HIERARCHICAL-MODELS; METRIC CHANGES; RECOGNITION; NEURONS; PIGEONS; CORTEX; SET; V4;
D O I
10.1371/journal.pcbi.1004896
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Theories of object recognition agree that shape is of primordial importance, but there is no consensus about how shape might be represented, and so far attempts to implement a model of shape perception that would work with realistic stimuli have largely failed. Recent studies suggest that state-of-the-art convolutional 'deep' neural networks (DNNs) capture important aspects of human object perception. We hypothesized that these successes might be partially related to a human-like representation of object shape. Here we demonstrate that sensitivity for shape features, characteristic to human and primate vision, emerges in DNNs when trained for generic object recognition from natural photographs. We show that these models explain human shape judgments for several benchmark behavioral and neural stimulus sets on which earlier models mostly failed. In particular, although never explicitly trained for such stimuli, DNNs develop acute sensitivity to minute variations in shape and to non-accidental properties that have long been implicated to form the basis for object recognition. Even more strikingly, when tested with a challenging stimulus set in which shape and category membership are dissociated, the most complex model architectures capture human shape sensitivity as well as some aspects of the category structure that emerges from human judgments. As a whole, these results indicate that convolutional neural networks not only learn physically correct representations of object categories but also develop perceptually accurate representational spaces of shapes. An even more complete model of human object representations might be in sight by training deep architectures for multiple tasks, which is so characteristic in human development.
引用
收藏
页数:26
相关论文
共 67 条
[1]   Greater sensitivity to nonaccidental than metric shape properties in preschool children [J].
Amir, Ori ;
Biederman, Irving ;
Herald, Sarah B. ;
Shah, Manan P. ;
Mintz, Toben H. .
VISION RESEARCH, 2014, 97 :83-88
[2]   Sensitivity to nonaccidental properties across various shape dimensions [J].
Amir, Ori ;
Biederman, Irving ;
Hayworth, Kenneth J. .
VISION RESEARCH, 2012, 62 :35-43
[3]   The neural basis for shape preferences [J].
Amir, Ori ;
Biederman, Irving ;
Hayworth, Kenneth J. .
VISION RESEARCH, 2011, 51 (20) :2198-2206
[4]   Orientation-Cue Invariant Population Responses to Contrast-Modulated and Phase-Reversed Contour Stimuli in Macaque V1 and V2 [J].
An, Xu ;
Gong, Hongliang ;
Yin, Jiapeng ;
Wang, Xiaochun ;
Pan, Yanxia ;
Zhang, Xian ;
Lu, Yiliang ;
Yang, Yupeng ;
Toth, Zoltan ;
Schiessl, Ingo ;
McLoughlin, Niall ;
Wang, Wei .
PLOS ONE, 2014, 9 (09)
[5]  
Anguelov D, 2015, PROC IEEE C COMPUT V
[6]  
[Anonymous], 2014, ARXIV14090575CS
[7]  
[Anonymous], ARXIV150507376CSQBIO
[8]  
[Anonymous], 1985, Perceptual Organization and Visual Recognition
[9]  
[Anonymous], ARXIV14116836CS
[10]  
[Anonymous], PSYCHOL SCI