Mid-level visual features underlie the high-level categorical organization of the ventral stream

被引:143
作者
Long, Bria [1 ,2 ]
Yu, Chen-Ping [1 ,3 ]
Konkle, Talia [1 ]
机构
[1] Harvard Univ, Dept Psychol, 33 Kirkland St, Cambridge, MA 02138 USA
[2] Stanford Univ, Dept Psychol, Stanford, CA 94305 USA
[3] Phiar Technol Inc, Palo Alto, CA 94303 USA
关键词
ventral stream organization; mid-level features; object recognition; fMRI; deep neural networks; FUSIFORM FACE AREA; NEURAL REPRESENTATIONS; CONGENITALLY BLIND; TEMPORAL CORTEX; OBJECT CATEGORY; SHAPE; SELECTIVITY; SIZE; PERCEPTION; SIMILARITY;
D O I
10.1073/pnas.1719616115
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Human object-selective cortex shows a large-scale organization characterized by the high-level properties of both animacy and object size. To what extent are these neural responses explained by primitive perceptual features that distinguish animals from objects and big objects from small objects? To address this question, we used a texture synthesis algorithm to create a class of stimuli-texforms-which preserve some mid-level texture and form information from objects while rendering them unrecognizable. We found that unrecognizable texforms were sufficient to elicit the large-scale organizations of object-selective cortex along the entire ventral pathway. Further, the structure in the neural patterns elicited by texforms was well predicted by curvature features and by intermediate layers of a deep convolutional neural network, supporting the mid-level nature of the representations. These results provide clear evidence that a substantial portion of ventral stream organization can be accounted for by coarse texture and form information without requiring explicit recognition of intact objects.
引用
收藏
页码:E9015 / E9024
页数:10
相关论文
共 73 条
[1]   Low-level properties of natural images predict topographic patterns of neural response in the ventral visual pathway [J].
Andrews, Timothy J. ;
Watson, David M. ;
Rice, Grace E. ;
Hartley, Tom .
JOURNAL OF VISION, 2015, 15 (07) :1-12
[2]   Selectivity for low-level features of objects in the human ventral stream [J].
Andrews, Timothy J. ;
Clarke, Alex ;
Pell, Philip ;
Hartley, Tom .
NEUROIMAGE, 2010, 49 (01) :703-711
[3]   Shape Similarity, Better than Semantic Membership, Accounts for the Structure of Visual Object Representations in a Population of Monkey Inferotemporal Neurons [J].
Baldassi, Carlo ;
Alemi-Neissi, Alireza ;
Pagan, Marino ;
DiCarlo, James J. ;
Zecchina, Riccardo ;
Zoccolan, Davide .
PLOS COMPUTATIONAL BIOLOGY, 2013, 9 (08)
[4]   Object Domain and Modality in the Ventral Visual Pathway [J].
Bi, Yanchao ;
Wang, Xiaoying ;
Caramazza, Alfonso .
TRENDS IN COGNITIVE SCIENCES, 2016, 20 (04) :282-290
[5]   RECOGNITION-BY-COMPONENTS - A THEORY OF HUMAN IMAGE UNDERSTANDING [J].
BIEDERMAN, I .
PSYCHOLOGICAL REVIEW, 1987, 94 (02) :115-147
[6]   Dissociations and Associations between Shape and Category Representations in the Two Visual Pathways [J].
Bracci, Stefania ;
Op de Beeck, Hans .
JOURNAL OF NEUROSCIENCE, 2016, 36 (02) :432-444
[7]   Rectilinear Edge Selectivity Is Insufficient to Explain the Category Selectivity of the Parahippocampal Place Area [J].
Bryan, Peter B. ;
Julian, Joshua B. ;
Epstein, Russell A. .
FRONTIERS IN HUMAN NEUROSCIENCE, 2016, 10 :1-12
[8]   The fusiform face area is tuned for curvilinear patterns with more high-contrasted elements in the upper part [J].
Caldara, Roberto ;
Seghier, Mohamed L. ;
Rossion, Bruno ;
Lazeyras, Francois ;
Michel, Christoph ;
Hauert, Claude-Alain .
NEUROIMAGE, 2006, 31 (01) :313-319
[9]   A Sparse Object Coding Scheme in Area V4 [J].
Carlson, Eric T. ;
Rasquinha, Russell J. ;
Zhang, Kechen ;
Connor, Charles E. .
CURRENT BIOLOGY, 2011, 21 (04) :288-293
[10]   Movement and mind:: A functional imaging study of perception and interpretation of complex intentional movement patterns [J].
Castelli, F ;
Happé, F ;
Frith, U ;
Frith, C .
NEUROIMAGE, 2000, 12 (03) :314-325