Mid-level visual features underlie the high-level categorical organization of the ventral stream

被引：143

作者：

Long, Bria ^{[1
,2
]}

Yu, Chen-Ping ^{[1
,3
]}

Konkle, Talia ^{[1
]}

机构：

[1] Harvard Univ, Dept Psychol, 33 Kirkland St, Cambridge, MA 02138 USA

[2] Stanford Univ, Dept Psychol, Stanford, CA 94305 USA

[3] Phiar Technol Inc, Palo Alto, CA 94303 USA

来源：

PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA | 2018年 / 115卷 / 38期

关键词：

ventral stream organization; mid-level features; object recognition; fMRI; deep neural networks; FUSIFORM FACE AREA; NEURAL REPRESENTATIONS; CONGENITALLY BLIND; TEMPORAL CORTEX; OBJECT CATEGORY; SHAPE; SELECTIVITY; SIZE; PERCEPTION; SIMILARITY;

D O I：

10.1073/pnas.1719616115

中图分类号：

O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];

学科分类号：

07 ; 0710 ; 09 ;

摘要：

Human object-selective cortex shows a large-scale organization characterized by the high-level properties of both animacy and object size. To what extent are these neural responses explained by primitive perceptual features that distinguish animals from objects and big objects from small objects? To address this question, we used a texture synthesis algorithm to create a class of stimuli-texforms-which preserve some mid-level texture and form information from objects while rendering them unrecognizable. We found that unrecognizable texforms were sufficient to elicit the large-scale organizations of object-selective cortex along the entire ventral pathway. Further, the structure in the neural patterns elicited by texforms was well predicted by curvature features and by intermediate layers of a deep convolutional neural network, supporting the mid-level nature of the representations. These results provide clear evidence that a substantial portion of ventral stream organization can be accounted for by coarse texture and form information without requiring explicit recognition of intact objects.

引用

页码：E9015 / E9024

页数：10

共 73 条

[1] Low-level properties of natural images predict topographic patterns of neural response in the ventral visual pathway [J].