Evidence that recurrent circuits are critical to the ventral stream's execution of core object recognition behavior

被引：267

作者：

Kar, Kohitij ^{[1
,2
,3
]}

Kubilius, Jonas ^{[1
,2
,4
]}

Schmidt, Kailyn ^{[1
,2
]}

Issa, Elias B. ^{[1
,2
,5
]}

DiCarlo, James J. ^{[1
,2
,3
]}

机构：

[1] MIT, McGovern Inst Brain Res, 77 Massachusetts Ave, Cambridge, MA 02139 USA

[2] MIT, Dept Brain & Cognit Sci, E25-618, Cambridge, MA 02139 USA

[3] MIT, Ctr Brains Minds & Machines, 77 Massachusetts Ave, Cambridge, MA 02139 USA

[4] Katholieke Univ Leuven, Brain & Cognit, Leuven, Belgium

[5] Columbia Univ, Dept Neurosci, Zuckerman Mind Brain Behav Inst, New York, NY USA

来源：

NATURE NEUROSCIENCE | 2019年 / 22卷 / 06期

关键词：

CATEGORY REPRESENTATIONS; FEEDBACK CONNECTIONS; CORTEX; MONKEY; FEEDFORWARD; ATTENTION; RESPONSES; MODELS; SPEED; V1;

D O I：

10.1038/s41593-019-0392-5

中图分类号：

Q189 [神经科学];

学科分类号：

071006 ;

摘要：

Non-recurrent deep convolutional neural networks (CNNs) are currently the best at modeling core object recognition, a behavior that is supported by the densely recurrent primate ventral stream, culminating in the inferior temporal (IT) cortex. If recurrence is critical to this behavior, then primates should outperform feedforward-only deep CNNs for images that require additional recurrent processing beyond the feedforward IT response. Here we first used behavioral methods to discover hundreds of these 'challenge' images. Second, using large-scale electrophysiology, we observed that behaviorally sufficient object identity solutions emerged similar to 30 ms later in the IT cortex for challenge images compared with primate performance-matched 'control' images. Third, these behaviorally critical late-phase IT response patterns were poorly predicted by feedforward deep CNN activations. Notably, very-deep CNNs and shallower recurrent CNNs better predicted these late IT responses, suggesting that there is a functional equivalence between additional nonlinear transformations and recurrence. Beyond arguing that recurrent circuits are critical for rapid object identification, our results provide strong constraints for future recurrent model development.

引用

页码：974 / +

页数：16

共 54 条

[1]

[Anonymous], 2018, Adversarial examples that fool both computer vision and time-limited humans

[2]

[Anonymous], 2017, COMP DEEP NEURAL NET

[3]

[Anonymous], 2014, ABS14053531 CORR

[4]

[Anonymous], 2016, BRIDGING GAPS RESIDU