Evidence that recurrent circuits are critical to the ventral stream's execution of core object recognition behavior

被引:267
作者
Kar, Kohitij [1 ,2 ,3 ]
Kubilius, Jonas [1 ,2 ,4 ]
Schmidt, Kailyn [1 ,2 ]
Issa, Elias B. [1 ,2 ,5 ]
DiCarlo, James J. [1 ,2 ,3 ]
机构
[1] MIT, McGovern Inst Brain Res, 77 Massachusetts Ave, Cambridge, MA 02139 USA
[2] MIT, Dept Brain & Cognit Sci, E25-618, Cambridge, MA 02139 USA
[3] MIT, Ctr Brains Minds & Machines, 77 Massachusetts Ave, Cambridge, MA 02139 USA
[4] Katholieke Univ Leuven, Brain & Cognit, Leuven, Belgium
[5] Columbia Univ, Dept Neurosci, Zuckerman Mind Brain Behav Inst, New York, NY USA
关键词
CATEGORY REPRESENTATIONS; FEEDBACK CONNECTIONS; CORTEX; MONKEY; FEEDFORWARD; ATTENTION; RESPONSES; MODELS; SPEED; V1;
D O I
10.1038/s41593-019-0392-5
中图分类号
Q189 [神经科学];
学科分类号
071006 ;
摘要
Non-recurrent deep convolutional neural networks (CNNs) are currently the best at modeling core object recognition, a behavior that is supported by the densely recurrent primate ventral stream, culminating in the inferior temporal (IT) cortex. If recurrence is critical to this behavior, then primates should outperform feedforward-only deep CNNs for images that require additional recurrent processing beyond the feedforward IT response. Here we first used behavioral methods to discover hundreds of these 'challenge' images. Second, using large-scale electrophysiology, we observed that behaviorally sufficient object identity solutions emerged similar to 30 ms later in the IT cortex for challenge images compared with primate performance-matched 'control' images. Third, these behaviorally critical late-phase IT response patterns were poorly predicted by feedforward deep CNN activations. Notably, very-deep CNNs and shallower recurrent CNNs better predicted these late IT responses, suggesting that there is a functional equivalence between additional nonlinear transformations and recurrence. Beyond arguing that recurrent circuits are critical for rapid object identification, our results provide strong constraints for future recurrent model development.
引用
收藏
页码:974 / +
页数:16
相关论文
共 54 条
[1]  
[Anonymous], 2018, Adversarial examples that fool both computer vision and time-limited humans
[2]  
[Anonymous], 2017, COMP DEEP NEURAL NET
[3]  
[Anonymous], 2014, ABS14053531 CORR
[4]  
[Anonymous], 2016, BRIDGING GAPS RESIDU
[5]   Impact of learning on representation of parts and wholes in monkey inferotemporal cortex [J].
Baker, CI ;
Behrmann, M ;
Olson, CR .
NATURE NEUROSCIENCE, 2002, 5 (11) :1210-1216
[6]   Top-down facilitation of visual recognition [J].
Bar, M ;
Kassam, KS ;
Ghuman, AS ;
Boshyan, J ;
Schmidt, AM ;
Dale, AM ;
Hämäläinen, MS ;
Marinkovic, K ;
Schacter, DL ;
Rosen, BR ;
Halgren, E .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2006, 103 (02) :449-454
[7]   A Source for Feature-Based Attention in the Prefrontal Cortex [J].
Bichot, Narcisse P. ;
Heard, Matthew T. ;
DeGennaro, Ellen M. ;
Desimone, Robert .
NEURON, 2015, 88 (04) :832-844
[8]   Task alters category representations in prefrontal but not high-level visual cortex [J].
Bugatus, Lior ;
Weiner, Kevin S. ;
Grill-Spector, Kalanit .
NEUROIMAGE, 2017, 155 :437-449
[9]   Deep Neural Networks Rival the Representation of Primate IT Cortex for Core Visual Object Recognition [J].
Cadieu, Charles F. ;
Hong, Ha ;
Yamins, Daniel L. K. ;
Pinto, Nicolas ;
Ardila, Diego ;
Solomon, Ethan A. ;
Majaj, Najib J. ;
DiCarlo, James J. .
PLOS COMPUTATIONAL BIOLOGY, 2014, 10 (12)
[10]   How Does the Brain Solve Visual Object Recognition? [J].
DiCarlo, James J. ;
Zoccolan, Davide ;
Rust, Nicole C. .
NEURON, 2012, 73 (03) :415-434