Recurrent Convolutional Neural Networks: A Better Model of Biological Object Recognition

被引:130
作者
Spoerer, Courtney J. [1 ]
McClure, Patrick [1 ]
Kriegeskorte, Nikolaus [1 ]
机构
[1] Univ Cambridge, Med Res Council, Cognit & Brain Sci Unit, Cambridge, England
基金
英国医学研究理事会; 欧洲研究理事会;
关键词
object recognition; occlusion; top-down processing; convolutional neural network; recurrent neura network; HIERARCHICAL-MODELS; INFORMATION; ARCHITECTURE; COMPETITION; DYNAMICS; GRADIENT; VISION; SPACE; SPEED; FACE;
D O I
10.3389/fpsyg.2017.01551
中图分类号
B84 [心理学];
学科分类号
04 ; 0402 ;
摘要
Feedforward neural networks provide the dominant model of how the brain performs visual object recognition. However, these networks lack the lateral and feedback connections, and the resulting recurrent neuronal dynamics, of the ventral visual pathway in the human and nonhuman primate brain. Here we investigate recurrent convolutional neural networks with bottom-up (B), lateral (L), and top-down (T) connections. Combining these types of connections yields four architectures (B, BT, BL, and BLT), which we systematically test and compare. We hypothesized that recurrent dynamics might improve recognition performance in the challenging scenario of partial occlusion. We introduce two novel occluded object recognition tasks to test the efficacy of the models, digit clutter (where multiple target digits occlude one another) and digit debris (where target digits are occluded by digit fragments). We find that recurrent neural networks outperform feedforward control models (approximately matched in parametric complexity) at recognising objects, both in the absence of occlusion and in all occlusion conditions. Recurrent neural networks are not only more neurobiologically plausible in their architecture; their dynamics also afford superior task performance. This work shows that computer vision can benefit from using recurrent convolutional architectures and suggests that the ubiquitous recurrent connections in biological brains are essential for task performance.
引用
收藏
页数:14
相关论文
共 49 条
[1]   Lateral competition for cortical space by layer-specific horizontal circuits [J].
Adesnik, Hillel ;
Scanziani, Massimo .
NATURE, 2010, 464 (7292) :1155-U71
[2]  
[Anonymous], 2013, Advances in Neural Information Processing Systems
[3]   CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING [J].
BENJAMINI, Y ;
HOCHBERG, Y .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) :289-300
[4]   Dynamic shape synthesis in posterior inferotemporal cortex [J].
Brincat, SL ;
Connor, CE .
NEURON, 2006, 49 (01) :17-24
[5]   Representational dynamics of object vision: The first 1000 ms [J].
Carlson, Thomas ;
Tovar, David A. ;
Alink, Arjen ;
Kriegeskorte, Nikolaus .
JOURNAL OF VISION, 2013, 13 (10)
[6]   Comparison of deep neural networks to spatio-temporal cortical dynamics of human visual object recognition reveals hierarchical correspondence [J].
Cichy, Radoslaw Martin ;
Khosla, Aditya ;
Pantazis, Dimitrios ;
Torralba, Antonio ;
Oliva, Aude .
SCIENTIFIC REPORTS, 2016, 6
[7]   Resolving human object recognition in space and time [J].
Cichy, Radoslaw Martin ;
Pantazis, Dimitrios ;
Oliva, Aude .
NATURE NEUROSCIENCE, 2014, 17 (03) :455-462
[8]   Predicting the Time Course of Individual Objects with MEG [J].
Clarke, Alex ;
Devereux, Barry J. ;
Randall, Billi ;
Tyler, Lorraine K. .
CEREBRAL CORTEX, 2015, 25 (10) :3602-3612
[9]   A neural model of figure-ground organization [J].
Craft, Edward ;
Schuetze, Hartmut ;
Niebur, Ernst ;
von der Heydt, Ruediger .
JOURNAL OF NEUROPHYSIOLOGY, 2007, 97 (06) :4310-4326
[10]   How Does the Brain Solve Visual Object Recognition? [J].
DiCarlo, James J. ;
Zoccolan, Davide ;
Rust, Nicole C. .
NEURON, 2012, 73 (03) :415-434