There Is a "U" in Clutter: Evidence for Robust Sparse Codes Underlying Clutter Tolerance in Human Vision

被引:11
作者
Cox, Patrick H. [1 ]
Riesenhuber, Maximilian [1 ]
机构
[1] Georgetown Univ, Med Ctr, Dept Neurosci, Washington, DC 20007 USA
基金
美国国家科学基金会;
关键词
clutter; HMAX; sparse coding; vision; FUSIFORM FACE AREA; DORSOLATERAL PREFRONTAL CORTEX; PERCEPTUAL DECISION-MAKING; SHAPE-BASED MODEL; RESPONSE NORMALIZATION; OBJECT RECOGNITION; RECEPTIVE-FIELDS; MACAQUE; NEURONS; FMRI;
D O I
10.1523/JNEUROSCI.1211-15.2015
中图分类号
Q189 [神经科学];
学科分类号
071006 ;
摘要
The ability to recognize objects in clutter is crucial for human vision, yet the underlying neural computations remain poorly understood. Previous single-unit electrophysiology recordings in inferotemporal cortex in monkeys and fMRI studies of object-selective cortex in humans have shown that the responses to pairs of objects can sometimes be well described as a weighted average of the responses to the constituent objects. Yet, from a computational standpoint, it is not clear how the challenge of object recognition in clutter can be solved if downstream areas must disentangle the identity of an unknown number of individual objects from the confounded average neuronal responses. An alternative idea is that recognition is based on a subpopulation of neurons that are robust to clutter, i.e., that do not show response averaging, but rather robust object-selective responses in the presence of clutter. Here we show that simulations using the HMAX model of object recognition in cortex can fit the aforementioned single-unit and fMRI data, showing that the averaging-like responses can be understood as the result of responses of object-selective neurons to suboptimal stimuli. Moreover, the model shows how object recognition can be achieved by a sparse readout of neurons whose selectivity is robust to clutter. Finally, the model provides a novel prediction about human object recognition performance, namely, that target recognition ability should show a U-shaped dependency on the similarity of simultaneously presented clutter objects. This prediction is confirmed experimentally, supporting a simple, unifying model of how the brain performs object recognition in clutter.
引用
收藏
页码:14148 / 14159
页数:12
相关论文
共 76 条
[1]   Microstimulation of inferotemporal cortex influences face categorization [J].
Afraz, Seyed-Reza ;
Kiani, Roozbeh ;
Esteky, Hossein .
NATURE, 2006, 442 (7103) :692-695
[2]   Robust Selectivity to Two-Object Images in Human Visual Cortex [J].
Agam, Yigal ;
Liu, Hesheng ;
Papanastassiou, Alexander ;
Buia, Calin ;
Golby, Alexandra J. ;
Madsen, Joseph R. ;
Kreiman, Gabriel .
CURRENT BIOLOGY, 2010, 20 (09) :872-879
[3]   ECCENTRIC VISION - ADVERSE INTERACTIONS BETWEEN LINE SEGMENTS [J].
ANDRIESSEN, JJ ;
BOUMA, H .
VISION RESEARCH, 1976, 16 (01) :71-78
[4]   The distributed representation of random and meaningful object pairs in human occipitotemporal cortex: The weighted average as a general rule [J].
Baeck, Annelies ;
Wagemans, Johan ;
Op de Beeck, Hans P. .
NEUROIMAGE, 2013, 70 :37-47
[5]   A summary-statistic representation in peripheral vision explains visual crowding [J].
Balas, Benjamin ;
Nakano, Lisa ;
Rosenholtz, Ruth .
JOURNAL OF VISION, 2009, 9 (12)
[6]   A morphable model for the synthesis of 3D faces [J].
Blanz, V ;
Vetter, T .
SIGGRAPH 99 CONFERENCE PROCEEDINGS, 1999, :187-194
[7]   INTERACTION EFFECTS IN PARAFOVEAL LETTER RECOGNITION [J].
BOUMA, H .
NATURE, 1970, 226 (5241) :177-&
[8]   The psychophysics toolbox [J].
Brainard, DH .
SPATIAL VISION, 1997, 10 (04) :433-436
[9]  
Britten KH, 1999, J NEUROSCI, V19, P5074
[10]   A model of V4 shape selectivity and invariance [J].
Cadieu, Charles ;
Kouh, Minjoon ;
Pasupathy, Anitha ;
Connor, Charles E. ;
Riesenhuber, Maximilian ;
Poggio, Tomaso .
JOURNAL OF NEUROPHYSIOLOGY, 2007, 98 (03) :1733-1750