Neural Mechanisms Underlying Visual Object Recognition

被引:10
作者
Afraz, Arash
Yamins, Daniel L. K.
DiCarlo, James J. [1 ]
机构
[1] MIT, Dept Brain & Cognit Sci, E25-618, Cambridge, MA 02139 USA
来源
COGNITION, VOL 79, 2014 | 2014年 / 79卷
关键词
INFEROTEMPORAL CORTEX; TEMPORAL CORTEX; INFORMATION; NEURONS; MICROSTIMULATION; EYE;
D O I
10.1101/sqb.2014.79.024729
中图分类号
B84 [心理学]; C [社会科学总论]; Q98 [人类学];
学科分类号
03 ; 0303 ; 030303 ; 04 ; 0402 ;
摘要
Invariant visual object recognition and the underlying neural representations are fundamental to higher-level human cognition. To understand these neural underpinnings, we combine human and monkey psychophysics, large-scale neurophysiology, neural perturbation methods, and computational modeling to construct falsifiable, predictive models that aim to fully account for the neural encoding and decoding processes that underlie visual object recognition. A predictive encoding model must minimally describe the transformation of the retinal image to population patterns of neural activity along the entire cortical ventral stream of visual processing and must accurately predict the responses to any retinal image. A predictive decoding model must minimally describe the transformation from those population patterns of neural activity to observed object recognition behavior ( i. e., subject reports), and, given that population pattern of activity, it must accurately predict behavior for any object recognition task. To date, we have focused on core object recognition-a remarkable behavior that is accomplished with image viewing durations of <200 msec. Our work thus far reveals that the neural encoding process is reasonably well explained by a largely feed-forward, highly complex, multistaged nonlinear neural network-the current best neuronal simulation models predict approximately one-half of the relevant neuronal response variance across the highest levels of the ventral stream ( areas V4 and IT). Remarkably, however, the decoding process from IT to behavior for all object recognition tasks tested thus far is very accurately predicted by simple direct linear conversion of the inferior temporal neural population state to behavior choice. We have recently examined the behavioral consequences of direct suppression of IT neural activity using pharmacological and optogenetic methods and find them to be well-explained by the same linear decoding model.
引用
收藏
页码:99 / 107
页数:9
相关论文
共 32 条
  • [1] Optogenetic and pharmacological suppression of spatial clusters of face neurons reveal their causal role in face gender discrimination
    Afraz, Arash
    Boyden, Edward S.
    DiCarlo, James J.
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2015, 112 (21) : 6730 - 6735
  • [2] Microstimulation of inferotemporal cortex influences face categorization
    Afraz, Seyed-Reza
    Kiani, Roozbeh
    Esteky, Hossein
    [J]. NATURE, 2006, 442 (7103) : 692 - 695
  • [3] [Anonymous], 1990, VISUAL AGNOSIA DISOR
  • [4] Object Representations in the Temporal Cortex of Monkeys and Humans as Revealed by Functional Magnetic Resonance Imaging
    Bell, Andrew H.
    Hadj-Bouziane, Fadila
    Frihauf, Jennifer B.
    Tootell, Roger B. H.
    Ungerleider, Leslie G.
    [J]. JOURNAL OF NEUROPHYSIOLOGY, 2009, 101 (02) : 688 - 700
  • [5] PRIMATE FRONTAL EYE FIELDS .1. SINGLE NEURONS DISCHARGING BEFORE SACCADES
    BRUCE, CJ
    GOLDBERG, ME
    [J]. JOURNAL OF NEUROPHYSIOLOGY, 1985, 53 (03) : 603 - 635
  • [6] Do we know what the early visual system does?
    Carandini, M
    Demb, JB
    Mante, V
    Tolhurst, DJ
    Dan, Y
    Olshausen, BA
    Gallant, JL
    Rust, NC
    [J]. JOURNAL OF NEUROSCIENCE, 2005, 25 (46) : 10577 - 10597
  • [7] Transformation of shape information in the ventral pathway
    Connor, Charles E.
    Brincat, Scott L.
    Pasupathy, Anitha
    [J]. CURRENT OPINION IN NEUROBIOLOGY, 2007, 17 (02) : 140 - 147
  • [8] DESIMONE R, 1984, J NEUROSCI, V4, P2051
  • [9] Untangling invariant object recognition
    DiCarlo, James J.
    Cox, David D.
    [J]. TRENDS IN COGNITIVE SCIENCES, 2007, 11 (08) : 333 - 341
  • [10] How Does the Brain Solve Visual Object Recognition?
    DiCarlo, James J.
    Zoccolan, Davide
    Rust, Nicole C.
    [J]. NEURON, 2012, 73 (03) : 415 - 434