Visual Object Recognition: Do We (Finally) Know More Now Than We Did?

被引：47

作者：

Gauthier, Isabel ^{[1
]}

Tarr, Michael J. ^{[2
]}

机构：

[1] Vanderbilt Univ, Dept Psychol, Nashville, TN 37240 USA

[2] Carnegie Mellon Univ, Dept Psychol, Ctr Neural Basis Cognit, Pittsburgh, PA 15213 USA

来源：

ANNUAL REVIEW OF VISION SCIENCE, VOL 2 | 2016年 / 2卷

关键词：

object recognition; face recognition; invariance; category selectivity; decoding; deep neural networks; FUSIFORM FACE AREA; DEEP NEURAL-NETWORKS; HIERARCHICAL-MODELS; DIFFERING VIEWS; EXPERTISE; REPRESENTATIONS; SHAPE; CORTEX; FMRI; SPECIFICITY;

D O I：

10.1146/annurev-vision-111815-114621

中图分类号：

Q189 [神经科学];

学科分类号：

071006 ;

摘要：

How do we recognize objects despite changes in their appearance? The past three decades have been witness to intense debates regarding both whether objects are encoded invariantly with respect to viewing conditions and whether specialized, separable mechanisms are used for the recognition of different object categories. We argue that such dichotomous debates ask the wrong question. Much more important is the nature of object representations: What are features that enable invariance or differential processing between categories? Although the nature of object features is still an unanswered question, new methods for connecting data to models show significant potential for helping us to better understand neural codes for objects. Most prominently, new approaches to analyzing data from functional magnetic resonance imaging, including neural decoding and representational similarity analysis, and new computational models of vision, including convolutional neural networks, have enabled a much more nuanced understanding of visual representation. Convolutional neural networks are particularly intriguing as a tool for studying biological vision in that this class of artificial vision systems, based on biologically plausible deep neural networks, exhibits visual recognition capabilities that are approaching those of human observers. As these models improve in their recognition performance, it appears that they also become more effective in predicting and accounting for neural responses in the ventral cortex. Applying these and other deep models to empirical data shows great promise for enabling future progress in the study of visual recognition.

引用

页码：377 / 396

页数：20

共 72 条

[41] Distinctive image features from scale-invariant keypoints
Lowe, DG
[J]. INTERNATIONAL JOURNAL OF COMPUTER VISION, 2004, 60 (02) : 91 - 110
[42] Marr D., 1982, Vision. A computational investigation into the human representation and processing of visual information
[43] The representation of object concepts in the brain
Martin, Alex
[J]. ANNUAL REVIEW OF PSYCHOLOGY, 2007, 58 : 25 - 45
[44] The visual word form area: expertise for reading in the fusiform gyrus
McCandliss, BD
Cohen, L
Dehaene, S
[J]. TRENDS IN COGNITIVE SCIENCES, 2003, 7 (07) : 293 - 299
[45] Cortical Thickness in Fusiform Face Area Predicts Face and Object Recognition Performance
McGugin, Rankin W.
Van Gulick, Ana E.
Gauthier, Isabel
[J]. JOURNAL OF COGNITIVE NEUROSCIENCE, 2016, 28 (02) : 282 - 294
[46] The Vanderbilt Expertise Test reveals domain-general and domain-specific sex effects in object recognition
McGugin, Rankin W.
Richler, Jennifer J.
Herzmann, Grit
Speegle, Magen
Gauthier, Isabel
[J]. VISION RESEARCH, 2012, 69 : 10 - 22
[47] Robust expertise effects in right FFA
McGugin, Rankin Williams
Newton, Allen T.
Gore, John C.
Gauthier, Isabel
[J]. NEUROPSYCHOLOGIA, 2014, 63 : 135 - 144
[48] Expertise Effects in Face-Selective Areas are Robust to Clutter and Diverted Attention, but not to Competition
McGugin, Rankin Williams
Van Gulick, Ana E.
Tamber-Rosenau, Benjamin J.
Ross, David A.
Gauthier, Isabel
[J]. CEREBRAL CORTEX, 2015, 25 (09) : 2610 - 2622
[49] High-resolution imaging of expertise reveals reliable object selectivity in the fusiform face area related to perceptual performance
McGugin, Rankin Williams
Gatenby, J. Christopher
Gore, John C.
Gauthier, Isabel
[J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2012, 109 (42) : 17063 - 17068
[50] What is special about face recognition? Nineteen experiments on a person with visual object agnosia and dyslexia but normal face recognition
Moscovitch, M
Winocur, G
Behrmann, M
[J]. JOURNAL OF COGNITIVE NEUROSCIENCE, 1997, 9 (05) : 555 - 604

← 1 2 3 4 5 6 7 8 →