Visual Object Recognition: Do We (Finally) Know More Now Than We Did?

被引：47

作者：

Gauthier, Isabel ^{[1
]}

Tarr, Michael J. ^{[2
]}

机构：

[1] Vanderbilt Univ, Dept Psychol, Nashville, TN 37240 USA

[2] Carnegie Mellon Univ, Dept Psychol, Ctr Neural Basis Cognit, Pittsburgh, PA 15213 USA

来源：

ANNUAL REVIEW OF VISION SCIENCE, VOL 2 | 2016年 / 2卷

关键词：

object recognition; face recognition; invariance; category selectivity; decoding; deep neural networks; FUSIFORM FACE AREA; DEEP NEURAL-NETWORKS; HIERARCHICAL-MODELS; DIFFERING VIEWS; EXPERTISE; REPRESENTATIONS; SHAPE; CORTEX; FMRI; SPECIFICITY;

D O I：

10.1146/annurev-vision-111815-114621

中图分类号：

Q189 [神经科学];

学科分类号：

071006 ;

摘要：

How do we recognize objects despite changes in their appearance? The past three decades have been witness to intense debates regarding both whether objects are encoded invariantly with respect to viewing conditions and whether specialized, separable mechanisms are used for the recognition of different object categories. We argue that such dichotomous debates ask the wrong question. Much more important is the nature of object representations: What are features that enable invariance or differential processing between categories? Although the nature of object features is still an unanswered question, new methods for connecting data to models show significant potential for helping us to better understand neural codes for objects. Most prominently, new approaches to analyzing data from functional magnetic resonance imaging, including neural decoding and representational similarity analysis, and new computational models of vision, including convolutional neural networks, have enabled a much more nuanced understanding of visual representation. Convolutional neural networks are particularly intriguing as a tool for studying biological vision in that this class of artificial vision systems, based on biologically plausible deep neural networks, exhibits visual recognition capabilities that are approaching those of human observers. As these models improve in their recognition performance, it appears that they also become more effective in predicting and accounting for neural responses in the ventral cortex. Applying these and other deep models to empirical data shows great promise for enabling future progress in the study of visual recognition.

引用

页码：377 / 396

页数：20

共 72 条

[1] The representation of object viewpoint in human visual cortex
Andresen, David R.
Vinberg, Joakim
Grill-Spector, Kalanit
[J]. NEUROIMAGE, 2009, 45 (02) : 522 - 536
[2] Reconsidering the role of structure in vision
Barenholtz, Elan
Tarr, Michael J.
[J]. CATEGORIES IN USE, 2007, 47 : 157 - 180
[3] Differing views on views: response to Hayward and Tarr (2000)
Biederman, I
Bar, M
[J]. VISION RESEARCH, 2000, 40 (28) : 3901 - 3905
[4] HUMAN IMAGE UNDERSTANDING - RECENT RESEARCH AND A THEORY
BIEDERMAN, I
[J]. COMPUTER VISION GRAPHICS AND IMAGE PROCESSING, 1985, 32 (01): : 29 - 73
[5] View-invariant representations of familiar objects by neurons in the inferior temporal visual cortex
Booth, MCA
Rolls, ET
[J]. CEREBRAL CORTEX, 1998, 8 (06) : 510 - 523
[6] Limits of generalization between categories and implications for theories of category specificity
Bukach, Cindy M.
Phillips, W. Stewart
Gauthier, Isabel
[J]. ATTENTION PERCEPTION & PSYCHOPHYSICS, 2010, 72 (07) : 1865 - 1874
[7] HOW ARE 3-DIMENSIONAL OBJECTS REPRESENTED IN THE BRAIN
BULTHOFF, HH
EDELMAN, SY
TARR, MJ
[J]. CEREBRAL CORTEX, 1995, 5 (03) : 247 - 260
[8] Attribute-based neural substrates in temporal cortex for perceiving and knowing about objects
Chao, LL
Haxby, JV
Martin, A
[J]. NATURE NEUROSCIENCE, 1999, 2 (10) : 913 - 919
[9] A Visual Short-Term Memory Advantage for Objects of Expertise
Curby, Kim M.
Glazek, Kuba
Gauthier, Isabel
[J]. JOURNAL OF EXPERIMENTAL PSYCHOLOGY-HUMAN PERCEPTION AND PERFORMANCE, 2009, 35 (01) : 94 - 107
[10] Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848

← 1 2 3 4 5 6 7 8 →