Comparing the Visual Representations and Performance of Humans and Deep Neural Networks

被引:13
作者
Jacobs, Robert A. [1 ]
Bates, Christopher J. [1 ]
机构
[1] Univ Rochester, Dept Brain & Cognit Sci, 601 Elmwood Ave, Rochester, NY 14627 USA
基金
美国国家科学基金会;
关键词
perception; vision; artificial intelligence; deep neural networks;
D O I
10.1177/0963721418801342
中图分类号
B84 [心理学];
学科分类号
04 ; 0402 ;
摘要
Although deep neural networks (DNNs) are state-of-the-art artificial intelligence systems, it is unclear what insights, if any, they provide about human intelligence. We address this issue in the domain of visual perception. After briefly describing DNNs, we provide an overview of recent results comparing human visual representations and performance with those of DNNs. In many cases, DNNs acquire visual representations and processing strategies that are very different from those used by people. We conjecture that there are at least two factors preventing them from serving as better psychological models. First, DNNs are currently trained with impoverished data, such as data lacking important visual cues to three-dimensional structure, data lacking multisensory statistical regularities, and data in which stimuli are unconnected to an observer's actions and goals. Second, DNNs typically lack adaptations to capacity limits, such as attentional mechanisms, visual working memory, and compressed mental representations biased toward preserving task-relevant abstractions.
引用
收藏
页码:34 / 39
页数:6
相关论文
共 26 条
[1]  
[Anonymous], 2018, Adversarial examples that fool both computer vision and time-limited humans
[2]  
[Anonymous], 2017, On the Limitation of Convolutional Neural Networks in Recognizing Negative Images
[3]  
[Anonymous], 2015, Nature, DOI [10.1038/nature14539, DOI 10.1038/NATURE14539]
[4]  
[Anonymous], STUDY COMPARISON HUM
[5]  
[Anonymous], COMPARING DEEP NEURA
[6]  
[Anonymous], MODELING HUMAN CATEG
[7]  
[Anonymous], 2018, SAME DIFFERENT PROBL
[8]   Deictic codes for the embodiment of cognition [J].
Ballard, DH ;
Hayhoe, MM ;
Pook, PK ;
Rao, RPN .
BEHAVIORAL AND BRAIN SCIENCES, 1997, 20 (04) :723-+
[9]   Visual Shape Perception as Bayesian Inference of 3D Object-Centered Shape Representations [J].
Erdogan, Goker ;
Jacobs, Robert A. .
PSYCHOLOGICAL REVIEW, 2017, 124 (06) :740-761
[10]  
Eslami S. M., 2016, Advances in Neural Information Processing Systems, V29