Combining Convolutional Neural Networks and Cognitive Models to Predict Novel Object Recognition in Humans

被引:14
作者
Annis, Jeffrey [1 ]
Gauthier, Isabel [1 ]
Palmeri, Thomas J. [1 ]
机构
[1] Vanderbilt Univ, Dept Psychol, 111 21st Ave South,301 Wilson Hall, Nashville, TN 37240 USA
基金
美国国家科学基金会;
关键词
convolutional neural networks; object recognition; linear ballistic accumulator; INDIVIDUAL-DIFFERENCES; HIERARCHICAL-MODELS; EXPERT; ATTENTION; CHOICE; FACE; REPRESENTATION; SIMILARITY; ALGORITHM; FRAMEWORK;
D O I
10.1037/xlm0000968
中图分类号
B84 [心理学];
学科分类号
04 ; 0402 ;
摘要
Object representations from convolutional neural network (CNN) models of computer vision (LeCun, Bengio, & Hinton, 2015) were used to drive a cognitive model of decision making, the linear ballistic accumulator (LBA) model (Brown & Heathcote, 2008), to predict errors and response times (RTs) in a novel object recognition task in humans. CNNs have become very successful at visual tasks like classifying objects in real-world images (e.g., He, Zhang, Ren, & Sun, 2015; Krizhevsky, Sutskever, & Hinton, 2012). We asked whether object representations learned by CNNs previously trained on a large corpus of natural images could be used to predict performance recognizing novel objects the network has never been trained on; we used novel Greebles, Ziggerins, and Sheinbugs that have been used in a number of previous object recognition studies. We specifically investigated whether a model combining high-level CNN representations of these novel objects could be used to drive an LBA model of decision making to account for errors and RTs in a same- different matching task (from Richler et al., 2019). Combining linearly transformed CNN object representations with the LBA provided reasonable accounts of performance not only on average, but at the individual-participant level and the item level as well. We frame the findings in the context of growing interest in using CNN models to understand visual object representations and the promise of using CNN representations to extend cognitive models to explain more complex aspects of human behavior.
引用
收藏
页码:785 / 807
页数:23
相关论文
共 86 条
[71]   Modelling individual difference in visual categorization [J].
Shen, Jianhong ;
Palmeri, Thomas J. .
VISUAL COGNITION, 2016, 24 (03) :260-283
[72]  
Shental N, 2002, LECT NOTES COMPUT SC, V2353, P776
[73]   TOWARD A UNIVERSAL LAW OF GENERALIZATION FOR PSYCHOLOGICAL SCIENCE [J].
SHEPARD, RN .
SCIENCE, 1987, 237 (4820) :1317-1323
[74]  
Simonyan Karen, 2015, Comput. Sci.
[75]  
Szegedy C., 2015, P IEEE C COMP VIS PA, P1
[76]  
Tang Yuan., 2015, dml: Distance Metric Learning in R
[77]   A Markov Chain Monte Carlo version of the genetic algorithm differential evolution: easy Bayesian computing for real parameter spaces [J].
Ter Braak, Cajo J. F. .
STATISTICS AND COMPUTING, 2006, 16 (03) :239-249
[78]   A Method for Efficiently Sampling From Distributions With Correlated Dimensions [J].
Turner, Brandon M. ;
Sederberg, Per B. ;
Brown, Scott D. ;
Steyvers, Mark .
PSYCHOLOGICAL METHODS, 2013, 18 (03) :368-384
[79]   The time course of perceptual choice: The leaky, competing accumulator model [J].
Usher, M ;
McClelland, JL .
PSYCHOLOGICAL REVIEW, 2001, 108 (03) :550-592
[80]  
van der Maaten L, 2014, J MACH LEARN RES, V15, P3221