Using Psychophysical Methods to Understand Mechanisms of Face Identification in a Deep Neural Network

Cited by: 3
Authors
Xu, Tian [1]
Garrod, Oliver [1]
Scholte, Steven H. [2]
Ince, Robin [1]
Schyns, Philippe G. [1]
Affiliations
[1] Univ Glasgow, Glasgow, Lanark, Scotland
[2] Univ Amsterdam, Amsterdam, Netherlands
Source
PROCEEDINGS 2018 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW) | 2018
Funding
Wellcome Trust (UK); Engineering and Physical Sciences Research Council (EPSRC, UK)
Keywords
INFORMATION; RECOGNITION; FEATURES
DOI
10.1109/CVPRW.2018.00266
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Deep Convolutional Neural Networks (CNNs) have been one of the most influential recent developments in computer vision, particularly for categorization [20]. The promise of CNNs is at least two-fold. First, they represent the best engineering solution to successfully tackle the foundational task of visual categorization with a performance level that even exceeds that of humans [19, 27]. Second, for computational neuroscience, CNNs provide a testable modelling platform for visual categorizations inspired by the multilayered organization of visual cortex [7]. Here, we used a 3D generative model to control the variance of information learned to identify 2,000 face identities in one CNN architecture (10-layer ResNet [9]). We generated 25M face images to train the network by randomly sampling intrinsic (i.e. face morphology, gender, age, expression and ethnicity) and extrinsic factors of face variance (i.e. 3D pose, illumination, scale and 2D translation). At testing, the network performed with 99% generalization accuracy for face identity across variations of intrinsic and extrinsic factors. State-of-the-art information mapping techniques from psychophysics (i.e. Representational Similarity Analysis [18] and Bubbles [8]) revealed respectively the network layer at which factors of variance are resolved and the face features that are used for identity. By explicitly controlling the generative factors of face information, we provide an alternative framework based on human psychophysics to understand information processing in CNNs.
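The Representational Similarity Analysis step mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' pipeline: the toy activations, the factor labels, the helper names (`rdm`, `factor_rdm`, `rdm_similarity`) and the use of a Pearson correlation between matrices are all assumptions made for illustration. The idea is to build a dissimilarity matrix from a layer's responses and compare it with a matrix predicted by one factor of variance (identity or pose); the layer at which the identity matrix fits better than the pose matrix is a candidate for where pose is resolved.

```python
import numpy as np

def rdm(activations):
    """Dissimilarity matrix: 1 - Pearson correlation between image responses (rows)."""
    return 1.0 - np.corrcoef(activations)

def factor_rdm(labels):
    """Model matrix for one factor: 0 if two images share the label, 1 otherwise."""
    labels = np.asarray(labels)
    return (labels[:, None] != labels[None, :]).astype(float)

def rdm_similarity(rdm_a, rdm_b):
    """Pearson correlation between the upper triangles of two dissimilarity matrices."""
    iu = np.triu_indices_from(rdm_a, k=1)
    return float(np.corrcoef(rdm_a[iu], rdm_b[iu])[0, 1])

# Toy stimulus set (assumed): 4 identities, each rendered in 5 poses.
rng = np.random.default_rng(0)
identity = np.repeat(np.arange(4), 5)
pose = np.tile(np.arange(5), 4)

# Hypothetical layer responses: the "early" layer is dominated by pose,
# the "late" layer by identity. Real use would take activations from the network.
id_codes = rng.normal(size=(4, 64))
pose_codes = rng.normal(size=(5, 64))
early = pose_codes[pose] + 0.3 * id_codes[identity] + 0.1 * rng.normal(size=(20, 64))
late = id_codes[identity] + 0.3 * pose_codes[pose] + 0.1 * rng.normal(size=(20, 64))

id_early = rdm_similarity(rdm(early), factor_rdm(identity))
id_late = rdm_similarity(rdm(late), factor_rdm(identity))
pose_early = rdm_similarity(rdm(early), factor_rdm(pose))
pose_late = rdm_similarity(rdm(late), factor_rdm(pose))

print("identity fit, early vs late:", id_early, id_late)
print("pose fit, early vs late:", pose_early, pose_late)
```

Rank-based comparisons (for example Spearman correlation) are a common alternative when comparing dissimilarity matrices; Pearson is used here only to keep the sketch dependency-free.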
Pages: 2057-2065
Page count: 9
Related papers
43 items in total
[21]   Microsoft COCO: Common Objects in Context [J].
Lin, Tsung-Yi ;
Maire, Michael ;
Belongie, Serge ;
Hays, James ;
Perona, Pietro ;
Ramanan, Deva ;
Dollar, Piotr ;
Zitnick, C. Lawrence .
COMPUTER VISION - ECCV 2014, PT V, 2014, 8693 :740-755
[22]   What are the visual features underlying human versus machine vision? [J].
Linsley, D. ;
Eberhardt, S. ;
Sharma, T. ;
Gupta, P. ;
Serre, T. .
2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW 2017), 2017 :2706-2714
[23]   BubbLeNet: Foveated imaging for visual discovery [J].
Matzen, Kevin ;
Snavely, Noah .
2015 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV), 2015 :1931-1939
[24]   Usage of spatial scales for the categorization of faces, objects, and scenes [J].
Morrison, DJ ;
Schyns, PG .
PSYCHONOMIC BULLETIN & REVIEW, 2001, 8 (03) :454-469
[25]   VISUAL CELLS IN THE TEMPORAL CORTEX SENSITIVE TO FACE VIEW AND GAZE DIRECTION [J].
PERRETT, DI ;
SMITH, PAJ ;
POTTER, DD ;
MISTLIN, AJ ;
HEAD, AS ;
MILNER, AD ;
JEEVES, MA .
PROCEEDINGS OF THE ROYAL SOCIETY SERIES B-BIOLOGICAL SCIENCES, 1985, 223 (1232) :293-317
[26]   Stimulus features coded by single neurons of a macaque body category selective patch [J].
Popivanov, Ivo D. ;
Schyns, Philippe G. ;
Vogels, Rufin .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2016, 113 (17) :E2450-E2459
[27]   Deep learning [J].
Rusk, Nicole .
NATURE METHODS, 2016, 13 (01) :35-35
[28]   ImageNet Large Scale Visual Recognition Challenge [J].
Russakovsky, Olga ;
Deng, Jia ;
Su, Hao ;
Krause, Jonathan ;
Satheesh, Sanjeev ;
Ma, Sean ;
Huang, Zhiheng ;
Karpathy, Andrej ;
Khosla, Aditya ;
Bernstein, Michael ;
Berg, Alexander C. ;
Fei-Fei, Li .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2015, 115 (03) :211-252
[29]   Single-Neuron Correlates of Atypical Face Processing in Autism [J].
Rutishauser, Ueli ;
Tudusciuc, Oana ;
Wang, Shuo ;
Mamelak, Adam N. ;
Ross, Ian B. ;
Adolphs, Ralph .
NEURON, 2013, 80 (04) :887-899
[30]   Schroff F, 2015, PROC CVPR IEEE, P815, DOI 10.1109/CVPR.2015.7298682