A mixture of sparse coding models explaining properties of face neurons related to holistic and parts-based processing

被引：10

作者：

Hosoya, Haruo ^{[1
,5
]}

Hyvarinen, Aapo ^{[2
,3
,4
]}

机构：

[1] ATR Int, Cognit Mechanisms Labs, Kyoto, Japan

[2] Univ Helsinki, Dept Comp Sci, Helsinki, Finland

[3] Univ Helsinki, HIIT, Helsinki, Finland

[4] UCL, Gatsby Computat Neurosci Unit, London, England

[5] 2-2-2 Hikaridai, Keihanna Sci City, Kyoto, Japan

来源：

PLOS COMPUTATIONAL BIOLOGY | 2017年 / 13卷 / 07期

基金：

芬兰科学院;

关键词：

NATURAL IMAGES; MACAQUE; EMERGENCE; PATCHES; CORTEX; RECOGNITION; OBJECTS; SHIFT; AREA;

D O I：

10.1371/journal.pcbi.1005667

中图分类号：

Q5 [生物化学];

学科分类号：

071010 ; 081704 ;

摘要：

Experimental studies have revealed evidence of both parts-based and holistic representations of objects and faces in the primate visual system. However, it is still a mystery how such seemingly contradictory types of processing can coexist within a single system. Here, we propose a novel theory called mixture of sparse coding models, inspired by the formation of category-specific subregions in the inferotemporal (IT) cortex. We developed a hierarchical network that constructed a mixture of two sparse coding submodels on top of a simple Gabor analysis. The submodels were each trained with face or non-face object images, which resulted in separate representations of facial parts and object parts. Importantly, evoked neural activities were modeled by Bayesian inference, which had a top-down explaining-away effect that enabled recognition of an individual part to depend strongly on the category of the whole input. We show that this explaining-away effect was indeed crucial for the units in the face submodel to exhibit significant selectivity to face images over object images in a similar way to actual face-selective neurons in the macaque IT cortex. Furthermore, the model explained, qualitatively and quantitatively, several tuning properties to facial features found in the middle patch of face processing in IT as documented by Freiwald, Tsao, and Livingstone (2009). These included, in particular, tuning to only a small number of facial features that were often related to geometrically large parts like face outline and hair, preference and anti-preference of extreme facial features (e.g., very large/small inter-eye distance), and reduction of the gain of feature tuning for partial face stimuli compared to whole face stimuli. Thus, we hypothesize that the coding principle of facial features in the middle patch of face processing in the macaque IT cortex may be closely related to mixture of sparse coding models.

引用

页数：27

共 47 条

[1] [Anonymous], 2007, Tech. rep
[2] Barlow H B, 1972, Perception, V1, P371, DOI 10.1068/p010371
[3] Barlow H. B., 1961, SENS COMMUN, V1, P1
[4] Face recognition by independent component analysis
Bartlett, MS
Movellan, JR
Sejnowski, TJ
[J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 2002, 13 (06): : 1450 - 1464
[5] Bishop C., 2006, Pattern recognition and machine learning, P423
[6] VISUAL PROPERTIES OF NEURONS IN AREA V4 OF THE MACAQUE - SENSITIVITY TO STIMULUS FORM
DESIMONE, R
SCHEIN, SJ
[J]. JOURNAL OF NEUROPHYSIOLOGY, 1987, 57 (03) : 835 - 868
[7] A specialized face-processing model inspired by the organization of monkey face patches explains several face-specific phenomena observed in humans
Farzmahdi, Amirhossein
Rajaei, Karim
Ghodrati, Masoud
Ebrahimpour, Reza
Khaligh-Razavi, Seyed-Mahdi
[J]. SCIENTIFIC REPORTS, 2016, 6
[8] Functional Compartmentalization and Viewpoint Generalization Within the Macaque Face-Processing System
Freiwald, Winrich A.
Tsao, Doris Y.
[J]. SCIENCE, 2010, 330 (6005) : 845 - 851
[9] A face feature space in the macaque temporal lobe
Freiwald, Winrich A.
Tsao, Doris Y.
Livingstone, Margaret S.
[J]. NATURE NEUROSCIENCE, 2009, 12 (09) : 1187 - U28
[10] NEOCOGNITRON - A SELF-ORGANIZING NEURAL NETWORK MODEL FOR A MECHANISM OF PATTERN-RECOGNITION UNAFFECTED BY SHIFT IN POSITION
FUKUSHIMA, K
[J]. BIOLOGICAL CYBERNETICS, 1980, 36 (04) : 193 - 202

← 1 2 3 4 5 →