Soft mixer assignment in a hierarchical generative model of natural scene statistics

被引:29
作者
Schwartz, Odelia [1 ]
Sejnowski, Terrence J.
Dayan, Peter
机构
[1] Salk Inst Biol Studies, Howard Hughes Med Inst, Computat Neurobiol Lab, La Jolla, CA 92037 USA
[2] Univ Calif San Diego, Dept Biol, La Jolla, CA 92093 USA
[3] UCL, Gatsby Computat Neurosci Unit, London WC1N 3AR, England
关键词
D O I
10.1162/neco.2006.18.11.2680
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Gaussian scale mixture models offer a top-down description of signal generation that captures key bottom-up statistical characteristics of filter responses to images. However, the pattern of dependence among the filters for this class of models is prespecified. We propose a novel extension to the gaussian scale mixture model that learns the pattern of dependence from observed inputs and thereby induces a hierarchical representation of these inputs. Specifically, we propose that inputs are generated by gaussian variables (modeling local filter structure), multiplied by a mixer variable that is assigned probabilistically to each input from a set of possible mixers. We demonstrate inference of both components of the generative model, for synthesized data and for different classes of natural images, such as a generic ensemble and faces. For natural images, the mixer variable assignments show invariances resembling those of complex cells in visual cortex; the statistics of the gaussian components of the model are in accord with the outputs of divisive normalization models. We also show how our model helps interrelate a wide range of models of image statistics and cortical processing.
引用
收藏
页码:2680 / 2718
页数:39
相关论文
共 74 条
[11]  
DEBONET J, 1997, ADV NEURAL INFORM PR, V9
[12]   MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM [J].
DEMPSTER, AP ;
LAIRD, NM ;
RUBIN, DB .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01) :1-38
[13]   Learning the invariance properties of complex cells from their responses to natural stimuli [J].
Einhäuser, W ;
Kayser, C ;
König, P ;
Körding, KP .
EUROPEAN JOURNAL OF NEUROSCIENCE, 2002, 15 (03) :475-486
[14]   WHAT IS THE GOAL OF SENSORY CODING [J].
FIELD, DJ .
NEURAL COMPUTATION, 1994, 6 (04) :559-601
[15]   RELATIONS BETWEEN THE STATISTICS OF NATURAL IMAGES AND THE RESPONSE PROPERTIES OF CORTICAL-CELLS [J].
FIELD, DJ .
JOURNAL OF THE OPTICAL SOCIETY OF AMERICA A-OPTICS IMAGE SCIENCE AND VISION, 1987, 4 (12) :2379-2394
[16]   Learning Invariance from Transformation Sequences [J].
Foldiak, Peter .
NEURAL COMPUTATION, 1991, 3 (02) :194-200
[18]   Edge co-occurrence in natural images predicts contour grouping performance [J].
Geisler, WS ;
Perry, JS ;
Super, BJ ;
Gallogly, DP .
VISION RESEARCH, 2001, 41 (06) :711-724
[19]   Probability models for clutter in natural images [J].
Grenander, U ;
Srivastava, A .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2001, 23 (04) :424-429
[20]   Generative models for discovering sparse distributed representations [J].
Hinton, GE ;
Ghahramani, Z .
PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY OF LONDON SERIES B-BIOLOGICAL SCIENCES, 1997, 352 (1358) :1177-1190