Soft mixer assignment in a hierarchical generative model of natural scene statistics

Cited by: 29
Authors
Schwartz, Odelia [1]
Sejnowski, Terrence J.
Dayan, Peter
Affiliations
[1] Salk Inst Biol Studies, Howard Hughes Med Inst, Computat Neurobiol Lab, La Jolla, CA 92037 USA
[2] Univ Calif San Diego, Dept Biol, La Jolla, CA 92093 USA
[3] UCL, Gatsby Computat Neurosci Unit, London WC1N 3AR, England
DOI
10.1162/neco.2006.18.11.2680
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Gaussian scale mixture models offer a top-down description of signal generation that captures key bottom-up statistical characteristics of filter responses to images. However, the pattern of dependence among the filters for this class of models is prespecified. We propose a novel extension to the gaussian scale mixture model that learns the pattern of dependence from observed inputs and thereby induces a hierarchical representation of these inputs. Specifically, we propose that inputs are generated by gaussian variables (modeling local filter structure), multiplied by a mixer variable that is assigned probabilistically to each input from a set of possible mixers. We demonstrate inference of both components of the generative model, for synthesized data and for different classes of natural images, such as a generic ensemble and faces. For natural images, the mixer variable assignments show invariances resembling those of complex cells in visual cortex; the statistics of the gaussian components of the model are in accord with the outputs of divisive normalization models. We also show how our model helps interrelate a wide range of models of image statistics and cortical processing.
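The generative process summarized in the abstract (gaussian variables multiplied by a mixer variable that is assigned probabilistically to each input) can be illustrated with a minimal sampling sketch. This is not the authors' implementation. The function name `sample_soft_mixer_gsm`, the Rayleigh distribution for the mixers, the independent unit-variance gaussians, and the fixed matrix of assignment probabilities are all assumptions made for illustration only.

```python
import numpy as np


def sample_soft_mixer_gsm(n_samples, assign_probs, rng):
    """Draw samples from a toy gaussian scale mixture with soft mixer assignment.

    assign_probs has shape (n_inputs, n_mixers); row i holds the (assumed fixed)
    probabilities that input i is assigned to each mixer.
    """
    assign_probs = np.asarray(assign_probs, dtype=float)
    n_inputs, n_mixers = assign_probs.shape

    # One positive mixer value per mixer per sample.
    # Assumption: Rayleigh-distributed mixers with unit scale.
    mixers = rng.rayleigh(scale=1.0, size=(n_samples, n_mixers))

    # For every sample, assign each input to one mixer at random.
    assignments = np.stack(
        [rng.choice(n_mixers, size=n_samples, p=assign_probs[i])
         for i in range(n_inputs)],
        axis=1,
    )  # shape (n_samples, n_inputs)

    # Assumption: independent, unit-variance gaussian components.
    gaussians = rng.standard_normal((n_samples, n_inputs))

    # Each input is its gaussian component scaled by its assigned mixer.
    chosen_mixers = np.take_along_axis(mixers, assignments, axis=1)
    inputs = gaussians * chosen_mixers
    return inputs, gaussians, mixers, assignments


rng = np.random.default_rng(0)
probs = np.array([[0.9, 0.1], [0.8, 0.2], [0.1, 0.9]])
x, g, v, a = sample_soft_mixer_gsm(1000, probs, rng)
```

In this sketch, inputs that tend to share a mixer have correlated magnitudes even though their gaussian components are independent, which is the kind of dependence the model is meant to learn from data. Inference (recovering `g`, `v`, and `a` from `x`) is not shown here.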
Pages: 2680-2718
Page count: 39