Phase diagram of restricted Boltzmann machines and generalized Hopfield networks with arbitrary priors

Cited by: 67
Authors
Barra, Adriano [1]
Genovese, Giuseppe [2]
Sollich, Peter [3]
Tantari, Daniele [4, 5]
Affiliations
[1] Univ Salento, Dipartimento Matemat & Fis Ennio De Giorgi, I-73100 Lecce, Italy
[2] Univ Zurich, Inst Math, CH-8057 Zurich, Switzerland
[3] Kings Coll London, Dept Math, London WC2R 2LS, England
[4] Scuola Normale Super Pisa, Ctr Ennio de Giorgi, Piazza Cavalieri 3, I-56100 Pisa, Italy
[5] Scuola Normale Super Pisa, Piazza Cavalieri 7, I-56126 Pisa, Italy
Keywords
NEURAL-NETWORKS; FREE-ENERGY; MODEL; PATTERNS; SYSTEMS; STORAGE;
DOI
10.1103/PhysRevE.97.022310
Chinese Library Classification (CLC) number
O35 [Fluid Mechanics]; O53 [Plasma Physics];
Discipline classification code
070204; 080103; 080704;
Abstract
Restricted Boltzmann machines are described by the Gibbs measure of a bipartite spin glass, which in turn can be seen as a generalized Hopfield network. This equivalence allows us to characterize the state of these systems in terms of their retrieval capabilities, both at low and high load, of pure states. We study the paramagnetic-spin glass and the spin glass-retrieval phase transitions, as the pattern (i.e., weight) distribution and spin (i.e., unit) priors vary smoothly from Gaussian real variables to Boolean discrete variables. Our analysis shows that the presence of a retrieval phase is robust and not peculiar to the standard Hopfield model with Boolean patterns. The retrieval region becomes larger when the pattern entries and retrieval units get more peaked and, conversely, when the hidden units acquire a broader prior and therefore have a stronger response to high fields. Moreover, at low load retrieval always exists below some critical temperature, for every pattern distribution ranging from the Boolean to the Gaussian case.
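The equivalence stated in the abstract can be checked numerically in its simplest textbook case. The sketch below assumes binary visible units and standard-Gaussian hidden units, which is only one corner of the family of priors the paper covers. All names here (`rbm_marginal_weight`, `hopfield_weight`, `xi`, `sigma`, `beta`) and the coupling scale sqrt(beta/N) are illustrative assumptions, not notation taken from this record. Integrating out the hidden units of the bipartite model should reproduce the Hopfield Boltzmann weight exp(beta/(2N) * sum_mu (xi_mu . sigma)^2).

```python
import itertools
import numpy as np


def rbm_marginal_weight(sigma, xi, beta, n_nodes=60):
    """Unnormalised weight of a visible configuration after numerically
    integrating out independent N(0, 1) hidden units (assumed prior)."""
    n = sigma.size
    # Gauss-Hermite rule for the weight function exp(-z^2 / 2).
    z, w = np.polynomial.hermite_e.hermegauss(n_nodes)
    w = w / np.sqrt(2.0 * np.pi)  # rescale to the N(0, 1) density
    # Field acting on each hidden unit, one per pattern.
    fields = np.sqrt(beta / n) * (xi @ sigma)
    # Hidden units are independent given sigma, so the integral factorises.
    return float(np.prod([(w * np.exp(h * z)).sum() for h in fields]))


def hopfield_weight(sigma, xi, beta):
    """Unnormalised Hopfield weight with Hebbian couplings built from xi."""
    n = sigma.size
    overlaps = xi @ sigma
    return float(np.exp(beta / (2.0 * n) * np.sum(overlaps ** 2)))


# Small illustrative instance: N visible units, P patterns (hidden units).
rng = np.random.default_rng(0)
N, P, beta = 6, 2, 0.8
xi = rng.choice([-1.0, 1.0], size=(P, N))

max_rel_err = 0.0
for bits in itertools.product([-1.0, 1.0], repeat=N):
    sigma = np.array(bits)
    a = rbm_marginal_weight(sigma, xi, beta)
    b = hopfield_weight(sigma, xi, beta)
    max_rel_err = max(max_rel_err, abs(a - b) / b)

print(f"max relative error: {max_rel_err:.2e}")
```

The two weights agree configuration by configuration, up to quadrature error, because each Gaussian integral is the moment generating function of a standard normal. With non-Gaussian hidden priors the marginal is no longer quadratic in the overlaps, which is the generalized setting the abstract refers to.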
Pages: 14