Neural Networks Retrieving Boolean Patterns in a Sea of Gaussian Ones

被引:22
作者
Agliari, Elena [1 ,2 ]
Barra, Adriano [2 ,3 ,4 ]
Longo, Chiara [1 ]
Tantari, Daniele [2 ,5 ]
机构
[1] Sapienza Univ Roma, Dipartimento Matemat, Rome, Italy
[2] Ist Nazl Alta Matemat GNFM INdAM, Rome, Italy
[3] Univ Salento, Dipartimento Matemat & Fis Ennio Giorgi, Lecce, Italy
[4] Ist Nazl Fis Nucl, Sez Lecce, Lecce, Italy
[5] Scuola Normale Super Pisa, Ctr Ennio De Giorgi, Pisa, Italy
关键词
Neural networks; Hopfield model; Boltzmann machine; CORRELATED PATTERNS; HOPFIELD NETWORKS; MODEL;
D O I
10.1007/s10955-017-1840-9
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
Restricted Boltzmann machines are key tools in machine learning and are described by the energy function of bipartite spin-glasses. From a statistical mechanical perspective, they share the same Gibbs measure of Hopfield networks for associative memory. In this equivalence, weights in the former play as patterns in the latter. As Boltzmann machines usually require real weights to be trained with gradient-descent-like methods, while Hopfield networks typically store binary patterns to be able to retrieve, the investigation of a mixed Hebbian network, equipped with both real (e.g., Gaussian) and discrete (e.g., Boolean) patterns naturally arises. We prove that, in the challenging regime of a high storage of real patterns, where retrieval is forbidden, an additional load of Boolean patterns can still be retrieved, as long as the ratio between the overall load and the network size does not exceed a critical threshold, that turns out to be the same of the standard Amit-Gutfreund-Sompolinsky theory. Assuming replica symmetry, we study the case of a low load of Boolean patterns combining the stochastic stability and Hamilton-Jacobi interpolating techniques. The result can be extended to the high load by a non rigorous but standard replica computation argument.
引用
收藏
页码:1085 / 1104
页数:20
相关论文
共 58 条
[1]   Retrieval Capabilities of Hierarchical Networks: From Dyson to Hopfield [J].
Agliari, Elena ;
Barra, Adriano ;
Galluzzi, Andrea ;
Guerra, Francesco ;
Tantari, Daniele ;
Tavani, Flavia .
PHYSICAL REVIEW LETTERS, 2015, 114 (02)
[2]   Multitasking Associative Networks [J].
Agliari, Elena ;
Barra, Adriano ;
Galluzzi, Andrea ;
Guerra, Francesco ;
Moauro, Francesco .
PHYSICAL REVIEW LETTERS, 2012, 109 (26)
[3]   Parallel retrieval of correlated patterns: From Hopfield networks to Boltzmann machines [J].
Agliari, Elena ;
Barra, Adriano ;
De Antoni, Andrea ;
Galluzzi, Andrea .
NEURAL NETWORKS, 2013, 38 :52-63
[4]   On the stability of the quenched state in mean-field spin-glass models [J].
Aizenman, M ;
Contucci, P .
JOURNAL OF STATISTICAL PHYSICS, 1998, 92 (5-6) :765-783
[5]  
Amit D.J., 1992, Modeling brain function: The world of attractor neural networks
[6]   SPIN-GLASS MODELS OF NEURAL NETWORKS [J].
AMIT, DJ ;
GUTFREUND, H .
PHYSICAL REVIEW A, 1985, 32 (02) :1007-1018
[7]   STORING INFINITE NUMBERS OF PATTERNS IN A SPIN-GLASS MODEL OF NEURAL NETWORKS [J].
AMIT, DJ ;
GUTFREUND, H ;
SOMPOLINSKY, H .
PHYSICAL REVIEW LETTERS, 1985, 55 (14) :1530-1533
[8]  
[Anonymous], 2001, Fields Institute Communications, DOI DOI 10.1090/FIC/030/10
[9]  
[Anonymous], 2005, Theory of neural information processing systems
[10]  
[Anonymous], PROGR PROBABILITY