On the equivalence of Hopfield networks and Boltzmann Machines

被引:91
作者
Barra, Adriano [1 ]
Bernacchia, Alberto [2 ]
Santucci, Enrica [3 ]
Contucci, Pierluigi [4 ]
机构
[1] Univ Roma La Sapienza, Dipartimento Fis, I-00185 Rome, Italy
[2] Yale Univ, Dept Neurobiol, New Haven, CT 06510 USA
[3] Univ Aquila, Dipartimento Matemat, I-67010 Laquila, Italy
[4] Alma Mater Studiorum Univ Bologna, Dipartimento Matemat, I-40126 Bologna, Italy
关键词
Statistical mechanics; Hopfield networks; Boltzmann Machines; NEURAL-NETWORKS; IMAGES; MEMORY;
D O I
10.1016/j.neunet.2012.06.003
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A specific type of neural networks, the Restricted Boltzmann Machines (RBM), are implemented for classification and feature detection in machine learning. They are characterized by separate layers of visible and hidden units, which are able to learn efficiently a generative model of the observed data. We study a "hybrid" version of RBMs, in which hidden units are analog and visible units are binary, and we show that thermodynamics of visible units are equivalent to those of a Hopfield network, in which the N visible units are the neurons and the P hidden units are the learned patterns. We apply the method of stochastic stability to derive the thermodynamics of the model, by considering a formal extension of this technique to the case of multiple sets of stored patterns, which may act as a benchmark for the study of correlated sets. Our results imply that simulating the dynamics of a Hopfield network, requiring the update of N neurons and the storage of N (N - 1)/2 synapses, can be accomplished by a hybrid Boltzmann Machine, requiring the update of N P neurons but the storage of only NP synapses. In addition, the well known glass transition of the Hopfield network has a counterpart in the Boltzmann Machine: it corresponds to an optimum criterion for selecting the relative sizes of the hidden and visible layers, resolving the trade-off between flexibility and generality of the model. The low storage phase of the Hopfield model corresponds to few hidden units and hence a overly constrained RBM, while the spin-glass phase (too many hidden units) corresponds to unconstrained RBM prone to overfitting of the observed data. (C) 2012 Elsevier Ltd. All rights reserved.
引用
收藏
页码:1 / 9
页数:9
相关论文
共 20 条
  • [1] On the stability of the quenched state in mean-field spin-glass models
    Aizenman, M
    Contucci, P
    [J]. JOURNAL OF STATISTICAL PHYSICS, 1998, 92 (5-6) : 765 - 783
  • [2] Amit D. J., 1992, Modeling brain function: The world of attractor neural networks
  • [3] [Anonymous], P 3 INT C IND COMP A
  • [4] [Anonymous], 2005, Theory of neural information processing systems
  • [5] [Anonymous], 1988, Introduction to Theoretical Neurobiology
  • [6] [Anonymous], 1991, Santa Fe Institute Studies in the Sciences of Complexity
  • [7] Barra A., FULLY SOLVA IN PRESS
  • [8] The mean field Ising model trough interpolating techniques
    Barra, Adriano
    [J]. JOURNAL OF STATISTICAL PHYSICS, 2008, 132 (05) : 787 - 809
  • [9] The Replica Symmetric Approximation of the Analogical Neural Network
    Barra, Adriano
    Genovese, Giuseppe
    Guerra, Francesco
    [J]. JOURNAL OF STATISTICAL PHYSICS, 2010, 140 (04) : 784 - 796
  • [10] Bengio Y., 2009, MACH LEARN, V2, P127