Emergence of Compositional Representations in Restricted Boltzmann Machines

Cited by: 86
Authors
Tubiana, J. [1, 2]
Monasson, R. [1, 2]
Affiliations
[1] Sorbonne Univ UPMC, Ecole Normale Super, Lab Phys Theor, 24 rue Lhomond, F-75005 Paris, France
[2] Sorbonne Univ UPMC, PSL Res, CNRS, 24 rue Lhomond, F-75005 Paris, France
Keywords
NEURAL NETWORKS;
DOI
10.1103/PhysRevLett.118.138301
Chinese Library Classification number
O4 [Physics];
Discipline classification code
0702;
Abstract
Extracting automatically the complex set of features composing real high-dimensional data is crucial for achieving high performance in machine-learning tasks. Restricted Boltzmann machines (RBM) are empirically known to be efficient for this purpose, and to be able to generate distributed and graded representations of the data. We characterize the structural conditions (sparsity of the weights, low effective temperature, nonlinearities in the activation functions of hidden units, and adaptation of fields maintaining the activity in the visible layer) allowing RBM to operate in such a compositional phase. Evidence is provided by the replica analysis of an adequate statistical ensemble of random RBMs and by RBM trained on the handwritten digits data set MNIST.
Pages: 5
Related papers
24 in total
  • [1] Immune networks: multitasking capabilities near saturation
    Agliari, E.
    Annibale, A.
    Barra, A.
    Coolen, A. C. C.
    Tantari, D.
    [J]. JOURNAL OF PHYSICS A-MATHEMATICAL AND THEORETICAL, 2013, 46 (41)
  • [2] Multitasking Associative Networks
    Agliari, Elena
    Barra, Adriano
    Galluzzi, Andrea
    Guerra, Francesco
    Moauro, Francesco
    [J]. PHYSICAL REVIEW LETTERS, 2012, 109 (26)
  • [3] STORING INFINITE NUMBERS OF PATTERNS IN A SPIN-GLASS MODEL OF NEURAL NETWORKS
    AMIT, DJ
    GUTFREUND, H
    SOMPOLINSKY, H
    [J]. PHYSICAL REVIEW LETTERS, 1985, 55 (14) : 1530 - 1533
  • [4] [Anonymous], 2015, Advances in neural information processing systems
  • [5] [Anonymous], 2010, P 13 INT C ARTIFICIA
  • [6] On the equivalence of Hopfield networks and Boltzmann Machines
    Barra, Adriano
    Bernacchia, Alberto
    Santucci, Enrica
    Contucci, Pierluigi
    [J]. NEURAL NETWORKS, 2012, 34 : 1 - 9
  • [7] Representation Learning: A Review and New Perspectives
    Bengio, Yoshua
    Courville, Aaron
    Vincent, Pascal
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2013, 35 (08) : 1798 - 1828
  • [8] Fischer Asja, 2012, Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications. Proceedings 17th Iberoamerican Congress, CIARP 2012, P14, DOI 10.1007/978-3-642-33275-3_2
  • [9] Hinton G. E., 2010, Momentum, P599
  • [10] NEURAL NETWORKS AND PHYSICAL SYSTEMS WITH EMERGENT COLLECTIVE COMPUTATIONAL ABILITIES
    HOPFIELD, JJ
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA-BIOLOGICAL SCIENCES, 1982, 79 (08) : 2554 - 2558