Equivalence of restricted Boltzmann machines and tensor network states

被引:180
作者
Chen, Jing [1 ,2 ,4 ]
Cheng, Song [1 ,2 ]
Xie, Haidong [1 ,2 ]
Wang, Lei [1 ]
Xiang, Tao [1 ,3 ]
机构
[1] Chinese Acad Sci, Inst Phys, POB 603, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Sch Phys Sci, Beijing 100049, Peoples R China
[3] Collaborat Innovat Ctr Quantum Matter, Beijing 100190, Peoples R China
[4] Flatiron Inst, Ctr Computat Quantum Phys, New York, NY 10010 USA
基金
中国国家自然科学基金;
关键词
MATRIX RENORMALIZATION-GROUP; PRODUCT STATES; DEEP; ENTANGLEMENT; MODELS;
D O I
10.1103/PhysRevB.97.085104
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
The restricted Boltzmann machine (RBM) is one of the fundamental building blocks of deep learning. RBM finds wide applications in dimensional reduction, feature extraction, and recommender systems via modeling the probability distributions of a variety of input data including natural images, speech signals, and customer ratings, etc. We build a bridge between RBM and tensor network states (TNS) widely used in quantum many-body physics research. We devise efficient algorithms to translate an RBM into the commonly used TNS. Conversely, we give sufficient and necessary conditions to determine whether a TNS can be transformed into an RBM of given architectures. Revealing these general and constructive connections can cross fertilize both deep learning and quantum many-body physics. Notably, by exploiting the entanglement entropy bound of TNS, we can rigorously quantify the expressive power of RBM on complex data sets. Insights into TNS and its entanglement capacity can guide the design of more powerful deep learning architectures. On the other hand, RBM can represent quantum many-body states with fewer parameters compared to TNS, which may allow more efficient classical simulations.
引用
收藏
页数:16
相关论文
共 86 条
  • [1] RIGOROUS RESULTS ON VALENCE-BOND GROUND-STATES IN ANTIFERROMAGNETS
    AFFLECK, I
    KENNEDY, T
    LIEB, EH
    TASAKI, H
    [J]. PHYSICAL REVIEW LETTERS, 1987, 59 (07) : 799 - 802
  • [2] SPIN-GLASS MODELS OF NEURAL NETWORKS
    AMIT, DJ
    GUTFREUND, H
    [J]. PHYSICAL REVIEW A, 1985, 32 (02): : 1007 - 1018
  • [3] [Anonymous], ARXIV171003545
  • [4] [Anonymous], ARXIV160304467
  • [5] [Anonymous], 2010, J. Mach. Learn. Res.
  • [6] [Anonymous], ARXIV170106246
  • [7] [Anonymous], 2002, GRADUATE STUDIES MAT
  • [8] [Anonymous], TensorFlow: A System for Large-Scale Machine Learning
  • [9] [Anonymous], APPEARS IN PROCEEDIN
  • [10] [Anonymous], 2003, Information Theory, Inference and Learning Algorithms