From Probabilistic Graphical Models to Generalized Tensor Networks for Supervised Learning

被引:36
作者
Glasser, Ivan [1 ,2 ]
Pancotti, Nicola [1 ,2 ]
Cirac, J. Ignacio [1 ,2 ]
机构
[1] Max Planck Inst Quantum Opt, D-85748 Garching, Germany
[2] MCQST, D-80799 Munich, Germany
基金
欧盟地平线“2020”;
关键词
Tensile stress; Graphical models; Machine learning; Matrix decomposition; Probabilistic logic; Network architecture; Boltzmann machines; graphical models; machine learning; quantum circuits; string-bond states; supervised learning; tensor networks; tensor-train; MATRIX PRODUCT STATES;
D O I
10.1109/ACCESS.2020.2986279
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Tensor networks have found a wide use in a variety of applications in physics and computer science, recently leading to both theoretical insights as well as practical algorithms in machine learning. In this work we explore the connection between tensor networks and probabilistic graphical models, and show that it motivates the definition of generalized tensor networks where information from a tensor can be copied and reused in other parts of the network. We discuss the relationship between generalized tensor network architectures used in quantum physics, such as string-bond states, and architectures commonly used in machine learning. We provide an algorithm to train these networks in a supervised-learning context and show that they overcome the limitations of regular tensor networks in higher dimensions, while keeping the computation efficient. A method to combine neural networks and tensor networks as part of a common deep learning architecture is also introduced. We benchmark our algorithm for several generalized tensor network architectures on the task of classifying images and sounds, and show that they outperform previously introduced tensor-network algorithms. The models we consider also have a natural implementation on a quantum computer and may guide the development of near-term quantum machine learning architectures.
引用
收藏
页码:68169 / 68182
页数:14
相关论文
共 65 条
[1]  
ACKLEY DH, 1985, COGNITIVE SCI, V9, P147
[2]  
Anandkumar A, 2014, J MACH LEARN RES, V15, P2773
[3]  
[Anonymous], 2017, PUBL HLTH PERSP
[4]  
[Anonymous], 2008, P 25 INT C MACH LEAR, DOI DOI 10.1145/1390156.1390224
[5]  
[Anonymous], 1986, PARALLEL DISTRIBUTED
[6]  
[Anonymous], 2010, MNIST HANDWRITTEN DI
[7]  
[Anonymous], 2009, Probabilistic Graphical Models: Principles and Techniques
[8]  
[Anonymous], 2017, ARXIV171103357
[9]  
[Anonymous], 2017, Tensor regression networks
[10]  
[Anonymous], 2017, ARXIV171101416