Statistical Physics of Unsupervised Learning with Prior Knowledge in Neural Networks

被引:12
作者
Hou, Tianqi [1 ,2 ]
Huang, Haiping [2 ]
机构
[1] Hong Kong Univ Sci & Technol, Dept Phys, Clear Water Bay, Hong Kong, Peoples R China
[2] Sun Yat Sen Univ, Sch Phys, PMI Lab, Guangzhou 510275, Peoples R China
关键词
BAYESIAN-INFERENCE;
D O I
10.1103/PhysRevLett.124.248302
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
Integrating sensory inputs with prior beliefs from past experiences in unsupervised learning is a common and fundamental characteristic of brain or artificial neural computation. However, a quantitative role of prior knowledge in unsupervised learning remains unclear, prohibiting a scientific understanding of unsupervised learning. Here, we propose a statistical physics model of unsupervised learning with prior knowledge, revealing that the sensory inputs drive a series of continuous phase transitions related to spontaneous intrinsic-symmetry breaking. The intrinsic symmetry includes both reverse symmetry and permutation symmetry, commonly observed in most artificial neural networks. Compared to the prior-free scenario, the prior reduces more strongly the minimal data size triggering the reverse-symmetry breaking transition, and moreover, the prior merges, rather than separates, permutation-symmetry breaking phases. We claim that the prior can be learned from data samples, which in physics corresponds to a two-parameter Nishimori constraint. This Letter thus reveals mechanisms about the influence of the prior on unsupervised learning.
引用
收藏
页数:5
相关论文
共 31 条
[11]  
Hernández-Lobato JM, 2015, PR MACH LEARN RES, V37, P1861
[12]   Reducing the dimensionality of data with neural networks [J].
Hinton, G. E. ;
Salakhutdinov, R. R. .
SCIENCE, 2006, 313 (5786) :504-507
[13]   Training products of experts by minimizing contrastive divergence [J].
Hinton, GE .
NEURAL COMPUTATION, 2002, 14 (08) :1771-1800
[14]   Minimal model of permutation symmetry in unsupervised learning [J].
Hou, Tianqi ;
Wong, K. Y. Michael ;
Huang, Haiping .
JOURNAL OF PHYSICS A-MATHEMATICAL AND THEORETICAL, 2019, 52 (41)
[15]  
Huang H., ARXIV191107662
[16]   Statistical mechanics of unsupervised feature learning in a restricted Boltzmann machine with binary synapses [J].
Huang, Haiping .
JOURNAL OF STATISTICAL MECHANICS-THEORY AND EXPERIMENT, 2017,
[17]   Unsupervised feature learning from finite data by message passing: Discontinuous versus continuous phase transition [J].
Huang, Haiping ;
Toyoizumi, Taro .
PHYSICAL REVIEW E, 2016, 94 (06)
[18]   Advanced mean-field theory of the restricted Boltzmann machine [J].
Huang, Haiping ;
Toyoizumi, Taro .
PHYSICAL REVIEW E, 2015, 91 (05)
[19]   The Nishimori line and Bayesian statistics [J].
Iba, Y .
JOURNAL OF PHYSICS A-MATHEMATICAL AND GENERAL, 1999, 32 (21) :3875-3888
[20]   Object perception as Bayesian inference [J].
Kersten, D ;
Mamassian, P ;
Yuille, A .
ANNUAL REVIEW OF PSYCHOLOGY, 2004, 55 :271-304