Statistical Physics of Unsupervised Learning with Prior Knowledge in Neural Networks

被引:12
作者
Hou, Tianqi [1 ,2 ]
Huang, Haiping [2 ]
机构
[1] Hong Kong Univ Sci & Technol, Dept Phys, Clear Water Bay, Hong Kong, Peoples R China
[2] Sun Yat Sen Univ, Sch Phys, PMI Lab, Guangzhou 510275, Peoples R China
关键词
BAYESIAN-INFERENCE;
D O I
10.1103/PhysRevLett.124.248302
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
Integrating sensory inputs with prior beliefs from past experiences in unsupervised learning is a common and fundamental characteristic of brain or artificial neural computation. However, a quantitative role of prior knowledge in unsupervised learning remains unclear, prohibiting a scientific understanding of unsupervised learning. Here, we propose a statistical physics model of unsupervised learning with prior knowledge, revealing that the sensory inputs drive a series of continuous phase transitions related to spontaneous intrinsic-symmetry breaking. The intrinsic symmetry includes both reverse symmetry and permutation symmetry, commonly observed in most artificial neural networks. Compared to the prior-free scenario, the prior reduces more strongly the minimal data size triggering the reverse-symmetry breaking transition, and moreover, the prior merges, rather than separates, permutation-symmetry breaking phases. We claim that the prior can be learned from data samples, which in physics corresponds to a two-parameter Nishimori constraint. This Letter thus reveals mechanisms about the influence of the prior on unsupervised learning.
引用
收藏
页数:5
相关论文
共 31 条
[1]  
[Anonymous], 2009, INFORM PHYS COMPUTAT, DOI DOI 10.1093/ACPROF:OSO/9780198570837.001
[2]  
[Anonymous], 1961, Sensory communication
[3]  
Atanov A., ARXIV181006943
[4]   Unsupervised Learning [J].
Barlow, H. B. .
NEURAL COMPUTATION, 1989, 1 (03) :295-311
[5]   Probabilistic Population Codes for Bayesian Decision Making [J].
Beck, Jeffrey M. ;
Ma, Wei Ji ;
Kiani, Roozbeh ;
Hanks, Tim ;
Churchland, Anne K. ;
Roitman, Jamie ;
Shadlen, Michael N. ;
Latham, Peter E. ;
Pouget, Alexandre .
NEURON, 2008, 60 (06) :1142-1152
[6]  
Blundell C, 2015, PR MACH LEARN RES, V37, P1613
[7]   Neural implementation of Bayesian inference in a sensorimotor behavior [J].
Darlington, Timothy R. ;
Beck, Jeffrey M. ;
Lisberger, Stephen G. .
NATURE NEUROSCIENCE, 2018, 21 (10) :1442-+
[8]   MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM [J].
DEMPSTER, AP ;
LAIRD, NM ;
RUBIN, DB .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01) :1-38
[9]   Predicting the Future as Bayesian Inference: People Combine Prior Knowledge With Observations When Estimating Duration and Extent [J].
Griffiths, Thomas L. ;
Tenenbaum, Joshua B. .
JOURNAL OF EXPERIMENTAL PSYCHOLOGY-GENERAL, 2011, 140 (04) :725-743
[10]  
Hansem S., 2019, NEURON, V103, P934