Statistical Physics of Unsupervised Learning with Prior Knowledge in Neural Networks

Cited by: 12

Authors:

Hou, Tianqi [1,2]

Huang, Haiping [2]

Affiliations:

[1] Hong Kong Univ Sci & Technol, Dept Phys, Clear Water Bay, Hong Kong, Peoples R China

[2] Sun Yat Sen Univ, Sch Phys, PMI Lab, Guangzhou 510275, Peoples R China

Source:

PHYSICAL REVIEW LETTERS | 2020 / Vol. 124 / Issue 24

Keywords:

BAYESIAN-INFERENCE;

DOI:

10.1103/PhysRevLett.124.248302

Chinese Library Classification:

O4 [Physics];

Discipline classification code:

0702 ;

Abstract:

Integrating sensory inputs with prior beliefs from past experiences in unsupervised learning is a common and fundamental characteristic of brain or artificial neural computation. However, a quantitative role of prior knowledge in unsupervised learning remains unclear, prohibiting a scientific understanding of unsupervised learning. Here, we propose a statistical physics model of unsupervised learning with prior knowledge, revealing that the sensory inputs drive a series of continuous phase transitions related to spontaneous intrinsic-symmetry breaking. The intrinsic symmetry includes both reverse symmetry and permutation symmetry, commonly observed in most artificial neural networks. Compared to the prior-free scenario, the prior reduces more strongly the minimal data size triggering the reverse-symmetry breaking transition, and moreover, the prior merges, rather than separates, permutation-symmetry breaking phases. We claim that the prior can be learned from data samples, which in physics corresponds to a two-parameter Nishimori constraint. This Letter thus reveals mechanisms about the influence of the prior on unsupervised learning.

References

Pages: 5

31 entries in total

[1] [Anonymous], 2009, INFORM PHYS COMPUTAT, DOI 10.1093/ACPROF:OSO/9780198570837.001

[2] [Anonymous], 1961, Sensory communication

[3] Atanov A., ARXIV181006943

[4] Unsupervised Learning [J]. Barlow, H. B. NEURAL COMPUTATION, 1989, 1 (03): 295-311

[5] Probabilistic Population Codes for Bayesian Decision Making [J]. Beck, Jeffrey M.; Ma, Wei Ji; Kiani, Roozbeh; Hanks, Tim; Churchland, Anne K.; Roitman, Jamie; Shadlen, Michael N.; Latham, Peter E.; Pouget, Alexandre. NEURON, 2008, 60 (06): 1142-1152

[6] Blundell C, 2015, PR MACH LEARN RES, V37, P1613

[7] Neural implementation of Bayesian inference in a sensorimotor behavior [J]. Darlington, Timothy R.; Beck, Jeffrey M.; Lisberger, Stephen G. NATURE NEUROSCIENCE, 2018, 21 (10): 1442-+

[8] MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM [J]. DEMPSTER, AP; LAIRD, NM; RUBIN, DB. JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01): 1-38

[9] Predicting the Future as Bayesian Inference: People Combine Prior Knowledge With Observations When Estimating Duration and Extent [J]. Griffiths, Thomas L.; Tenenbaum, Joshua B. JOURNAL OF EXPERIMENTAL PSYCHOLOGY-GENERAL, 2011, 140 (04): 725-743

[10] Hansem S., 2019, NEURON, V103, P934
