Neural networks and quantum field theory

被引:51
作者
Halverson, James [1 ]
Maiti, Anindita [1 ]
Stoner, Keegan [1 ]
机构
[1] Northeastern Univ, Dept Phys, Boston, MA 02115 USA
来源
MACHINE LEARNING-SCIENCE AND TECHNOLOGY | 2021年 / 2卷 / 03期
关键词
Wilsonian RG flow; infinite width NNGP; finite width NN; Gaussian processes; quantum field theory; RECOGNITION;
D O I
10.1088/2632-2153/abeca3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a theoretical understanding of neural networks in terms of Wilsonian effective field theory. The correspondence relies on the fact that many asymptotic neural networks are drawn from Gaussian processes (GPs), the analog of non-interacting field theories. Moving away from the asymptotic limit yields a non-Gaussian process (NGP) and corresponds to turning on particle interactions, allowing for the computation of correlation functions of neural network outputs with Feynman diagrams. Minimal NGP likelihoods are determined by the most relevant non-Gaussian terms, according to the flow in their coefficients induced by the Wilsonian renormalization group. This yields a direct connection between overparameterization and simplicity of neural network likelihoods. Whether the coefficients are constants or functions may be understood in terms of GP limit symmetries, as expected from 't Hooft's technical naturalness. General theoretical calculations are matched to neural network experiments in the simplest class of models allowing the correspondence. Our formalism is valid for any of the many architectures that becomes a GP in an asymptotic limit, a property preserved under certain types of training.
引用
收藏
页数:48
相关论文
共 41 条
[11]  
Defferrard M, 2016, ADV NEUR IN, V29
[12]  
Dyer E., 2020, ARXIV190911304
[13]  
Fermi E., 1933, Ric. Sci., V4, P491
[14]   COGNITRON - SELF-ORGANIZING MULTILAYERED NEURAL NETWORK [J].
FUKUSHIMA, K .
BIOLOGICAL CYBERNETICS, 1975, 20 (3-4) :121-136
[16]  
Garriga-Alonso Aa., 2019, ARXIV180805587
[17]  
Gomez-Bombarelli R., 2015, ADV NEURAL INFORM PR, V28, P2224
[18]   Deep Residual Learning for Image Recognition [J].
He, Kaiming ;
Zhang, Xiangyu ;
Ren, Shaoqing ;
Sun, Jian .
2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778
[19]  
Henaff M, 2015, DEEP CONVOLUTIONAL N
[20]  
Ioffe S, 2015, PR MACH LEARN RES, V37, P448