Hebbian learning from first principles

被引:1
作者
Albanese, L. [1 ,3 ]
Barra, A. [2 ]
Bianco, P. [1 ]
Durante, F. [1 ]
Pallara, D. [1 ]
机构
[1] Univ Salento, Dipartimento Matemat & Fis Ennio De Giorgi, I-73100 Lecce, Italy
[2] Sapienza Univ Roma, Dipartimento Sci Base & Appl Ingn, I-00161 Rome, Italy
[3] Ist Nazl Fis Nucl, Sez Lecce, I-73100 Lecce, Italy
关键词
STATISTICAL-MECHANICS; NEURAL-NETWORKS; NUMBER; MODEL;
D O I
10.1063/5.0197652
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
Recently, the original storage prescription for the Hopfield model of neural networks - as well as for its dense generalizations - has been turned into a genuine Hebbian learning rule by postulating the expression of its Hamiltonian for both the supervised and unsupervised protocols. In these notes, first, we obtain these explicit expressions by relying upon maximum entropy extremization & agrave; la Jaynes. Beyond providing a formal derivation of these recipes for Hebbian learning, this construction also highlights how Lagrangian constraints within entropy extremization force network's outcomes on neural correlations: these try to mimic the empirical counterparts hidden in the datasets provided to the network for its training and, the denser the network, the longer the correlations that it is able to capture. Next, we prove that, in the big data limit, whatever the presence of a teacher (or its lacking), not only these Hebbian learning rules converge to the original storage prescription of the Hopfield model but also their related free energies (and, thus, the statistical mechanical picture provided by Amit, Gutfreund and Sompolinsky is fully recovered). As a sideline, we show mathematical equivalence among standard Cost functions (Hamiltonian), preferred in Statistical Mechanical jargon, and quadratic Loss Functions, preferred in Machine Learning terminology. Remarks on the exponential Hopfield model (as the limit of dense networks with diverging density) and semi-supervised protocols are also provided.
引用
收藏
页数:27
相关论文
共 52 条
[1]   Dense Hebbian neural networks: A replica symmetric picture of supervised learning [J].
Agliari E. ;
Albanese L. ;
Alemanno F. ;
Alessandrelli A. ;
Barra A. ;
Giannotti F. ;
Lotito D. ;
Pedreschi D. .
Physica A: Statistical Mechanics and its Applications, 2023, 626
[2]   Dense Hebbian neural networks: A replica symmetric picture of unsupervised learning [J].
Agliari, Elena ;
Albanese, Linda ;
Alemanno, Francesco ;
Alessandrelli, Andrea ;
Barra, Adriano ;
Giannotti, Fosca ;
Lotito, Daniele ;
Pedreschi, Dino .
PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2023, 627
[3]   The emergence of a concept in shallow neural networks [J].
Agliari, Elena ;
Alemanno, Francesco ;
Barra, Adriano ;
De Marzo, Giordano .
NEURAL NETWORKS, 2022, 148 :232-253
[4]   Machine learning and statistical physics: preface [J].
Agliari, Elena ;
Barra, Adriano ;
Sollich, Peter ;
Zdeborova, Lenka .
JOURNAL OF PHYSICS A-MATHEMATICAL AND THEORETICAL, 2020, 53 (50)
[5]   Tolerance versus synaptic noise in dense associative memories [J].
Agliari, Elena ;
De Marzo, Giordano .
EUROPEAN PHYSICAL JOURNAL PLUS, 2020, 135 (11)
[6]   Replica symmetry breaking in neural networks: a few steps toward rigorous results [J].
Agliari, Elena ;
Albanese, Linda ;
Barra, Adriano ;
Ottaviani, Gabriele .
JOURNAL OF PHYSICS A-MATHEMATICAL AND THEORETICAL, 2020, 53 (41)
[7]   Generalized Guerra's interpolation schemes for dense associative neural networks [J].
Agliari, Elena ;
Alemanno, Francesco ;
Barra, Adriano ;
Fachechi, Alberto .
NEURAL NETWORKS, 2020, 128 :254-267
[8]   Neural Networks with a Redundant Representation: Detecting the Undetectable [J].
Agliari, Elena ;
Alemanno, Francesco ;
Barra, Adriano ;
Centonze, Martino ;
Fachechi, Alberto .
PHYSICAL REVIEW LETTERS, 2020, 124 (02)
[9]   Notes on the p-spin glass studied via Hamilton-Jacobi and smooth-cavity techniques [J].
Agliari, Elena ;
Barra, Adriano ;
Burioni, Raffaella ;
Di Biasio, Aldo .
JOURNAL OF MATHEMATICAL PHYSICS, 2012, 53 (06)
[10]   Replica Symmetry Breaking in Dense Hebbian Neural Networks [J].
Albanese, Linda ;
Alemanno, Francesco ;
Alessandrelli, Andrea ;
Barra, Adriano .
JOURNAL OF STATISTICAL PHYSICS, 2022, 189 (02)