Replica Symmetry Breaking in Dense Hebbian Neural Networks

Cited by: 11
Authors
Albanese, Linda [1, 2, 3]
Alemanno, Francesco [1, 2]
Alessandrelli, Andrea [1, 3]
Barra, Adriano [1, 2]
Affiliations
[1] Univ Salento, Dipartimento Matemat & Fis, Via Arnesano, I-73100 Lecce, Italy
[2] Ist Nazl Fis Nucl, Campus Ecotekne, Via Monteroni, I-73100 Lecce, Italy
[3] Scuola Super ISUFI, Campus Ecotekne, Via Monteroni, I-73100 Lecce, Italy
Keywords
Hebbian neural networks; Replica symmetry breaking; Pattern recognition; Spin-glass model; Patterns
DOI
10.1007/s10955-022-02966-8
Chinese Library Classification (CLC) number
O4 [Physics]
Discipline classification code
0702
Abstract
Understanding the glassy nature of neural networks is pivotal for both theoretical and computational advances in Machine Learning and Theoretical Artificial Intelligence. Focusing on dense associative Hebbian neural networks (i.e. Hopfield networks with polynomial interactions of even degree P > 2), the purpose of this paper is twofold: first, we develop rigorous mathematical approaches to properly address a statistical-mechanical picture of the phenomenon of replica symmetry breaking (RSB) in these networks; then, building on the results obtained via these routes, we inspect the glassiness that they hide. Regarding the methodology, we provide two techniques: the former (closer to mathematical physics in spirit) is an adaptation of the transport PDE to this case, while the latter (more probabilistic in nature) is an extension of Guerra's interpolation breakthrough. Beyond the coherence among the results, both at the replica symmetric and at the one-step replica symmetry breaking level of description, we prove Gardner's picture (heuristically achieved through the replica trick) and we identify the maximal storage capacity by a ground-state analysis in the Baldi-Venkatesh high-storage regime. In the second part of the paper we investigate the glassy structure of these networks: in contrast to the replica symmetric (RS) scenario, RSB actually stabilizes the spin-glass phase. We report huge differences with respect to the standard pairwise Hopfield limit. In particular, it is known that the free energy of the Hopfield neural network (and, in cascade, all its properties) can be expressed as a linear combination of the free energies of a hard spin glass (i.e. the Sherrington-Kirkpatrick model) and a soft spin glass (the Gaussian or "spherical" model). While this continues to hold in the first step of RSB for the Hopfield model, it is no longer true when interactions are more than pairwise (whatever the level of description, RS or RSB): for dense networks, solely the free energy of the hard spin glass survives. As the Sherrington-Kirkpatrick spin glass is full-RSB (i.e. Parisi theory holds for that model), while the Gaussian spin glass is replica symmetric, these different representation theorems prove a huge diversity in the underlying glassiness of associative neural networks.
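As an informal illustration of the kind of model the abstract refers to, the following is a minimal sketch of a dense Hebbian network with P-body interactions. It is not taken from the paper: the energy normalization H = -N Σ_μ m_μ^P, the synchronous zero-temperature update, and all names (`overlaps`, `energy`, `update`) and parameter values are assumptions chosen for illustration, and the load used here is far below the high-storage regime discussed above.

```python
import numpy as np

def overlaps(sigma, xi):
    """Mattis overlaps m_mu = (1/N) * sum_i xi[mu, i] * sigma[i]."""
    return xi @ sigma / sigma.size

def energy(sigma, xi, P=4):
    """Dense Hebbian energy, H = -N * sum_mu m_mu**P (assumed normalization, even P)."""
    return -sigma.size * np.sum(overlaps(sigma, xi) ** P)

def update(sigma, xi, P=4):
    """One synchronous zero-temperature step (illustrative dynamics).

    Each spin aligns with its local field, taken to leading order in 1/N as
    h_i = P * sum_mu xi[mu, i] * m_mu**(P - 1).
    """
    h = P * (overlaps(sigma, xi) ** (P - 1)) @ xi
    return np.where(h >= 0, 1, -1)

# Hypothetical parameters: N neurons, K random binary patterns, interaction degree P.
rng = np.random.default_rng(0)
N, K, P = 200, 10, 4
xi = rng.choice([-1, 1], size=(K, N))

# Corrupt the first stored pattern by flipping 10% of its entries.
noisy = xi[0].copy()
flipped = rng.choice(N, size=20, replace=False)
noisy[flipped] *= -1

recovered = update(noisy, xi, P)
overlap_before = overlaps(noisy, xi)[0]
overlap_after = overlaps(recovered, xi)[0]
energy_before = energy(noisy, xi, P)
energy_after = energy(recovered, xi, P)
print(overlap_before, overlap_after, energy_before, energy_after)
```

At this low load the signal from the corrupted pattern dominates the cross-talk from the other patterns, so a single update is expected to raise the overlap with the stored pattern and lower the energy. The sketch says nothing about the RS or RSB free energies themselves, which require the analytical machinery described in the abstract.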
Pages: 41