Learning restricted Boltzmann machines with pattern induced weights

Cited by: 0
Authors
Gari, J. [1]
Romero, E. [2]
Mazzanti, F. [1]
Affiliations
[1] Univ Politecn Cataluna, Dept Fis, Campus Nord B4-B5, E-08034 Barcelona, Spain
[2] Univ Politecn Cataluna, Dept Ciencies Computacio, Campus Nord Omega, E-08034 Barcelona, Spain
Keywords
HOPFIELD NETWORKS; NEURAL-NETWORKS; ALGORITHM;
DOI
10.1016/j.neucom.2024.128469
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Restricted Boltzmann Machines (RBMs) are energy-based models capable of learning probability distributions. In practice, though, their use is seriously limited by the prohibitively high computational cost of exactly evaluating the gradients required during learning. The standard approach to mitigate this problem is the Contrastive Divergence algorithm, but it yields a rough approximation that presents issues of its own. As a completely different alternative, a model called RAPID (Pozas-Kerstjens et al., 2021) recently appeared, in which unit weights are constructed from high-probability patterns that allow the update rules to be evaluated efficiently during learning. In this work we analyze RAPID and find that it also presents some drawbacks that constrain its performance. We identify the sources of these problems in RAPID and modify them accordingly to build a similar but more flexible alternative, called PIW (Pattern Induced Weights). Experiments show that PIW performs better than the original RAPID implementation, bringing it to a competitive level when compared to a standard RBM with CDk, with a substantial reduction in the number of training parameters.
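To make the cost argument in the abstract concrete, the following is a minimal sketch of a textbook binary RBM, not the RAPID or PIW method of the paper. It assumes 0/1 units and the standard energy E(v, h) = -b·v - c·h - v·W·h; all function names are hypothetical and chosen for illustration only. It contrasts a brute-force evaluation of log Z, which enumerates all 2^n visible states, with a standard CD-k estimate of the weight gradient.

```python
import itertools
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def free_energy(v, W, b, c):
    """F(v) = -b.v - sum_j log(1 + exp(c_j + sum_i v_i W[i][j])) for 0/1 units."""
    visible_term = sum(bi * vi for bi, vi in zip(b, v))
    hidden_term = 0.0
    for j in range(len(c)):
        pre = c[j] + sum(v[i] * W[i][j] for i in range(len(v)))
        hidden_term += math.log1p(math.exp(pre))
    return -visible_term - hidden_term


def exact_log_partition(W, b, c):
    """Brute-force log Z: sums over all 2**n_visible states (tiny models only)."""
    terms = [-free_energy(v, W, b, c)
             for v in itertools.product((0, 1), repeat=len(b))]
    m = max(terms)  # log-sum-exp shift for numerical stability
    return m + math.log(sum(math.exp(t - m) for t in terms))


def sample_hidden(v, W, c, rng):
    """Return (activation probabilities, sampled 0/1 states) of the hidden units."""
    probs = [sigmoid(c[j] + sum(v[i] * W[i][j] for i in range(len(v))))
             for j in range(len(c))]
    return probs, [1 if rng.random() < p else 0 for p in probs]


def sample_visible(h, W, b, rng):
    """Return (activation probabilities, sampled 0/1 states) of the visible units."""
    probs = [sigmoid(b[i] + sum(W[i][j] * h[j] for j in range(len(h))))
             for i in range(len(b))]
    return probs, [1 if rng.random() < p else 0 for p in probs]


def cd_k_weight_gradient(v0, W, b, c, k, rng):
    """Standard CD-k estimate of d log p(v0) / dW from a k-step Gibbs chain."""
    if k < 1:
        raise ValueError("k must be at least 1")
    ph0, h = sample_hidden(v0, W, c, rng)
    vk = v0
    for _ in range(k):
        _, vk = sample_visible(h, W, b, rng)
        phk, h = sample_hidden(vk, W, c, rng)
    # Positive phase (data) minus negative phase (chain sample).
    return [[v0[i] * ph0[j] - vk[i] * phk[j] for j in range(len(c))]
            for i in range(len(b))]
```

The enumeration in `exact_log_partition` doubles in cost with every added visible unit, which is why exact gradients are impractical at realistic sizes, while `cd_k_weight_gradient` replaces the model expectation with a short Gibbs chain started at the data point, at the price of a biased estimate.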
Pages: 9
Related papers
65 in total
[1] ACKLEY DH, 1985, COGNITIVE SCI, V9, P147
[2] Agliari E., 2021, Learning and retrieval operational modes for three-layer restricted Boltzmann machines, V185
[3] The emergence of a concept in shallow neural networks [J]. Agliari, Elena; Alemanno, Francesco; Barra, Adriano; De Marzo, Giordano. NEURAL NETWORKS, 2022, 148: 232-253
[4] Neural Networks Retrieving Boolean Patterns in a Sea of Gaussian Ones [J]. Agliari, Elena; Barra, Adriano; Longo, Chiara; Tantari, Daniele. JOURNAL OF STATISTICAL PHYSICS, 2017, 168 (05): 1085-1104
[5] Parallel retrieval of correlated patterns: From Hopfield networks to Boltzmann machines [J]. Agliari, Elena; Barra, Adriano; De Antoni, Andrea; Galluzzi, Andrea. NEURAL NETWORKS, 2013, 38: 52-63
[6] ai, OCR-Letters data set
[7] Amit D.J., 1989, Modeling brain function: the world of attractor neural networks, DOI 10.1017/CBO9780511623257
[8] [Anonymous], 2011, P MACHINE LEARNING R
[9] archive, Connect-4 data set
[10] On the equivalence of Hopfield networks and Boltzmann Machines [J]. Barra, Adriano; Bernacchia, Alberto; Santucci, Enrica; Contucci, Pierluigi. NEURAL NETWORKS, 2012, 34: 1-9