FastHebb: Scaling Hebbian Training of Deep Neural Networks to ImageNet Level

被引：6

作者：

Lagani, Gabriele ^{[1
,2
]}

Gennaro, Claudio ^{[2
]}

Fassold, Hannes ^{[3
]}

Amato, Giuseppe ^{[1
]}

机构：

[1] Univ Pisa, Dept Comp Sci, I-56127 Pisa, Italy

[2] ISTI CNR, I-56124 Pisa, Italy

[3] Joanneum Res, A-8010 Graz, Austria

来源：

SIMILARITY SEARCH AND APPLICATIONS (SISAP 2022) | 2022年 / 13590卷

关键词：

Hebbian learning; Deep learning; Neural networks; Semi-supervised; Sample efficiency; Content-Based Image Retrieval; OPTIMIZATION;

D O I：

10.1007/978-3-031-17849-8_20

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Learning algorithms for Deep Neural Networks are typically based on supervised end-to-end Stochastic Gradient Descent (SGD) training with error backpropagation (backprop). Backprop algorithms require a large number of labelled training samples to achieve high performance. However, in many realistic applications, even if there is plenty of image samples, very few of them are labelled, and semi-supervised sample-efficient training strategies have to be used. Hebbian learning represents a possible approach towards sample efficient training; however, in current solutions, it does not scale well to large datasets. In this paper, we present FastHebb, an efficient and scalable solution for Hebbian learning which achieves higher efficiency by 1) merging together update computation and aggregation over a batch of inputs, and 2) leveraging efficient matrix multiplication algorithms on GPU. We validate our approach on different computer vision benchmarks, in a semi-supervised learning scenario. FastHebb outperforms previous solutions by up to 50 times in terms of training speed, and notably, for the first time, we are able to bring Hebbian algorithms to ImageNet scale.

引用

页码：251 / 264

页数：14

共 38 条

[1] Hebbian Learning Meets Deep Convolutional Neural Networks [J].

Amato, Giuseppe ;

Carrara, Fabio ;

Falchi, Fabrizio ;

Gennaro, Claudio ;

Lagani, Gabriele .

IMAGE ANALYSIS AND PROCESSING - ICIAP 2019, PT I, 2019, 11751 :324-334

[2] Neural Codes for Image Retrieval [J].

Babenko, Artem ;

Slesarev, Anton ;

Chigorin, Alexandr ;

Lempitsky, Victor .

COMPUTER VISION - ECCV 2014, PT I, 2014, 8689 :584-599

[3] Online Representation Learning with Single and Multi-layer Hebbian Networks for Image Classification [J].

Bahroun, Yanis ;

Soltoggio, Andrea .

ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2017, PT I, 2017, 10613 :354-363

[4] Unsupervised neural network learning procedures for feature extraction and classification [J].

Becker, S ;

Plumbley, M .

APPLIED INTELLIGENCE, 1996, 6 (03) :185-203

[5]

Bengio Y., 2007, ADV NEURAL INFORM PR, P153

[6]

Deng J, 2009, PROC CVPR IEEE, P248, DOI 10.1109/CVPRW.2009.5206848

[7]

Gerstner W., 2002, SPIKING NEURON MODEL, P3

[8] ADAPTIVE PATTERN-CLASSIFICATION AND UNIVERSAL RECODING .1. PARALLEL DEVELOPMENT AND CODING OF NEURAL FEATURE DETECTORS [J].

GROSSBERG, S .

BIOLOGICAL CYBERNETICS, 1976, 23 (03) :121-134

[9]

Haykin S.O., 2011, Neural Networks and Learning Machines

[10] Deep Residual Learning for Image Recognition [J].

He, Kaiming ;

Zhang, Xiangyu ;

Ren, Shaoqing ;

Sun, Jian .

2016 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2016, :770-778

← 1 2 3 4 →