STDP-based spiking deep convolutional neural networks for object recognition

被引:519
作者
Kheradpisheh, Saeed Reza [1 ,2 ]
Ganjtabesh, Mohammad [1 ]
Thorpe, Simon J. [2 ]
Masquelier, Timothee [2 ]
机构
[1] Univ Tehran, Sch Math Stat & Comp Sci, Dept Comp Sci, Tehran, Iran
[2] Univ Toulouse 3, CNRS, CERCO, UMR 5549, Toulouse, France
基金
欧洲研究理事会;
关键词
Spiking neural network; STDP; Deep learning; Object recognition; Temporal coding; TIMING-DEPENDENT PLASTICITY; SYNAPTIC PLASTICITY; VISUAL FEATURES; CATEGORIZATION; NEURONS; BRAIN; SPEED;
D O I
10.1016/j.neunet.2017.12.005
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Previous studies have shown that spike-timing-dependent plasticity (STDP) can be used in spiking neural networks (SNN) to extract visual features of low or intermediate complexity in an unsupervised manner. These studies, however, used relatively shallow architectures, and only one layer was trainable. Another line of research has demonstrated-using rate-based neural networks trained with back-propagation that having many layers increases the recognition robustness, an approach known as deep learning. We thus designed a deep SNN, comprising several convolutional (trainable with STDP) and pooling layers. We used a temporal coding scheme where the most strongly activated neurons fire first, and less activated neurons fire later or not at all. The network was exposed to natural images. Thanks to STDP, neurons progressively learned features corresponding to prototypical patterns that were both salient and frequent. Only a few tens of examples per category were required and no label was needed. After learning, the complexity of the extracted features increased along the hierarchy, from edge detectors in the first layer to object prototypes in the last layer. Coding was very sparse, with only a few thousands spikes per image, and in some cases the object category could be reasonably well inferred from the activity of a single higher-order neuron. More generally, the activity of a few hundreds of such neurons contained robust category information, as demonstrated using a classifier on Caltech 101, ETH-80, and MNIST databases. We also demonstrate the superiority of STDP over other unsupervised techniques such as random crops (HMAX) or auto-encoders. Taken together, our results suggest that the combination of STDP with latency coding may be a key to understanding the way that the primate visual system learns, its remarkable processing speed and its low energy consumption. These mechanisms are also interesting for artificial vision systems, particularly for hardware solutions. (c) 2017 Elsevier Ltd. All rights reserved.
引用
收藏
页码:56 / 67
页数:12
相关论文
共 59 条
[1]  
[Anonymous], 2002, Computational Neuroscience of Vision
[2]  
[Anonymous], 1991, THESIS
[3]  
[Anonymous], 2015, arXiv preprint arXiv:1510.08829
[4]   Categorization and decision-making in a neurobiologically plausible spiking network using a STDP-like learning rule [J].
Beyeler, Michael ;
Dutt, Nikil D. ;
Krichmar, Jeffrey L. .
NEURAL NETWORKS, 2013, 48 :109-124
[5]   Learning real-world stimuli in a neural network with spike-driven synaptic dynamics [J].
Brader, Joseph M. ;
Senn, Walter ;
Fusi, Stefano .
NEURAL COMPUTATION, 2007, 19 (11) :2881-2912
[6]   Mirrored STDP Implements Autoencoder Learning in a Network of Spiking Neurons [J].
Burbank, Kendra S. .
PLOS COMPUTATIONAL BIOLOGY, 2015, 11 (12)
[7]   Deep Neural Networks Rival the Representation of Primate IT Cortex for Core Visual Object Recognition [J].
Cadieu, Charles F. ;
Hong, Ha ;
Yamins, Daniel L. K. ;
Pinto, Nicolas ;
Ardila, Diego ;
Solomon, Ethan A. ;
Majaj, Najib J. ;
DiCarlo, James J. .
PLOS COMPUTATIONAL BIOLOGY, 2014, 10 (12)
[8]   Spiking Deep Convolutional Neural Networks for Energy-Efficient Object Recognition [J].
Cao, Yongqiang ;
Chen, Yang ;
Khosla, Deepak .
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2015, 113 (01) :54-66
[9]   Comparison of deep neural networks to spatio-temporal cortical dynamics of human visual object recognition reveals hierarchical correspondence [J].
Cichy, Radoslaw Martin ;
Khosla, Aditya ;
Pantazis, Dimitrios ;
Torralba, Antonio ;
Oliva, Aude .
SCIENTIFIC REPORTS, 2016, 6
[10]   Networks of integrate-and-fire neurons using Rank Order Coding B: Spike timing dependent plasticity and emergence of orientation selectivity [J].
Delorme, A ;
Perrinet, L ;
Thorpe, SJ .
NEUROCOMPUTING, 2001, 38 :539-545