Training High-Performance Low-Latency Spiking Neural Networks by Differentiation on Spike Representation

Cited by: 82
Authors
Meng, Qingyan [1 ,2 ]
Xiao, Mingqing [3 ]
Yan, Shen [4 ]
Wang, Yisen [3 ,5 ]
Lin, Zhouchen [3 ,5 ,6 ]
Luo, Zhi-Quan [1 ,2 ]
Affiliations
[1] Chinese Univ Hong Kong, Shenzhen, Peoples R China
[2] Shenzhen Res Inst Big Data, Shenzhen, Peoples R China
[3] Peking Univ, Sch Artificial Intelligence, Key Lab Machine Percept MoE, Beijing, Peoples R China
[4] Peking Univ, Ctr Data Sci, Beijing, Peoples R China
[5] Peking Univ, Inst Artificial Intelligence, Beijing, Peoples R China
[6] Peng Cheng Lab, Shenzhen, Guangdong, Peoples R China
Source
2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) | 2022
Funding
National Natural Science Foundation of China;
Keywords
DOI
10.1109/CVPR52688.2022.01212
CLC Number
TP18 [Artificial Intelligence Theory];
Discipline Codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Spiking Neural Networks (SNNs) are promising energy-efficient AI models when implemented on neuromorphic hardware. However, training SNNs efficiently is challenging due to their non-differentiability. Most existing methods either suffer from high latency (i.e., long simulation time steps) or cannot match the performance of Artificial Neural Networks (ANNs). In this paper, we propose the Differentiation on Spike Representation (DSR) method, which achieves performance competitive with ANNs while maintaining low latency. First, we encode spike trains into spike representations using (weighted) firing rate coding. Based on the spike representation, we systematically derive that the spiking dynamics of common neuron models can be represented as a sub-differentiable mapping. With this viewpoint, the proposed DSR method trains SNNs through the gradients of this mapping and avoids the usual non-differentiability problem in SNN training. We then analyze the error incurred when representing this mapping with the forward computation of the SNN. To reduce this error, we propose training the spike threshold in each layer and introducing a new hyperparameter for the neuron models. With these components, the DSR method achieves state-of-the-art SNN performance with low latency on both static and neuromorphic datasets, including CIFAR-10, CIFAR-100, ImageNet, and DVS-CIFAR10.
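To make the abstract's core idea concrete, the following is a minimal PyTorch sketch, not the authors' implementation, of differentiation on spike representation for integrate-and-fire (IF) neurons: the forward pass simulates the non-differentiable spiking dynamics and returns the firing-rate representation, while the backward pass differentiates the sub-differentiable mapping that this representation approximates, here clamp(x, 0, v_th). The class name DSRSpike, the constant-input simulation, and the default values of v_th and T are illustrative assumptions.

import torch

class DSRSpike(torch.autograd.Function):
    """Sketch: forward simulates IF spiking dynamics; backward uses the
    sub-gradient of the equivalent mapping clamp(x, 0, v_th)."""

    @staticmethod
    def forward(ctx, x, v_th=1.0, T=8):
        ctx.save_for_backward(x)
        ctx.v_th = v_th
        v = torch.zeros_like(x)        # membrane potential
        rate = torch.zeros_like(x)     # weighted firing-rate representation
        for _ in range(T):
            v = v + x                            # integrate input current
            spike = (v >= v_th).to(x.dtype)      # non-differentiable spike
            v = v - spike * v_th                 # reset by subtraction
            rate = rate + spike * v_th / T       # accumulate scaled spikes
        return rate                              # approximates clamp(x, 0, v_th)

    @staticmethod
    def backward(ctx, grad_out):
        (x,) = ctx.saved_tensors
        # Pass gradients only where clamp(x, 0, v_th) is non-constant.
        inside = ((x >= 0) & (x <= ctx.v_th)).to(grad_out.dtype)
        return grad_out * inside, None, None     # no grads for v_th and T

For example, calling DSRSpike.apply on x = torch.randn(4, requires_grad=True) and backpropagating through the summed output populates x.grad with the clamp sub-gradient, even though the forward pass emitted only binary spikes. As T grows, the returned rate converges to clamp(x, 0, v_th); the gap at finite T is the representation error that, per the abstract, the paper reduces by training the threshold v_th in each layer.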
Pages: 12434-12443
Number of pages: 10