Memory-Efficient Deep Learning on a SpiNNaker 2 Prototype

被引:37
作者
Liu, Chen [1 ]
Bellec, Guillaume [2 ]
Vogginger, Bernhard [1 ]
Kappel, David [1 ,2 ,3 ]
Partzsch, Johannes [1 ]
Neumaerker, Felix [1 ]
Hoeppner, Sebastian [1 ]
Maass, Wolfgang [2 ]
Furber, Steve B. [4 ]
Legenstein, Robert [2 ]
Mayr, Christian G. [1 ]
机构
[1] Tech Univ Dresden, Inst Circuits & Syst, Dept Elect Engn & Informat Technol, Chair Highly Parallel VLSI Syst & Neuromorph Circ, Dresden, Germany
[2] Graz Univ Technol, Inst Theoret Comp Sci, Graz, Austria
[3] Georg August Univ, Bernstein Ctr Computat Neurosci, Ill Phys Inst Biophys, Gottingen, Germany
[4] Univ Manchester, Sch Comp Sci, Adv Processor Technol Grp, Manchester, Lancs, England
基金
英国工程与自然科学研究理事会;
关键词
deep rewiring; pruning; sparsity; SpiNNaker; memory footprint; parallelism; energy efficient hardware; NETWORKS; PLASTICITY;
D O I
10.3389/fnins.2018.00840
中图分类号
Q189 [神经科学];
学科分类号
071006 ;
摘要
The memory requirement of deep learning algorithms is considered incompatible with the memory restriction of energy-efficient hardware. A low memory footprint can be achieved by pruning obsolete connections or reducing the precision of connection strengths after the network has been trained. Yet, these techniques are not applicable to the case when neural networks have to be trained directly on hardware due to the hard memory constraints. Deep Rewiring (DEEP R) is a training algorithm which continuously rewires the network while preserving very sparse connectivity all along the training procedure. We apply DEEP R to a deep neural network implementation on a prototype chip of the 2nd generation SpiNNaker system. The local memory of a single core on this chip is limited to 64 KB and a deep network architecture is trained entirely within this constraint without the use of external memory. Throughout training, the proportion of active connections is limited to 1.3%. On the handwritten digits dataset MNIST, this extremely sparse network achieves 96.6% classification accuracy at convergence. Utilizing the multi-processor feature of the SpiNNaker system, we found very good scaling in terms of computation time, per-core memory consumption, and energy constraints. When compared to a X86 CPU implementation, neural network training on the SpiNNaker 2 prototype improves power and energy consumption by two orders of magnitude.
引用
收藏
页数:15
相关论文
共 62 条
[1]   NullHop: A Flexible Convolutional Neural Network Accelerator Based on Sparse Representations of Feature Maps [J].
Aimar, Alessandro ;
Mostafa, Hesham ;
Calabrese, Enrico ;
Rios-Navarro, Antonio ;
Tapiador-Morales, Ricardo ;
Lungu, Iulia-Alexandra ;
Milde, Moritz B. ;
Corradi, Federico ;
Linares-Barranco, Alejandro ;
Liu, Shih-Chii ;
Delbruck, Tobi .
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2019, 30 (03) :644-656
[2]   True North: Design and Tool Flow of a 65 mW 1 Million Neuron Programmable Neurosynaptic Chip [J].
Akopyan, Filipp ;
Sawada, Jun ;
Cassidy, Andrew ;
Alvarez-Icaza, Rodrigo ;
Arthur, John ;
Merolla, Paul ;
Imam, Nabil ;
Nakamura, Yutaka ;
Datta, Pallab ;
Nam, Gi-Joon ;
Taba, Brian ;
Beakes, Michael ;
Brezzo, Bernard ;
Kuang, Jente B. ;
Manohar, Rajit ;
Risk, William P. ;
Jackson, Bryan ;
Modha, Dharmendra S. .
IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2015, 34 (10) :1537-1557
[3]   The Human Brain Project: Creating a European Research Infrastructure to Decode the Human Brain [J].
Amunts, Katrin ;
Ebell, Christoph ;
Muller, Jeff ;
Telefont, Martin ;
Knoll, Alois ;
Lippert, Thomas .
NEURON, 2016, 92 (03) :574-581
[4]  
[Anonymous], 2016, 2016 IEEE NORD CIRC, DOI DOI 10.1109/NORCHIP.2016.7792875
[5]  
[Anonymous], 2015, P INT C EV BAS CONTR
[6]  
[Anonymous], 2017 IEEE INT S CIRC
[7]  
[Anonymous], 2016, ARXIV PREPRINT ARXIV
[8]  
[Anonymous], 2016, BinaryNet: Training deep neural networks with weights and activa
[9]  
[Anonymous], 2017, PROC IEEE INT S CIRC, DOI DOI 10.1109/ISCAS.2017.8050528
[10]  
[Anonymous], 2005, BUCE0506 BILK U