Parallel Architecture With Resistive Crosspoint Array for Dictionary Learning Acceleration

被引:50
作者
Kadetotad, Deepak [1 ]
Xu, Zihan [1 ]
Mohanty, Abinash [1 ]
Chen, Pai-Yu [1 ]
Lin, Binbin [1 ]
Ye, Jieping [1 ]
Vrudhula, Sarma [1 ]
Yu, Shimeng [1 ]
Cao, Yu [1 ]
Seo, Jae-Sun [1 ]
机构
[1] Arizona State Univ, Dept Elect Engn, Tempe, AZ 85281 USA
基金
美国国家科学基金会;
关键词
CMOS integration; dictionary learning; memristive device; parallel computing; resistive crosspoint array; DEVICE; NEURONS;
D O I
10.1109/JETCAS.2015.2426495
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This paper proposes a parallel architecture with resistive crosspoint array. The design of its two essential operations, read and write, is inspired by the biophysical behavior of a neural system, such as integrate-and-fire and local synapse weight update. The proposed hardware consists of an array with resistive random access memory (RRAM) and CMOS peripheral circuits, which perform matrix-vector multiplication and dictionary update in a fully parallel fashion, at the speed that is independent of the matrix dimension. The read and write circuits are implemented in 65 nm CMOS technology and verified together with an array of RRAM device model built from experimental data. The overall system exploits array-level parallelism and is demonstrated for accelerated dictionary learning tasks. As compared to software implementation running on a 8-core CPU, the proposed hardware achieves more than 3000X speedup, enabling high-speed feature extraction on a single chip.
引用
收藏
页码:194 / 204
页数:11
相关论文
共 32 条
[1]   Lapicque's introduction of the integrate-and-fire model neuron (1907) [J].
Abbott, LF .
BRAIN RESEARCH BULLETIN, 1999, 50 (5-6) :303-304
[2]   Efficient Hybrid CMOS-Nano Circuit Design for Spiking Neurons and Memristive Synapses with STDP [J].
Afifi, Ahmad ;
Ayatollahi, Ahmad ;
Raissi, Farshid ;
Hajghassem, Hasan .
IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2010, E93A (09) :1670-1677
[3]  
[Anonymous], P 2014 IEEE INT EL D
[4]  
[Anonymous], 2013, IEDM
[5]  
[Anonymous], 2009, P 26 ANN INT C MACH, DOI DOI 10.1145/1553374.1553463
[6]   Synaptic modifications in cultured hippocampal neurons: Dependence on spike timing, synaptic strength, and postsynaptic cell type [J].
Bi, GQ ;
Poo, MM .
JOURNAL OF NEUROSCIENCE, 1998, 18 (24) :10464-10472
[7]  
Bottou L, 2012, OPTIMIZATION FOR MACHINE LEARNING, P351
[8]  
Coates A., 2011, International Conference on Machine Learning, P921
[9]   An iterative thresholding algorithm for linear inverse problems with a sparsity constraint [J].
Daubechies, I ;
Defrise, M ;
De Mol, C .
COMMUNICATIONS ON PURE AND APPLIED MATHEMATICS, 2004, 57 (11) :1413-1457
[10]  
Hoyer PO, 2002, NEURAL NETWORKS FOR SIGNAL PROCESSING XII, PROCEEDINGS, P557, DOI 10.1109/NNSP.2002.1030067