Efficient Mixed-Signal Neurocomputing Via Successive Integration and Rescaling

被引:18
作者
Bavandpour, Mohammad [1 ]
Sahay, Shubham [1 ]
Mahmoodi, Mohammad R. [1 ]
Strukov, Dmitri [1 ]
机构
[1] Univ Calif Santa Barbara, Dept Elect & Comp Engn, Santa Barbara, CA 93106 USA
关键词
1T-1R memory; memristor; mixed-signal computing; ReRAM; successive integration and rescaling (SIR) technique; time-domain computing; vector-by-matrix multiplier (VMM); MATRIX MULTIPLIER;
D O I
10.1109/TVLSI.2019.2946516
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
The widespread and ever-increasing demand for performing in situ inference, signal processing, and other computationally intensive applications in mobile Internet-of-Things (IoT) devices requires fast, compact, and energy-efficient vector-by-matrix multipliers (VMMs). The time-domain VMMs based on emerging nonvolatile memory devices exhibit significantly higher circuit density and energy efficiency than their current-mode counterparts. However, the load capacitors used to accumulate the weighted summation of the inputs in the time-domain-based circuits dominate their energy dissipation and footprint area. The true potential of the time-domain-based VMMs may be realized only when this overhead is minimized. To this end, in this brief, we propose a novel successive integration and rescaling (SIR) approach for implementing a highly efficient mixed-signal time-domain VMM for low-to-medium-precision computing. For a proof of concept, we quantitatively evaluated the performance of the proposed SIR VMM and compared it with the results of the conventional time-domain VMM, using a similar 1T-1R array. Preliminary simulation results for the 4-bit $200\, \times \, 200$ VMM, implemented using a 55-nm technology node, show area and energy efficiencies of 1.33 bits/m(2) and similar to 1.3 POp/J-the numbers, respectively, $\sim 2.5\times $ and $\sim 2.65\times $ higher than those for the prior-work time-domain VMM. Furthermore, we analyze the system-level performance of the proposed SIR VMM engine in the neuromorphic accelerator architectures and provide the preliminary estimates for various deep/recurrent neural network (DNN/RNN) applications.
引用
收藏
页码:823 / 827
页数:5
相关论文
共 29 条
[1]   PUMA: A Programmable Ultra-efficient Memristor-based Accelerator for Machine Learning Inference [J].
Ankit, Aayush ;
El Hajj, Izzat ;
Chalamalasetti, Sai Rahul ;
Ndu, Geoffrey ;
Foltin, Martin ;
Williams, R. Stanley ;
Faraboschi, Paolo ;
Hwu, Wen-mei ;
Strachan, John Paul ;
Roy, Kaushik ;
Milojicic, Dejan S. .
TWENTY-FOURTH INTERNATIONAL CONFERENCE ON ARCHITECTURAL SUPPORT FOR PROGRAMMING LANGUAGES AND OPERATING SYSTEMS (ASPLOS XXIV), 2019, :715-731
[2]  
[Anonymous], P IEEE SOI 3D SUBTHR
[3]  
[Anonymous], IEDM
[4]   Energy-Efficient Time-Domain Vector-by-Matrix Multiplier for Neurocomputing and Beyond [J].
Bavandpour, Mohammad ;
Mahmoodi, Mohammad Reza ;
Strukov, Dmitri B. .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II-EXPRESS BRIEFS, 2019, 66 (09) :1512-1516
[5]   3D-aCortex: an ultra-compact energy-efficient neurocomputing platform based on commercial 3D-NAND flash memories [J].
Bavandpour, Mohammad ;
Sahay, Shubham ;
Mahmoodi, Mohammad Reza ;
Strukov, Dmitri B. .
NEUROMORPHIC COMPUTING AND ENGINEERING, 2021, 1 (01)
[6]  
Binas J., 2016, ARXIV160607786
[7]  
Burr G.W., 2014, IEEE INT ELECT DEVIC, P1, DOI 10.1109/IEDM.2014.7047135
[8]   DaDianNao: A Machine-Learning Supercomputer [J].
Chen, Yunji ;
Luo, Tao ;
Liu, Shaoli ;
Zhang, Shijin ;
He, Liqiang ;
Wang, Jia ;
Li, Ling ;
Chen, Tianshi ;
Xu, Zhiwei ;
Sun, Ninghui ;
Temam, Olivier .
2014 47TH ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE (MICRO), 2014, :609-622
[9]   An Energy-Efficient One-Shot Time-Based Neural Network Accelerator Employing Dynamic Threshold Error Correction in 65 nm [J].
Everson, Luke R. ;
Liu, Muqing ;
Pande, Nakul ;
Kim, Chris H. .
IEEE JOURNAL OF SOLID-STATE CIRCUITS, 2019, 54 (10) :2777-2785
[10]   A Spatial Multi-Bit Sub-1-V Time-Domain Matrix Multiplier Interface for Approximate Computing in 65-nm CMOS [J].
Gopal, Srinivasan ;
Agarwal, Pawan ;
Baylon, Joe ;
Renaud, Luke ;
Ali, Sheikh Nijam ;
Pande, Partha Pratim ;
Heo, Deukhyoun .
IEEE JOURNAL ON EMERGING AND SELECTED TOPICS IN CIRCUITS AND SYSTEMS, 2018, 8 (03) :506-518