8T SRAM Cell as a Multibit Dot-Product Engine for Beyond Von Neumann Computing

被引:105
作者
Jaiswal, Akhilesh [1 ]
Chakraborty, Indranil [1 ]
Agrawal, Amogh [1 ]
Roy, Kaushik [2 ]
机构
[1] Purdue Univ, Sch Elect & Comp Engn, W Lafayette, IN 47907 USA
[2] Purdue Univ, Sch Elect Engn, W Lafayette, IN 47907 USA
基金
美国国家科学基金会;
关键词
Convolution; dot product; in-memory computing; SRAMs; von Neumann bottleneck;
D O I
10.1109/TVLSI.2019.2929245
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Large-scale digital computing almost exclusively relies on the von Neumann architecture, which comprises separate units for storage and computations. The energy-expensive transfer of data from the memory units to the computing cores results in the well-known von Neumann bottleneck. Various approaches aimed toward bypassing the von Neumann bottleneck are being extensively explored in the literature. These include in-memory computing based on CMOS and beyond CMOS technologies, wherein by making modifications to the memory array, vector computations can be carried out as close to the memory units as possible. Interestingly, in-memory techniques based on CMOS technology are of special importance due to the ubiquitous presence of field-effect transistors and the resultant ease of large-scale manufacturing and commercialization. On the other hand, perhaps the most important computation required for applications such as machine learning, etc., comprises the dot-product operation. Emerging nonvolatile memristive technologies have been shown to be very efficient in computing analog dot products in an in situ fashion. The memristive analog computation of the dot product results in much faster operation as opposed to digital vector in-memory bitwise Boolean computations. However, challenges with respect to large-scale manufacturing coupled with the limited endurance of memristors have hindered rapid commercialization of memristive-based computing solutions. In this paper, we show that the standard 8 transistor (8T) digital SRAM array can be configured as an analoglike in-memory multibit dot-product engine (DPE). By applying appropriate analog voltages to the read ports of the 8T SRAM array and sensing the output current, an approximate analog-digital DPE can be implemented. We present two different configurations for enabling multibit dot-product computations in the 8T SRAM cell array, without modifying the standard bit-cell structure. We also demonstrate the robustness of the present proposal in presence of nonidealities such as the effect of line resistances and transistor threshold voltage variations. Since our proposal preserves the standard 8T-SRAM array structure, it can be used as a storage element with standard read-write instructions and also as an on-demand analoglike dot-product accelerator.
引用
收藏
页码:2556 / 2567
页数:12
相关论文
共 35 条
  • [1] Agrawal A., 2017, ARXIV171205096
  • [2] [Anonymous], 2012, COMPUTER BRAIN
  • [3] Biswas A, 2018, ISSCC DIG TECH PAP I, P488, DOI 10.1109/ISSCC.2018.8310397
  • [4] 'Memristive' switches enable 'stateful' logic operations via material implication
    Borghetti, Julien
    Snider, Gregory S.
    Kuekes, Philip J.
    Yang, J. Joshua
    Stewart, Duncan R.
    Williams, R. Stanley
    [J]. NATURE, 2010, 464 (7290) : 873 - 876
  • [5] Technology Aware Training in Memristive Neuromorphic Systems for Nonideal Synaptic Crossbars
    Chakraborty, Indranil
    Roy, Deboleena
    Roy, Kaushik
    [J]. IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2018, 2 (05): : 335 - 344
  • [6] Chang L, 2005, 2005 Symposium on VLSI Technology, Digest of Technical Papers, P128
  • [7] A 5.3GHz 8T-SRAM with operation down to 0.41V in 65nm CMOS
    Chang, Leland
    Nakamura, Yutaka
    Montoye, Robert K.
    Sawada, Jim
    Martin, Andrew K.
    Kinoshita, Kiyofumi
    [J]. 2007 SYMPOSIUM ON VLSI CIRCUITS, DIGEST OF TECHNICAL PAPERS, 2007, : 252 - 253
  • [8] Changpinyo S., 2017, The Power of Sparsity in Convolutional Neural Networks
  • [9] PRIME: A Novel Processing-in-memory Architecture for Neural Network Computation in ReRAM-based Main Memory
    Chi, Ping
    Li, Shuangchen
    Xu, Cong
    Zhang, Tao
    Zhao, Jishen
    Liu, Yongpan
    Wang, Yu
    Xie, Yuan
    [J]. 2016 ACM/IEEE 43RD ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE (ISCA), 2016, : 27 - 39
  • [10] Dong Q, 2017, SYMP VLSI CIRCUITS, pC160, DOI 10.23919/VLSIC.2017.8008465