Neural Cache: Bit-Serial In-Cache Acceleration of Deep Neural Networks

被引:8
|
作者
Eckert, Charles [1 ]
Wang, Xiaowei [1 ]
Wang, Jingcheng [2 ]
Subramaniyan, Arun [1 ]
Iyer, Ravi [3 ]
Sylvester, Dennis [4 ]
Blaauw, David [5 ]
Das, Reetuparna [1 ]
机构
[1] Univ Michigan, Dept Comp Sci & Engn, Ann Arbor, MI 48109 USA
[2] Univ Michigan, Dept Elect Engn & Comp Sci, Ann Arbor, MI 48109 USA
[3] Intel Corp, Santa Clara, CA 95051 USA
[4] Univ Michigan, Elect Engn & Comp Sci, Ann Arbor, MI 48109 USA
[5] Univ Michigan, Ann Arbor, MI 48109 USA
关键词
D O I
10.1109/MM.2019.2908101
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
This article presents Neural Cache architecture, which repurposes cache structures to transform them into massively parallel compute units capable of running inferences for deep neural networks. Techniques to do in situ arithmetic in SRAM arrays create efficient data mapping, and reducing data movement is proposed. Neural Cache architecture is capable of fully executing convolutional, fully connected, and pooling layers in cache. Our experimental results show that the proposed architecture can improve efficiency over a GPU by 128 x while requiring a minimal area overhead of 2%.
引用
收藏
页码:11 / 19
页数:9
相关论文
共 50 条
  • [1] Neural Cache: Bit-Serial In-Cache Acceleration of Deep Neural Networks
    Eckert, Charles
    Wang, Xiaowei
    Wang, Jingcheng
    Subramaniyan, Arun
    Iyer, Ravi
    Sylvester, Dennis
    Blaauw, David
    Das, Reetuparna
    2018 ACM/IEEE 45TH ANNUAL INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE (ISCA), 2018, : 383 - 396
  • [2] Bit Prudent In-Cache Acceleration of Deep Convolutional Neural Networks
    Wang, Xiaowei
    Yu, Jiecao
    Augustine, Charles
    Iyer, Ravi
    Das, Reetuparna
    2019 25TH IEEE INTERNATIONAL SYMPOSIUM ON HIGH PERFORMANCE COMPUTER ARCHITECTURE (HPCA), 2019, : 81 - 93
  • [3] Bit-Serial Cache: Exploiting Input Bit Vector Repetition to Accelerate Bit-Serial Inference
    Lo, Yun-Chen
    Liu, Ren-Shuo
    2023 60TH ACM/IEEE DESIGN AUTOMATION CONFERENCE, DAC, 2023,
  • [4] Stripes: Bit-Serial Deep Neural Network Computing
    Judd, Patrick
    Albericio, Jorge
    Hetherington, Tayler
    Aamodt, Tor M.
    Moshovos, Andreas
    2016 49TH ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE (MICRO), 2016,
  • [5] Stripes: Bit-Serial Deep Neural Network Computing
    Judd, Patrick
    Albericio, Jorge
    Moshovos, Andreas
    IEEE COMPUTER ARCHITECTURE LETTERS, 2017, 16 (01) : 80 - 83
  • [6] Analyzing Cache Side Channels Using Deep Neural Networks
    Zhang, Tianwei
    Zhang, Yinqian
    Lee, Ruby B.
    34TH ANNUAL COMPUTER SECURITY APPLICATIONS CONFERENCE (ACSAC 2018), 2018, : 174 - 186
  • [7] BitSET: Bit-Serial Early Termination for Computation Reduction in Convolutional Neural Networks
    Pan, Yunjie
    Yu, Jiecao
    Lukefahr, Andrew
    Das, Reetuparna
    Mahlke, Scott
    ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2023, 22 (05)
  • [8] Bit-serial systolic accelerator design for convolution operations in convolutional neural networks
    Li, Lin
    Hu, Jianhao
    Huang, Qiu
    Zhou, Wanting
    IEICE ELECTRONICS EXPRESS, 2020, 17 (20):
  • [9] Application of neural networks to cache replacement
    Khalid, H
    Obaidat, MS
    NEURAL COMPUTING & APPLICATIONS, 1999, 8 (03): : 246 - 256
  • [10] Application of Bit-Serial Arithmetic Units for FPGA Implementation of Convolutional Neural Networks
    Csordas, G.
    Feher, B.
    Kovacshazy, T.
    2018 19TH INTERNATIONAL CARPATHIAN CONTROL CONFERENCE (ICCC), 2018, : 322 - 327