Mapping Systolic Arrays onto 3D Circuit Structures: Accelerating Convolutional Neural Network Inference

Cited by: 0
Authors
Kung, H. T. [1]
McDanel, Bradley [1]
Zhang, Sai Qian [1]
Affiliations
[1] Harvard Univ, Cambridge, MA 02138 USA
Source
PROCEEDINGS OF THE 2018 IEEE INTERNATIONAL WORKSHOP ON SIGNAL PROCESSING SYSTEMS (SIPS) | 2018
Keywords
systolic array; convolutional neural network (CNN); deep learning; inference latency; accelerator; 3D-IC implementation; wiring length; power consumption; FPGA;
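DOI
Not available
Chinese Library Classification (CLC) codes
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification codes
0808; 0809
Abstract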
In recent years, numerous designs have used systolic arrays to accelerate convolutional neural network (CNN) inference. In this work, we demonstrate that we can further speed up CNN inference and lower its power consumption by mapping systolic arrays onto 3D circuit structures as opposed to conventional 2D structures. Specifically, by operating in 3D space, a wide systolic array consisting of a number of subarrays can efficiently implement the wide convolutional layers prevalent in state-of-the-art CNNs. Additionally, by accumulating intermediate results along the third dimension, systolic arrays can process partitioned data channels in parallel with reduced data skew, lowering inference latency. We present a building block design using through-silicon vias (TSVs) for the 3D realization of systolic subarrays. We validate the 3D scheme using a 2.5D FPGA design and demonstrate that, when mapped onto 3D structures, wide systolic arrays can scale up in size without increasing the wiring length needed to interconnect subarrays. Further, by taking full advantage of 3D structures, we are able to pipeline inference across multiple layers of a CNN over a series of systolic arrays, dramatically reducing the inference time per input sample. These improvements lead to significantly reduced inference latency, which is especially important for real-time applications where it is common to process samples one at a time.
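The channel-partitioning idea in the abstract can be illustrated with a small software sketch. It models only the arithmetic: the input channels are split into groups, each group is multiplied by its slice of the weights (standing in for one subarray), and the partial results are summed (standing in for accumulation along the third dimension). The function names, the `num_tiers` parameter, and the toy matrices are assumptions made for illustration; nothing here reflects the paper's actual hardware design, timing, or TSV wiring.

```python
# Illustrative sketch only: names, sizes, and data are assumptions,
# not taken from the paper.

def matmul(weights, data):
    """Plain matrix product.

    weights: (filters x channels), data: (channels x pixels).
    """
    rows, inner, cols = len(weights), len(data), len(data[0])
    return [
        [sum(weights[i][k] * data[k][j] for k in range(inner)) for j in range(cols)]
        for i in range(rows)
    ]


def partitioned_matmul(weights, data, num_tiers):
    """Split the channel dimension into num_tiers groups and sum the partial products.

    Each group stands in for one subarray; the running sum stands in for
    accumulation across tiers.
    """
    channels = len(data)
    assert channels % num_tiers == 0, "channels must divide evenly across tiers"
    group = channels // num_tiers
    rows, cols = len(weights), len(data[0])
    acc = [[0] * cols for _ in range(rows)]
    for t in range(num_tiers):
        lo, hi = t * group, (t + 1) * group
        w_slice = [row[lo:hi] for row in weights]  # weights for this channel group
        d_slice = data[lo:hi]                      # data channels for this group
        partial = matmul(w_slice, d_slice)
        for i in range(rows):
            for j in range(cols):
                acc[i][j] += partial[i][j]
    return acc


# Toy example: 2 filters, 4 input channels, 2 pixel positions.
weights = [[1, 2, 3, 4], [0, -1, 2, 1]]
data = [[1, 0], [2, 1], [0, 3], [1, 1]]

full = matmul(weights, data)
tiered = partitioned_matmul(weights, data, num_tiers=2)
print(full == tiered)
```

The two results agree because a dot product over all channels equals the sum of dot products over disjoint channel groups, which is what lets the groups be processed in parallel before accumulation.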
Pages: 330-336
Page count: 7