Mapping Systolic Arrays onto 3D Circuit Structures: Accelerating Convolutional Neural Network Inference

被引:0
|
作者
Kung, H. T. [1 ]
McDanel, Bradley [1 ]
Zhang, Sai Qian [1 ]
机构
[1] Harvard Univ, Cambridge, MA 02138 USA
来源
PROCEEDINGS OF THE 2018 IEEE INTERNATIONAL WORKSHOP ON SIGNAL PROCESSING SYSTEMS (SIPS) | 2018年
关键词
systolic array; convolutional neural network (CNN); deep learning; inference latency; accelerator; 3D-IC implementation; wiring length; power consumption; FPGA;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In recent years, numerous designs have used systolic arrays to accelerate convolutional neural network (CNN) inference. In this work, we demonstrate that we can further speed up CNN inference and lower its power consumption by mapping systolic arrays onto 3D circuit structures as opposed to conventional 2D structures. Specifically, by operating in 3D space, a wide systolic array consisting of a number of subarrays can efficiently implement wide convolutional layers prevalent in state of the art CNNs. Additionally, by accumulating intermediate results along the third dimension, systolic arrays can process partitioned data channels in parallel with reduced data skew for lowered inference latency. We present a building block design using through-silicon vias (TSVs) for the 3D realization of systolic subarrays. We validate the 3D scheme using a 2.5D FPGA design and demonstrate that when mapped onto 3D structures wide systolic arrays can scale up in size without increasing wiring length in interconnecting subarrays. Further, by taking full advantage of 3D structures, we are able to pipeline inference across multiple layers of a CNN over a series of systolic arrays, dramatically reducing the inference time per input sample. These improvements lead to significantly reduced inference latency, which is especially important for real-time applications where it is common to process samples one at a time.
引用
收藏
页码:330 / 336
页数:7
相关论文
共 50 条
  • [11] Parametric cerebral blood flow and arterial transit time mapping using a 3D convolutional neural network
    Kim, Donghoon
    Lipford, Megan E. E.
    He, Hongjian
    Ding, Qiuping
    Ivanovic, Vladimir
    Lockhart, Samuel N. N.
    Craft, Suzanne
    Whitlow, Christopher T. T.
    Jung, Youngkyoo
    MAGNETIC RESONANCE IN MEDICINE, 2023, 90 (02) : 583 - 595
  • [12] RFNet: Convolutional Neural Network for 3D Point Cloud Classification
    Shan X.-Y.
    Sun Z.-L.
    Zeng Z.-G.
    Zidonghua Xuebao/Acta Automatica Sinica, 2023, 49 (11): : 2350 - 2359
  • [13] Efficient binary 3D convolutional neural network and hardware accelerator
    Guoqing Li
    Meng Zhang
    Qianru Zhang
    Zhijian Lin
    Journal of Real-Time Image Processing, 2022, 19 : 61 - 71
  • [14] Facial Expression Recognition Using 3D Convolutional Neural Network
    Byeon, Young-Hyen
    Kwak, Keun-Chang
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2014, 5 (12) : 107 - 112
  • [15] A Separate 3D Convolutional Neural Network Architecture for 3D Medical Image Semantic Segmentation
    Dong, Shidu
    Liu, Zhi
    Wang, Huaqiu
    Zhang, Yihao
    Cui, Shaoguo
    JOURNAL OF MEDICAL IMAGING AND HEALTH INFORMATICS, 2019, 9 (08) : 1705 - 1716
  • [16] Efficient binary 3D convolutional neural network and hardware accelerator
    Li, Guoqing
    Zhang, Meng
    Zhang, Qianru
    Lin, Zhijian
    JOURNAL OF REAL-TIME IMAGE PROCESSING, 2022, 19 (01) : 61 - 71
  • [17] High-performance Convolutional Neural Network Accelerator Based on Systolic Arrays and Quantization
    Li, Yufeng
    Lu, Shengli
    Luo, Jihe
    Pang, Wei
    Liu, Hao
    2019 IEEE 4TH INTERNATIONAL CONFERENCE ON SIGNAL AND IMAGE PROCESSING (ICSIP 2019), 2019, : 335 - 339
  • [18] Predicting 3D particles shapes based on 2D images by using convolutional neural network
    Giannis, Kostas
    Thon, Christoph
    Yang, Guoqing
    Kwade, Arno
    Schilde, Carsten
    POWDER TECHNOLOGY, 2024, 432
  • [19] Development of a Deep Convolutional Neural Network for the Prediction of Pavement Roughness from 3D Images
    Abohamer, Hossam
    Elseifi, Mostafa
    Dhakal, Nirmal
    Zhang, Zhongjie
    Fillastre, Christophe N.
    JOURNAL OF TRANSPORTATION ENGINEERING PART B-PAVEMENTS, 2021, 147 (04)
  • [20] Brain age estimation based on 3D MRI images using 3D convolutional neural network
    Pardakhti, Nastaran
    Sajedi, Hedieh
    MULTIMEDIA TOOLS AND APPLICATIONS, 2020, 79 (33-34) : 25051 - 25065