Mapping Systolic Arrays onto 3D Circuit Structures: Accelerating Convolutional Neural Network Inference

Cited by: 0
Authors
Kung, H. T. [1]
McDanel, Bradley [1]
Zhang, Sai Qian [1]
Affiliation
[1] Harvard Univ, Cambridge, MA 02138 USA
Source
PROCEEDINGS OF THE 2018 IEEE INTERNATIONAL WORKSHOP ON SIGNAL PROCESSING SYSTEMS (SIPS) | 2018
Keywords
systolic array; convolutional neural network (CNN); deep learning; inference latency; accelerator; 3D-IC implementation; wiring length; power consumption; FPGA
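DOI
Not available
Chinese Library Classification (CLC) code
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification code
0808; 0809
Abstract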
In recent years, numerous designs have used systolic arrays to accelerate convolutional neural network (CNN) inference. In this work, we demonstrate that we can further speed up CNN inference and lower its power consumption by mapping systolic arrays onto 3D circuit structures as opposed to conventional 2D structures. Specifically, by operating in 3D space, a wide systolic array consisting of a number of subarrays can efficiently implement the wide convolutional layers prevalent in state-of-the-art CNNs. Additionally, by accumulating intermediate results along the third dimension, systolic arrays can process partitioned data channels in parallel with reduced data skew for lowered inference latency. We present a building block design using through-silicon vias (TSVs) for the 3D realization of systolic subarrays. We validate the 3D scheme using a 2.5D FPGA design and demonstrate that, when mapped onto 3D structures, wide systolic arrays can scale up in size without increasing the wiring length needed to interconnect subarrays. Further, by taking full advantage of 3D structures, we are able to pipeline inference across multiple layers of a CNN over a series of systolic arrays, dramatically reducing the inference time per input sample. These improvements lead to significantly reduced inference latency, which is especially important for real-time applications where it is common to process samples one at a time.
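The idea of partitioning data channels across subarrays and accumulating intermediate results along the third dimension can be illustrated with a small numerical sketch. The code below is not the authors' design. It is a minimal pure-Python model in which every function name (`matmul`, `split_channels`, `tiered_matmul`, `max_input_skew`) is hypothetical, and the skew model (row i of a subarray receives its input i cycles after row 0) is a simplifying assumption rather than a figure from the paper.

```python
# Illustrative sketch only: all names and the skew model are assumptions,
# not taken from the paper.

def matmul(weights, inputs):
    """Plain matrix product: weights is M x K, inputs is K x N."""
    rows, inner, cols = len(weights), len(inputs), len(inputs[0])
    return [[sum(weights[m][k] * inputs[k][n] for k in range(inner))
             for n in range(cols)] for m in range(rows)]

def split_channels(num_channels, num_tiers):
    """Split channel indices 0..K-1 into contiguous groups, one per tier."""
    size = -(-num_channels // num_tiers)  # ceiling division
    return [range(start, min(start + size, num_channels))
            for start in range(0, num_channels, size)]

def tiered_matmul(weights, inputs, num_tiers):
    """Each tier handles one channel group; partial sums are added across tiers."""
    rows, cols = len(weights), len(inputs[0])
    result = [[0] * cols for _ in range(rows)]
    for group in split_channels(len(inputs), num_tiers):
        w_part = [[row[k] for k in group] for row in weights]
        x_part = [inputs[k] for k in group]
        partial = matmul(w_part, x_part)  # work done by one subarray
        for m in range(rows):
            for n in range(cols):
                result[m][n] += partial[m][n]  # accumulation across tiers
    return result

def max_input_skew(num_channels, num_tiers):
    """Assumed toy model: row i of a subarray receives its data i cycles late."""
    return max(len(g) for g in split_channels(num_channels, num_tiers)) - 1

if __name__ == "__main__":
    w = [[1, 2, 3, 4], [5, 6, 7, 8]]
    x = [[1, 0], [0, 1], [2, 2], [3, 1]]
    print(tiered_matmul(w, x, 2) == matmul(w, x))
    print(max_input_skew(4, 1), max_input_skew(4, 2))
```

Under these assumptions, splitting the channels over more tiers leaves the final sums unchanged while shortening the longest per-subarray skew, which is the qualitative effect the abstract describes. Actual latency and wiring figures depend on the hardware design and are not modeled here.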
Pages: 330-336
Page count: 7