Mapping Systolic Arrays onto 3D Circuit Structures: Accelerating Convolutional Neural Network Inference

被引：0

作者：

Kung, H. T. ^{[1
]}

McDanel, Bradley ^{[1
]}

Zhang, Sai Qian ^{[1
]}

机构：

[1] Harvard Univ, Cambridge, MA 02138 USA

来源：

PROCEEDINGS OF THE 2018 IEEE INTERNATIONAL WORKSHOP ON SIGNAL PROCESSING SYSTEMS (SIPS) | 2018年

关键词：

systolic array; convolutional neural network (CNN); deep learning; inference latency; accelerator; 3D-IC implementation; wiring length; power consumption; FPGA;

D O I：

暂无

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

In recent years, numerous designs have used systolic arrays to accelerate convolutional neural network (CNN) inference. In this work, we demonstrate that we can further speed up CNN inference and lower its power consumption by mapping systolic arrays onto 3D circuit structures as opposed to conventional 2D structures. Specifically, by operating in 3D space, a wide systolic array consisting of a number of subarrays can efficiently implement wide convolutional layers prevalent in state of the art CNNs. Additionally, by accumulating intermediate results along the third dimension, systolic arrays can process partitioned data channels in parallel with reduced data skew for lowered inference latency. We present a building block design using through-silicon vias (TSVs) for the 3D realization of systolic subarrays. We validate the 3D scheme using a 2.5D FPGA design and demonstrate that when mapped onto 3D structures wide systolic arrays can scale up in size without increasing wiring length in interconnecting subarrays. Further, by taking full advantage of 3D structures, we are able to pipeline inference across multiple layers of a CNN over a series of systolic arrays, dramatically reducing the inference time per input sample. These improvements lead to significantly reduced inference latency, which is especially important for real-time applications where it is common to process samples one at a time.

引用

页码：330 / 336

页数：7

共 50 条

[31] Computed Tomography Image Enhancement Using 3D Convolutional Neural Network
Li, Meng
Shen, Shiwen
Gao, Wen
Hsu, William
Cong, Jason
DEEP LEARNING IN MEDICAL IMAGE ANALYSIS AND MULTIMODAL LEARNING FOR CLINICAL DECISION SUPPORT, DLMIA 2018, 2018, 11045 : 291 - 299
[32] Deep convolutional neural network for 3D mineral identification and liberation analysis
Tang, Kunning
Da Wang, Ying
Mostaghimi, Peyman
Knackstedt, Mark
Hargrave, Chad
Armstrong, Ryan T.
MINERALS ENGINEERING, 2022, 183
[33] Research on 3D Convolutional Neural Network and Its Application to Video Understanding
Bai, Jing
Yang, Zhanyuan
Peng, Bin
Li, Wenjing
JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2023, 45 (06) : 2273 - 2283
[34] Violence Detection Using Spatiotemporal Features with 3D Convolutional Neural Network
Ullah, Fath U. Min
Ullah, Amin
Muhammad, Khan
Ul Haq, Ijaz
Baik, Sung Wook
SENSORS, 2019, 19 (11)
[35] 1D-FALCON: Accelerating Deep Convolutional Neural Network Inference by Co-optimization of Models and Underlying Arithmetic Implementation
Maji, Partha
Mullins, Robert
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, PT II, 2017, 10614 : 21 - 29
[36] Brain Age Estimation using Brain MRI and 3D Convolutional Neural Network
Pardakhti, Nastsrsn
Sajedi, Hedieh
2019 9TH INTERNATIONAL CONFERENCE ON COMPUTER AND KNOWLEDGE ENGINEERING (ICCKE 2019), 2019, : 386 - 390
[37] Working activity recognition approach based on 3D deep convolutional neural network
Liu T.
Lu Z.
Sun Y.
Liu F.
He B.
Zhong J.
Jisuanji Jicheng Zhizao Xitong/Computer Integrated Manufacturing Systems, CIMS, 2020, 26 (08): : 2143 - 2156
[38] 3D Map Optimization with Fully Convolutional Neural Network and Dynamic Local NDT
Shen, Zebang
Xu, Yichong
Sun, Muchen
Carballo, Alexander
Zhou, Qingguo
2019 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE (ITSC), 2019, : 4404 - 4411
[39] Dynamic Hand Gesture Recognition Based on 3D Convolutional Neural Network Models
Zhang, Wenjin
Wang, Jiacun
PROCEEDINGS OF THE 2019 IEEE 16TH INTERNATIONAL CONFERENCE ON NETWORKING, SENSING AND CONTROL (ICNSC 2019), 2019, : 224 - 229
[40] A Convolutional Neural Network based 3D Ball Tracking by Detection in Soccer Videos
Kamble, Paresh R.
Keskar, Avinash G.
Bhurchandi, Kishor M.
ELEVENTH INTERNATIONAL CONFERENCE ON MACHINE VISION (ICMV 2018), 2019, 11041

← 1 2 3 4 5 →